Share this article
Latest news
With KB5043178 to Release Preview Channel, Microsoft advises Windows 11 users to plug in when the battery is low
Copilot in Outlook will generate personalized themes for you to customize the app
Microsoft will raise the price of its 365 Suite to include AI capabilities
Death Stranding Director’s Cut is now Xbox X|S at a huge discount
Outlook will let users create custom account icons so they can tell their accounts apart easier
Firefox Nightly introduces Alt text generation for enhanced web accessibility
This could be beneficial for people with screen readers
4 min. read
Published onJune 4, 2024
published onJune 4, 2024
Share this article
Read our disclosure page to find out how can you help Windows Report sustain the editorial teamRead more
Mozilla’sFirefox Nightlyversion is working on an experimental feature that will be added to a PDF editor to improve web accessibility for all users.
The feature automatically generates alternative text for images using private on-device AI models. It is all set to be included in Firefox 130, which can empower users with screen readers to understand images better across the web, thereby facilitating a more inclusive browsing experience.
The importance of Alt Text
Alt text is an important part of web accessibility. It provides textual descriptions of images, enabling individuals using assistive technologies such as screen readers to comprehend visual content.
Even if proven significant, many web pages don’t have alt text, making them inaccessible to visually impaired users. According to theWeb Almanac’s 2022 report, nearly 50% of images on the web don’t have alt text.
Addressing the issue
To address this problem, Mozilla uses a Transformer-based machine learning model to describe image content accurately. In a recent blog on Mozilla Hacks, Tarek ZIade said:
These models are getting good at describing the contents of the image, yet are compact enough to operate on devices with limited resources. While can’t outperform a large language model likeGPT-4 Turbo with Vision, orLLaVA, they are sufficiently accurate to provide valuable insights on-device across a diversity of hardware.
Model architectures likeBLIPor evenVITthat were trained on datasets likeCOCO(Common Object In Context) orFlickr30kare good at identifying objects in an image. When combined with a text decoder like OpenAI’sGPT-2, they can produce alternative text with 200M or fewer parameters. Once quantized, these models can be under 200MB on disk, and run in a couple of seconds on a laptop – a big reduction compared to the gigabytes and resources an LLM requires.
Enhancing performance and integration
The experimental feature is integrated into Firefox Nightly’s PDF editor, which signifies an important step towards broader implementation across general browsing to ensure future accessibility for all web users.
By harnessing the capabilities of small open-source models, Mozilla ensures privacy, resource efficiency, and increased transparency. These models work entirely within the device, so users’ data is not transmitted to external servers, and their resource efficiency reduces the environmental impact.
Mozilla extends Firefox Nightly’s infrastructure, thereby adapting the Translations inference architecture to include alt text generation. By using the ONNX runtime and Transformers.js library, Mozilla seamlessly integrates and optimizes model caching within the browser environment for better performance.
What’s in the future?
Mozilla aims to reduce biases and improve alt text accuracy by leveraging ViT (Vision Transformer) + DistilGPT-2 architecture and refining training datasets.
Tarek Ziadé also highlighted that Firefox can incorporate an image into a PDF using a popular open-source pdf.js library.
In Firefox 130, PDF.js will automatically generate alt text for the images added to PDFs, allowing users to validate them.
Thus whenever an image is added, Mozilla gets an array of pixels, which are then passed to the ML engine. After a few seconds, you will get a string corresponding to a description of this image.
Initially, when the user adds an image, there might be a delay in downloading the model; however, with time and usage, the process will speed up as the model is stored locally.
In the future, Mozilla aims to provide alt text for any image in PDFs except images with just text.
Mozilla also plans to continuously work on enhancing the alt text generator with input and collaboration from the community. Once it works well with PDF.js, Mozilla hopes to make the feature available in general browsers for users with screen readers.
What do you think about this feature? Share your views with our readers in the comments section below.
More about the topics:Firefox
Srishti Sisodia
Windows Software Expert
Srishti Sisodia is an electronics engineer and writer with a passion for technology. She has extensive experience exploring the latest technological advancements and sharing her insights through informative blogs.
Her diverse interests bring a unique perspective to her work, and she approaches everything with commitment, enthusiasm, and a willingness to learn. That’s why she’s part of Windows Report’s Reviewers team, always willing to share the real-life experience with any software or hardware product. She’s also specialized in Azure, cloud computing, and AI.
User forum
0 messages
Sort by:LatestOldestMost Votes
Comment*
Name*
Email*
Commenting as.Not you?
Save information for future comments
Comment
Δ
Srishti Sisodia
Windows Software Expert
She is an electronics engineer and writer with a passion for technology. Srishti is specialized in Azure, cloud computing, and AI.