Share this article

Latest news

With KB5043178 to Release Preview Channel, Microsoft advises Windows 11 users to plug in when the battery is low

Copilot in Outlook will generate personalized themes for you to customize the app

Microsoft will raise the price of its 365 Suite to include AI capabilities

Death Stranding Director’s Cut is now Xbox X|S at a huge discount

Outlook will let users create custom account icons so they can tell their accounts apart easier

Firefox Nightly introduces Alt text generation for enhanced web accessibility

This could be beneficial for people with screen readers

4 min. read

Published onJune 4, 2024

published onJune 4, 2024

Share this article

Read our disclosure page to find out how can you help Windows Report sustain the editorial teamRead more

Mozilla’sFirefox Nightlyversion is working on an experimental feature that will be added to a PDF editor to improve web accessibility for all users.

The feature automatically generates alternative text for images using private on-device AI models. It is all set to be included in Firefox 130, which can empower users with screen readers to understand images better across the web, thereby facilitating a more inclusive browsing experience.

The importance of Alt Text

Alt text is an important part of web accessibility. It provides textual descriptions of images, enabling individuals using assistive technologies such as screen readers to comprehend visual content.

Even if proven significant, many web pages don’t have alt text, making them inaccessible to visually impaired users. According to theWeb Almanac’s 2022 report, nearly 50% of images on the web don’t have alt text.

Addressing the issue

To address this problem, Mozilla uses a Transformer-based machine learning model to describe image content accurately. In a recent blog on Mozilla Hacks, Tarek ZIade said:

These models are getting good at describing the contents of the image, yet are compact enough to operate on devices with limited resources. While can’t outperform a large language model likeGPT-4 Turbo with Vision, orLLaVA, they are sufficiently accurate to provide valuable insights on-device across a diversity of hardware.

Model architectures likeBLIPor evenVITthat were trained on datasets likeCOCO(Common Object In Context) orFlickr30kare good at identifying objects in an image. When combined with a text decoder like OpenAI’sGPT-2, they can produce alternative text with 200M or fewer parameters. Once quantized, these models can be under 200MB on disk, and run in a couple of seconds on a laptop – a big reduction compared to the gigabytes and resources an LLM requires.

Enhancing performance and integration

The experimental feature is integrated into Firefox Nightly’s PDF editor, which signifies an important step towards broader implementation across general browsing to ensure future accessibility for all web users.

By harnessing the capabilities of small open-source models, Mozilla ensures privacy, resource efficiency, and increased transparency. These models work entirely within the device, so users’ data is not transmitted to external servers, and their resource efficiency reduces the environmental impact.

Mozilla extends Firefox Nightly’s infrastructure, thereby adapting the Translations inference architecture to include alt text generation. By using the ONNX runtime and Transformers.js library, Mozilla seamlessly integrates and optimizes model caching within the browser environment for better performance.

What’s in the future?

Mozilla aims to reduce biases and improve alt text accuracy by leveraging ViT (Vision Transformer) + DistilGPT-2 architecture and refining training datasets.

Tarek Ziadé also highlighted that Firefox can incorporate an image into a PDF using a popular open-source pdf.js library.

In Firefox 130, PDF.js will automatically generate alt text for the images added to PDFs, allowing users to validate them.

Thus whenever an image is added, Mozilla gets an array of pixels, which are then passed to the ML engine. After a few seconds, you will get a string corresponding to a description of this image.

Initially, when the user adds an image, there might be a delay in downloading the model; however, with time and usage, the process will speed up as the model is stored locally.

In the future, Mozilla aims to provide alt text for any image in PDFs except images with just text.

Mozilla also plans to continuously work on enhancing the alt text generator with input and collaboration from the community. Once it works well with PDF.js, Mozilla hopes to make the feature available in general browsers for users with screen readers.

What do you think about this feature? Share your views with our readers in the comments section below.

More about the topics:Firefox

Srishti Sisodia

Windows Software Expert

Srishti Sisodia is an electronics engineer and writer with a passion for technology. She has extensive experience exploring the latest technological advancements and sharing her insights through informative blogs.

Her diverse interests bring a unique perspective to her work, and she approaches everything with commitment, enthusiasm, and a willingness to learn. That’s why she’s part of Windows Report’s Reviewers team, always willing to share the real-life experience with any software or hardware product. She’s also specialized in Azure, cloud computing, and AI.

User forum

0 messages

Sort by:LatestOldestMost Votes

Comment*

Name*

Email*

Commenting as.Not you?

Save information for future comments

Comment

Δ

Srishti Sisodia

Windows Software Expert

She is an electronics engineer and writer with a passion for technology. Srishti is specialized in Azure, cloud computing, and AI.