New accessibility feature coming to Firefox, an “AI powered” alt-text generator.


"Starting in Firefox 130, we will automatically generate an alt text and let the user validate it. So every time an image is added, we get an array of pixels we pass to the ML engine and a few seconds after, we get a string corresponding to a description of this image (see the code).

Our alt text generator is far from perfect, but we want to take an iterative approach and improve it in the open.

We are currently working on improving the image-to-text datasets and model with what we’ve described in this blog post…"

  • Zworf@beehaw.org
    link
    fedilink
    arrow-up
    1
    ·
    6 months ago

    One thing I’d love to see in Firefox is a way to offload the translation engine to my local ollama server. This way I can get much better translations but still have everything private.

  • leanleft@lemmy.ml
    link
    fedilink
    English
    arrow-up
    1
    ·
    5 months ago

    There are way more companies who want to text-mine user content than there are blind people using the internet to read my content.

    • InfiniWheel@lemmy.one
      link
      fedilink
      arrow-up
      0
      ·
      6 months ago

      This is actually one of the few cases where it makes sense. Its for alt-text for people who browse with TTS

      • rho50@lemmy.nz
        link
        fedilink
        arrow-up
        0
        ·
        6 months ago

        Yeah, this is actually a pretty great application for AI. It’s local, privacy-preserving and genuinely useful for an underserved demographic.

        One of the most wholesome and actually useful applications for LLMs/CLIP that I’ve seen.