[Athen] AI tool question

Top Tech Tidbits via athen-list athen-list at u.washington.edu
Sat Feb 1 16:00:20 PST 2025

I could not agree with you more, Debee. While there isn’t yet a widely known tool that fully automates describing every image in a Word document or PDF at once, there are some promising approaches and partial solutions:

1. Microsoft 365’s Accessibility Checker & Alt Text Auto-Generation

* Word and PowerPoint in Microsoft 365 have an automatic alt text generation feature. It can generate descriptions for images, but it requires manual review and editing.
* You can run the Accessibility Checker (Review > Check Accessibility) to find missing alt text and fill in some of it automatically.
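
For anyone who wants to see which images still lack alt text before opening Word, here is a minimal Python sketch of the same idea. It relies on the fact that a .docx file is a ZIP archive and that Word stores alt text in the `descr` attribute of each `wp:docPr` element in `word/document.xml`. The function name `find_missing_alt_text` is my own, and the check is cruder than Word's Accessibility Checker: it flags every drawing without a description, including shapes and images deliberately marked as decorative.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML drawing namespace; wp:docPr carries the alt text in "descr".
WP_NS = "http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing"


def find_missing_alt_text(docx_path):
    """Return the names of drawings in a .docx whose alt text is empty or absent."""
    with zipfile.ZipFile(docx_path) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    missing = []
    for doc_pr in root.iter(f"{{{WP_NS}}}docPr"):
        # A missing or whitespace-only "descr" attribute means no usable alt text.
        if not doc_pr.get("descr", "").strip():
            missing.append(doc_pr.get("name", "(unnamed)"))
    return missing
```

Calling `find_missing_alt_text("handout.docx")` (a hypothetical file name) would return a list such as the picture names Word assigned, which you can then work through by hand or feed into a description workflow.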

2. Adobe Acrobat Pro (for PDFs)

* Acrobat Pro has an "Autotag Document" command in its accessibility tools. It tags images as figures, but any descriptions it produces are basic and usually need to be written or improved by hand.
* You can also extract all images and process them separately with an AI tool.

3. Seeing AI & Lookout (for scanning documents)

* If the document is printed or saved as an image-based PDF, Seeing AI (iOS) and Google Lookout (Android) can scan pages and read out descriptions of images alongside the text.

4. Custom AI Workflows (Python + GPT-based models)

* If you're open to a more technical approach, you can extract images from a document using Python (with PyMuPDF or pdf2image for PDFs, or python-docx for Word) and run them through AI vision models like OpenAI's GPT-4V, Google Vision AI, or Microsoft Azure Computer Vision to generate descriptions automatically.
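
As a rough illustration of that workflow for Word files, here is a minimal sketch. To keep it dependency-free it does not use python-docx; it reads the .docx directly as a ZIP archive, where embedded images live under `word/media/`. The names `extract_docx_images`, `describe_all_images`, and `placeholder_describer` are hypothetical, and `describe_image` is a callable you would supply yourself to wrap whichever vision model you choose. No real service API is shown here.

```python
import zipfile
from pathlib import PurePosixPath

# Common raster formats only; vector formats such as EMF/WMF are skipped here.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".bmp", ".tif", ".tiff"}


def extract_docx_images(docx_path):
    """Yield (filename, raw_bytes) for each image embedded in a .docx file."""
    with zipfile.ZipFile(docx_path) as archive:
        for name in sorted(archive.namelist()):
            path = PurePosixPath(name)
            if name.startswith("word/media/") and path.suffix.lower() in IMAGE_EXTENSIONS:
                yield path.name, archive.read(name)


def describe_all_images(docx_path, describe_image):
    """Run every embedded image through describe_image and collect the results.

    describe_image is a hypothetical callable (bytes -> str) that you supply;
    it would wrap whichever vision model or service you choose.
    """
    return {name: describe_image(data) for name, data in extract_docx_images(docx_path)}


def placeholder_describer(image_bytes):
    """Stand-in for a real vision model call; reports only the image size."""
    return f"[description pending: {len(image_bytes)} byte image]"
```

Note that the files in `word/media/` are sorted by name, which is not necessarily the order the images appear in the document, and that every generated description would still need human review before it is written back as alt text.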

5. Be My Eyes (AI-Powered Virtual Assistant Mode)

* The AI-powered "Be My AI" feature in Be My Eyes (originally launched as "Virtual Volunteer" and powered by GPT-4V) allows users to upload an entire document with images and get descriptions, though the process is still not fully automated.

Possible Future Solutions

It would be amazing if JAWS, NVDA, or another screen reader implemented a feature where you could just press a button, and it would automatically describe all images in a document in one go. Hopefully, developers will take note of this need!

Aaron Di Blasi, PMP

From: athen-list <athen-list-bounces at mailman12.u.washington.edu> On Behalf Of Deborah Armstrong via athen-list
Sent: Saturday, February 1, 2025 5:04 PM
To: Access Technology Higher Education Network <athen-list at u.washington.edu>
Subject: [Athen] AI tool question

Has anyone found a tool that will automatically describe all pictures in a Word document or PDF, such as a class handout, a slide deck or a textbook chapter?

I know JAWS has a great Picture Smart AI feature that lets you locate a graphic on a web page or in a document and have it thoroughly described, but the user has to locate the picture, focus on it, and hit the right keystrokes.

And users of other screen readers can download the free Be My Eyes app for Windows to do the same thing.

A variety of iPhone and Android apps also describe pictures and scenes for the visually impaired, including Seeing AI, Lookout, Speak-A-Boo, Focus Assist and Be My Eyes. And of course the Meta smart glasses are super for this as well if properly prompted.

But I know of no tool that has automated this for an entire document.

It would be so cool if such a tool existed.

--Debee
