Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Toward Independent Online Shopping of the Visually Impaired Through Voice-based Computer-Using Agent
Subin Shin, Jeesun Oh, Suhyun Kim, Seoyeon Eom, Sangwon Lee · 2026 · Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems (CHI '26)
This CHI 2026 paper investigates how visually impaired users might shop online independently by interacting with a voice-based Computer-Using Agent (CUA) — an AI agent built on a Large Multimodal Model (LMM) that can perceive a screen, reason about its contents, and manipulate a…
visual impairment · blindness · low vision · voice interface · conversational user interfaces
Probing the Gaps in ChatGPT's Live Video Chat for Real-World Assistance for People who are Blind or Visually Impaired
Ruei-Che Chang, Rosiana Natalie, Wenqian Xu, Jovan Zheng Feng Yap, Anhong Guo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper evaluates ChatGPT's Advanced Voice with Video feature — OpenAI's state-of-the-art live video AI released in December 2024 — as a real-world assistive tool for blind and visually impaired (BVI) individuals. The researchers conducted an in-person exploratory study with…
blind · visually impaired · large multimodal models · live video · ChatGPT
EditScribe: Non-Visual Image Editing with Natural Language Verification Loops
Ruei-Che Chang, Yuxuan Liu, Lotus Zhang, Anhong Guo · 2024 · ASSETS '24: Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces EditScribe, a prototype system that makes image editing accessible to blind and low vision (BLV) users through natural language interaction powered by large multimodal models (LMMs). Image editing is inherently visual and iterative — users need to see the…
blind and low vision · image editing · generative AI · large multimodal models · natural language interaction

3 results.
