Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

AI4XR: AI in Extended Reality for 3D Scene Editing and Accessibility Design
Junlong Chen · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26) — Doctoral Consortium
This CHI '26 Doctoral Consortium paper summarises Junlong Chen's PhD research at the University of Cambridge on integrating AI — specifically large language models (LLMs) and vision-language models (VLMs) — into extended reality (XR) workflows. The research covers two…
extended reality · virtual reality · artificial intelligence · large language models · vision-language models
Measuring the Semantic Accessibility Gap in LLM-Generated Web UIs
Tommaso Calo, Alexandra-Elena Gurita, Luigi De Russis · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)
Calo, Gurita, and De Russis investigate a blind spot of mainstream automated accessibility tools: while scanners like Axe-core reliably catch missing alt attributes or unlabelled form fields (syntactic violations), they cannot tell whether the values present are actually…
web accessibility · large language models · LLM code generation · semantic accessibility · WCAG
Auto-Generating Personas from User Reviews in VR App Stores
Yi Wang, Kexin Cheng, Xiao Liu, Chetan Arora, John Grundy, Thuong Hoang, Henry Been-Lirn Duh · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA ’26)
This CHI 2026 Extended Abstract reports on an auto-generated persona system developed to help undergraduate students surface accessibility requirements in virtual reality design projects. The authors argue that personas are well-established in user-centered design and…
virtual reality · VR accessibility · personas · requirements engineering · large language models
Investigating the Role of Agentic AI in Facilitating Travel Planning for People with Low Vision
Ranran Ding, Maryam Bandukda · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA ’26)
This CHI 2026 Extended Abstract examines a stage of accessible travel that most assistive-technology research has overlooked: the pre-trip planning work people with low vision (PLV) do before ever leaving the house. The authors argue that most existing tools — navigation apps,…
low vision · wayfinding · agentic AI · large language models · conversational agents
Reassurance Robots: OCD in the Age of Generative AI
Grace Barkhuff · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA ’26)
This CHI 2026 Extended Abstracts paper by Grace Barkhuff (Georgia Tech) is an exploratory qualitative study of how generative AI tools, particularly ChatGPT, are reshaping the lived experience of Obsessive-Compulsive Disorder (OCD). OCD is a mental health disorder characterized…
OCD · Obsessive-Compulsive Disorder · generative AI · ChatGPT · mental health
Making Charts Speak: LLM-Based Conversational Chart Question Answering for Blind and Low-Vision Users
Amit Kumar Das, Mohammad Tarun, Klaus Mueller · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA '26)
Das, Tarun, and Mueller present GraphWhisper, a conversational system that lets blind and low-vision (BLV) users explore chart images (JPEG, PNG) through natural-language questions, without requiring the chart data to be pre-structured in formats like Vega-Lite. The authors…
chart accessibility · data visualization · blind and low vision · large language models · conversational interface
NarrAid: Supporting Storytelling of People with Aphasia via Generative Visual Scene Displays
Xiangfei Hu, Xiuqi Zheng, Qi Liu, Zejian Li, Ying Zhang, Lu Wang, Xipei Ren · 2026 · Extended Abstracts of the 2026 CHI Conference on Human Factors in Computing Systems (CHI EA ’26)
NarrAid is a generative AI-driven Visual Scene Display (VSD) system designed to support storytelling, not just basic wants and needs, for people with aphasia (PWA). The authors argue that existing AAC tools, including traditional VSDs, support functional communication well but…
aphasia · augmentative and alternative communication · visual scene display · storytelling · generative AI
NeuroBridge: Using Generative AI to Bridge Cross-neurotype Communication Differences through Neurotypical Perspective-taking
Rukhshan Haroon, Kyle Wigdor, Katie Yang, Nicole Toumanios, Eileen T Crehan, Fahad Dogar · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents NeuroBridge, an LLM-powered interactive platform designed to help neurotypical individuals better understand autistic communication styles and reflect on their own role in cross-neurotype communication breakdowns. The system is grounded in the double empathy…
autism · neurodiversity · large language models · cross-neurotype communication · perspective-taking
Benchmarking PDF Accessibility Evaluation: A Dataset and Framework for Assessing Automated and LLM-Based Approaches for Accessibility Testing
Anukriti Kumar, Tanushree Padath, Lucy Lu Wang · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper addresses a critical gap in PDF accessibility evaluation by introducing the first expert-validated benchmark dataset and standardized evaluation framework for assessing how well different tools and approaches can evaluate PDF accessibility. Despite PDFs being the…
PDF accessibility · automated testing · large language models · WCAG · PDF/UA
AccessGuru: Leveraging LLMs to Detect and Correct Web Accessibility Violations in HTML Code
Nadeen Fathallah, Daniel Hernández, Steffen Staab · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces AccessGuru, a novel method that combines traditional automated accessibility testing tools with large language models (LLMs) to both detect and correct web accessibility violations in HTML code. The work addresses a persistent gap in accessibility tooling:…
automated testing · web accessibility · large language models · HTML remediation · prompt engineering
Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers
Aziz N Zeidieh, JooYoung Seo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This experience report documents the lived experiences of two blind travelers — Aziz (28, blind in left eye, 20/2200 in right) and JooYoung (35, blind in right eye, limited vision in left) — as they adapted commercially available voice and video-capable language models (VVLMs)…
artificial intelligence · navigation · blindness and visual impairment · multimodal AI · large language models
CapTune: Adapting Non-Speech Captions With Anchored Generative Models
Jeremy Zhengqi Huang, Caluã De Lacerda Pataca, Saelyne Yang Wu, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
CapTune is a system that enables customization of non-speech captions—descriptions of environmental sounds, music, and other audio cues—for Deaf and Hard of Hearing (DHH) viewers. Current captioning practices follow a one-size-fits-all model based on standardized guidelines like…
closed captioning · non-speech information · caption customization · deaf and hard of hearing · generative AI
Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions
Meng Chen, Akhil Iyer, Amy Pavel · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper addresses a critical safety problem in AI-powered visual access technology: multimodal large language models (MLLMs) like GPT-4o, Gemini, and Claude produce fluent, confident image descriptions that can contain fabricated content, misinterpretations, and omissions…
blindness · low vision · image descriptions · multimodal AI · large language models
DescribePro: Collaborative Audio Description with Human-AI Interaction
Maryam S Cheema, Sina Elahimanesh, Samuel Martin, Pooyan Fazli, Hasti Seifi · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper presents DescribePro, a web-based platform that combines human expertise with AI capabilities to create and refine audio descriptions (AD) for video content. The system addresses the fundamental tension in AD production: human-crafted descriptions are high quality but…
audio description · video accessibility · human-AI collaboration · authoring tools · blind and low vision
CARTGPT: Real-Time Correction of CART Captions Using Large Language Models
Liang-Yuan Wu, Andrea Kleiver, Dhruv Jain · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces CARTGPT, a real-time system that enhances Communication Access Realtime Translation (CART) captions by combining human-generated CART transcripts with automatic speech recognition (ASR) output and using GPT-4 to detect and correct transcription errors. CART…
deaf and hard of hearing · real-time captioning · CART · large language models · automatic speech recognition
Understanding Human-AI Misalignment in LLM-Based Job-Seeking Support for Neurodivergent Users
Kaely Hall, Marcus Ma, Xinyue Zhang, Vedant Das Swain, Jennifer G Kim · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper examines how misalignments manifest between neurodivergent job-seekers and a GPT-4-powered career support chatbot deployed by Mentra, a neuroinclusive employment platform with over 46,000 neurodivergent users. The researchers analysed 348 real-world chat logs from 271…
neurodivergence · large language models · employment · AI alignment · autism
Examining Age-Bias and Stereotypes of Aging in LLMs
Sherwin Dewan, Ismail Shaikh, Connie Shaw, Abhilash Sahoo, Akshita Jha, Alisha Pradhan · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility
This paper investigates how large language models encode and reproduce age-related stereotypes about older adults. Using prompts from the Bias Benchmarking Questionnaire (BBQ), a well-established fairness dataset, the researchers administered 1,648 age-bias prompts to ChatGPT…
ageism · AI bias · large language models · older adults · stereotypes
Making Lecture Videos Accessible for Students who are Blind or have Low Vision through AI-Assisted Navigation and Visual Question Answering
Katharina Anderer, Karin Müller, Lukas Strobel, Matthias Wölfel, Jan Niehues, Kathrin Gerling · 2025 · Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2025)
This paper presents the design and evaluation of LectureAssistant, an AI-powered prototype that makes lecture videos more accessible for students who are blind or have low vision. The research follows a three-part human-centred design process. First, need-finding interviews with…
blind and low vision · lecture accessibility · higher education · large language models · vision-language models
When LLM-Generated Code Perpetuates User Interface Accessibility Barriers, How Can We Break the Cycle?
Alexandra-Elena Gurita, Radu-Daniel Vatavu · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This paper evaluates the ability of large language models (LLMs) to generate accessible web user interfaces, comparing ChatGPT (GPT-4-turbo) and Claude (3.5 Haiku) across two prompting strategies: accessibility-agnostic prompts ("Design the homepage of a banking app") and…
large language models · WCAG compliance · automated accessibility · prompt engineering · code generation
LLMs for Accessibility in Mobile Apps: Detection and Repair
Wajdi Aljedaani, Ahmed Aljohani, Marcelo M. Eler, Abdulrahman Habib, Hyunsook Do · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This study evaluates the capacity of three large language models—GPT-4o, Gemini 1.0 Pro, and Llama 3—to detect, classify, and remediate accessibility violations in Android mobile applications. While prior LLM accessibility research has focused primarily on web applications, this…
mobile accessibility · large language models · Android accessibility · automated accessibility testing · accessibility remediation
QuickQue: Enabling Quick Access to Information in User Reviews for Screen Reader Users
Mohan Sunkara, Akshay Kolgar Nayak, Sandeep Kalari, Yash Prakash, Sampath Jayarathna, Hae-Na Lee, Vikas Ashok · 2025 · Proceedings of the 22nd International Web for All Conference (W4A 2025)
This paper presents QuickCue, a Google Chrome browser extension that helps blind screen reader users efficiently access online customer reviews by using LLM-powered aspect and sentiment classification to organize and summarize review content. The current experience of reading…
screen readers · blind users · online reviews · large language models · browser extension
Morae: Proactively Pausing UI Agents for User Choices
Yi-Hao Peng, Dingzeyu Li, Jeffrey P. Bigham, Amy Pavel · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces Morae, a UI agent that proactively pauses during automated task execution to involve blind and low-vision (BLV) users in critical decisions, rather than completing tasks end-to-end without user input. The work is motivated by a field study with four BLV…
UI agents · blind and low vision · large language models · human-agent interaction · user agency
StepWrite: Adaptive Planning for Speech-Driven Text Generation
Hamza El Alaoui, Atieh Taheri, Yi-Hao Peng, Jeffrey P. Bigham · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces StepWrite, an LLM-powered voice-based writing system that enables structured, hands-free and eyes-free composition of longer-form texts. While speech-to-text tools handle short dictation well, composing structured emails or detailed responses requires…
voice interface · speech-to-text · hands-free interaction · eyes-free interaction · large language models
CodeA11y: Making AI Coding Assistants Useful for Accessible Web Development
Peya Mowar, Yi-Hao Peng, Jason Wu, Aaron Steinfeld, Jeffrey P. Bigham · 2025 · Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems (CHI '25)
This paper addresses a persistent problem: despite decades of accessibility standards and tools, ~96% of web pages contain accessibility violations. The authors argue that AI coding assistants like GitHub Copilot represent an untapped opportunity because developers already use…
web accessibility · AI coding assistants · developer tools · WCAG · automated testing
Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors
Michelle S. Lam, Fred Hohman, Dominik Moritz, Jeffrey P. Bigham, Kenneth Holstein, Mary Beth Kery · 2025 · Proceedings of the 38th Annual ACM Symposium on User Interface Software and Technology (UIST '25)
This paper introduces "policy maps," an approach to AI policy design for large language models inspired by physical mapmaking. The core insight is that comprehensive policy coverage over an unbounded space of LLM inputs and outputs is impossible — just as no map can capture…
AI safety · AI policy · large language models · AI ethics · model evaluation

Reviews

Year

Tag

Search results

AI4XR: AI in Extended Reality for 3D Scene Editing and Accessibility Design

Measuring the Semantic Accessibility Gap in LLM-Generated Web UIs

Auto-Generating Personas from User Reviews in VR App Stores

Investigating the Role of Agentic AI in Facilitating Travel Planning for People with Low Vision

Reassurance Robots: OCD in the Age of Generative AI

Making Charts Speak: LLM-Based Conversational Chart Question Answering for Blind and Low-Vision Users

NarrAid: Supporting Storytelling of People with Aphasia via Generative Visual Scene Displays

NeuroBridge: Using Generative AI to Bridge Cross-neurotype Communication Differences through Neurotypical Perspective-taking

Benchmarking PDF Accessibility Evaluation: A Dataset and Framework for Assessing Automated and LLM-Based Approaches for Accessibility Testing

AccessGuru: Leveraging LLMs to Detect and Correct Web Accessibility Violations in HTML Code

Check Now, Can You See It?: Exploring Voice and Video-Capable Language Models for Identifying and Spatially Locating Items of Interest for Blind and Low-Vision Travelers

CapTune: Adapting Non-Speech Captions With Anchored Generative Models

Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions

DescribePro: Collaborative Audio Description with Human-AI Interaction

CARTGPT: Real-Time Correction of CART Captions Using Large Language Models

Understanding Human-AI Misalignment in LLM-Based Job-Seeking Support for Neurodivergent Users

Examining Age-Bias and Stereotypes of Aging in LLMs

Making Lecture Videos Accessible for Students who are Blind or have Low Vision through AI-Assisted Navigation and Visual Question Answering

When LLM-Generated Code Perpetuates User Interface Accessibility Barriers, How Can We Break the Cycle?

LLMs for Accessibility in Mobile Apps: Detection and Repair

QuickQue: Enabling Quick Access to Information in User Reviews for Screen Reader Users

Morae: Proactively Pausing UI Agents for User Choices

StepWrite: Adaptive Planning for Speech-Driven Text Generation

CodeA11y: Making AI Coding Assistants Useful for Accessible Web Development

Policy Maps: Tools for Guiding the Unbounded Space of LLM Behaviors