← Writing · Glossary →

Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • Probing the Gaps in ChatGPT's Live Video Chat for Real-World Assistance for People who are Blind or Visually Impaired

    Ruei-Che Chang, Rosiana Natalie, Wenqian Xu, Jovan Zheng Feng Yap, Anhong Guo · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper evaluates ChatGPT's Advanced Voice with Video feature — OpenAI's state-of-the-art live video AI released in December 2024 — as a real-world assistive tool for blind and visually impaired (BVI) individuals. The researchers conducted an in-person exploratory study with…

    blind · visually impaired · large multimodal models · live video · ChatGPT

  • Exploring Object Status Recognition for Recipe Progress Tracking in Non-Visual Cooking

    Franklin Mingzhe Li, Kaitlyn Ng, Bin Zhu, Patrick Carrington · 2025 · ASSETS 2025: 27th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper presents OSCAR (Object Status Context Awareness for Recipes), a technical pipeline that uses object status recognition—tracking the condition and transformation of ingredients and tools—to support recipe progress tracking for blind and low vision (BLV) cooks. Unlike…

    blind and low vision · cooking accessibility · context awareness · object recognition · computer vision

  • Understanding How Blind Users Handle Object Recognition Errors: Strategies and Challenges

    Jonggi Hong, Hernisa Kacorri · 2024 · ASSETS '24: Proceedings of the 26th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper investigates how blind and low-vision users interact with object recognition systems, specifically focusing on how they identify and handle recognition errors. While object recognition technologies powered by computer vision and machine learning have enormous…

    blind users · object recognition · AI errors · computer vision · camera-based assistive technology

  • Understanding Personalized Accessibility through Teachable AI: Designing and Evaluating Find My Things for People who are Blind or Low Vision

    Cecily Morrison, Martin Grayson, Rita Faia Marques, Daniela Massiceti, Camilla Longden, Linda Wen, Edward Cutrell · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '23)

    This paper from Microsoft Research presents Find My Things, one of the first fully realized end-to-end teachable AI applications for accessibility. The app allows people who are blind or low vision to teach their phone to recognize personal objects — keys, earbuds, lip balm,…

    teachable AI · object recognition · blind and low vision · personalization · few-shot learning

  • Blind Users Accessing Their Training Images in Teachable Object Recognizers

    Jonggi Hong, Jaina Gandhi, Ernest Essuah Mensah, Farnaz Zamiri Azar, Kyungjun Lee, Hernisa Kacorri · 2022 · Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '22)

    This paper introduces MYCam, an open-source iOS testbed application designed to help blind users build and improve personalized object recognizers. While teachable object recognizers allow users to train custom models by taking photos of objects they want to recognize, blind…

    teachable AI · object recognition · blind users · machine learning · camera interaction

  • Fairness Issues in AI Systems that Augment Sensory Abilities

    Leah Findlater, Steven Goodman, Yuhang Zhao, Shiri Azenkot, Margot Hanley · 2020 · SIGACCESS Accessibility and Computing

    This paper examines the unique fairness challenges that arise when AI systems are used to augment sensory abilities for people with disabilities — a context distinct from other AI applications because these systems provide information that is already available to non-disabled…

    AI fairness · sensory augmentation · visual impairment · deaf and hard of hearing · privacy

  • ReCog: Supporting Blind People in Recognizing Personal Objects

    Dragan Ahmetovic, Daisuke Sato, Uran Oh, Tatsuya Ishihara, Kris Kitani, Chieko Asakawa · 2020 · Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems

    ReCog is a smartphone application designed to help blind users recognize their own personal objects — items like specific clothing, handmade goods, medicines, or family photos that cannot be identified by general-purpose recognizers such as Seeing AI or TapTapSee. The authors…

    visual impairment · blindness · object recognition · computer vision · deep learning

  • Revisiting Blind Photography in the Context of Teachable Object Recognizers

    Kyungjun Lee, Jonggi Hong, Simone Pimento, Ebrima Jarjue, Hernisa Kacorri · 2019 · Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)

    This paper introduces a real-time audio-haptic feedback system to help people with visual impairments frame objects in their smartphone camera when training teachable object recognizers. The challenge is that teachable recognizers — which let users train personalized models to…

    blind photography · teachable object recognizer · computer vision · deep learning · visual impairment

  • Closing the Gap: Designing for the Last-Few-Meters Wayfinding Problem for People with Visual Impairments

    Manaswi Saha, Alexander J. Fiannaca, Melanie Kneisel, Edward Cutrell, Meredith Ringel Morris · 2019 · Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2019)

    This paper comprehensively investigates the last-few-meters wayfinding problem — the gap between where GPS navigation ends (within approximately 5 meters of a destination) and where a person with visual impairment actually needs to arrive (the specific door, entrance, or…

    blindness · wayfinding · navigation · GPS · computer vision

  • A TensorFlow-based Assistive Technology System for Users with Visual Impairments

    Davide Mulfari · 2018 · Proceedings of the 15th International Web for All Conference (W4A 2018)

    This extended abstract presents a wearable computer vision system that uses deep learning to classify objects in a blind user’s surroundings and provide audio descriptions via text-to-speech. The system addresses a limitation of smartphone-based object recognition apps: people…

    computer vision · deep learning · blind · visual impairment · wearable technology

  • Investigating Cursor-based Interactions to Support Non-Visual Exploration in the Real World

    Anhong Guo, Saige McVea, Xu Wang, Patrick Clary, Ken Goldman, Yang Li, Yu Zhong, Jeffrey P. Bigham · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)

    This paper from Google and Carnegie Mellon University defines and compares three cursor-based interaction techniques designed to help blind and low vision people attend to specific items within complex real-world visual scenes. While computer vision systems like Seeing AI and…

    blindness · low vision · computer vision · non-visual exploration · mobile accessibility

  • "Hands On" Visual Recognition for Visually Impaired Users

    Joan Sosa-García, Francesca Odone · 2017 · ACM Transactions on Accessible Computing

    This paper presents a collaborative visual recognition system designed to help blind or visually impaired (BVI) users identify specific product instances — distinguishing between brands, models, or types of objects that feel similar when handled. While BVI individuals can often…

    visual impairment · object recognition · computer vision · assistive technology · wearable technology

  • People with Visual Impairment Training Personal Object Recognizers: Feasibility and Challenges

    Hernisa Kacorri, Kris M. Kitani, Jeffrey P. Bigham, Chieko Asakawa · 2017 · CHI Conference on Human Factors in Computing Systems

    This paper explores whether people with visual impairments can train their own personalized object recognition systems using a smartphone camera and a small number of example photos. The authors address a fundamental limitation of existing object recognition tools for blind…

    object recognition · computer vision · blindness · transfer learning · personalization

  • Marker-Assisted Recognition of Dynamic Content in Public Spaces

    Andréa Britto Mattos, Ricardo Herrmann, Carlos Cardonha, Diego Gallo, Priscilla Avegliano, Sergio Borger · 2014 · Proceedings of the 11th Web for All Conference (W4A)

    This paper from IBM Research Brazil presents an image processing system that helps visually impaired and situationally disabled people (such as tourists in foreign countries) recognize dynamic content displayed on public information boards. The system addresses a common…

    computer vision · visual impairment · object recognition · fiducial markers · public spaces

  • Marker-based image recognition of dynamic content for the visually impaired

    Andréa Britto Mattos, Carlos Cardonha, Diego Gallo, Priscilla Avegliano, Ricardo Herrmann, Sergio Borger · 2014 · Proceedings of the 11th Web for All Conference (W4A)

    This paper from IBM Research Brazil introduces a marker-based image recognition technique to help visually impaired people access information displayed on public panels and boards with fixed layouts but dynamic content — such as vending machines, split-flap airport displays,…

    computer vision · visual impairment · mobile accessibility · object recognition · situational disability

  • Real Time Object Scanning Using a Mobile Phone and Cloud-based Visual Search Engine

    Yu Zhong, Pierre J. Garrigues, Jeffrey P. Bigham · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2013)

    This paper presents Scan Search, an iPhone application that enables blind users to identify everyday objects in real time by continuously scanning with their phone camera rather than taking individual photos. The core challenge addressed is that blind people struggle with the…

    visual accessibility · object recognition · blind users · mobile accessibility · computer vision

  • Non-Visual-Cueing-Based Sensing and Understanding of Nearby Entities in Aided Navigation

    Juan Diego Gomez, Guido Bologna, Thierry Pun · 2012 · Proceedings of the 14th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2012)

    This demonstration paper presents a context-aware navigation aid system for blind individuals that combines three levels of assistance to enhance understanding of the surrounding environment. The system addresses a fundamental challenge: blind people navigating unfamiliar…

    visual impairment · blind navigation · context-aware computing · computer vision · spatial audio

  • An Integrated System for Blind Day-to-Day Life Autonomy

    Hugo Fernandes, José Faria, Hugo Paredes, João Barroso · 2011 · Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2011)

    This demonstration paper presents nav4b, an integrated system designed to support the day-to-day autonomy of blind people by combining indoor/outdoor navigation guidance with object recognition in a single platform. Unlike existing solutions that require users to carry multiple…

    blindness · navigation · RFID · object recognition · white cane

  • Analyzing Visual Questions from Visually Impaired Users

    Erin L. Brady · 2011 · The Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS)

    This doctoral consortium paper presents an analysis of the types of visual questions that visually impaired users ask through VizWiz, a mobile phone application that provides near-realtime answers to visual questions. VizWiz allows users to take a photo with their phone, speak a…

    blindness and low vision · crowdsourcing · computer vision · mobile accessibility · visual question answering

  • Toward 3D Scene Understanding via Audio-description: Kinect-iPad Fusion for the Visually Impaired

    Juan Diego Gomez, Sinan Mohammed, Guido Bologna, Thierry Pun · 2011 · Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2011)

    This demonstration paper presents a computer-vision-based framework that combines a Microsoft Kinect 3D depth sensor with an iPad touchscreen to enable visually impaired users to understand the spatial layout of indoor scenes through audio. The system works in several stages:…

    sonification · visual substitution · Kinect · computer vision · blindness

  • A Camera Phone Based Currency Reader for the Visually Impaired

    Xu Liu · 2008 · Proceedings of the 10th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '08)

    This paper presents a camera phone-based system for identifying U.S. paper currency denominations for people who are blind or visually impaired. The work addresses a specific accessibility gap: unlike currencies in many other countries, U.S. paper bills are identical in size and…

    visual impairment · computer vision · assistive technology · mobile accessibility · object recognition

  • Interactive Tracking of Movable Objects for the Blind on the Basis of Environment Models and Perception-Oriented Object Recognition Methods

    Andreas Hub, Tim Hartter, Thomas Ertl · 2006 · Proceedings of the 8th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '06)

    This paper from the University of Stuttgart presents advances to an indoor navigation and object identification system for blind and deafblind users. The system combines a stereo camera and 3D inertial sensor mounted on a bicycle helmet with detailed 3D environment models of…

    indoor navigation · object recognition · blind users · computer vision · assistive technology

  • Design and Development of an Indoor Navigation and Object Identification System for the Blind

    Andreas Hub, Joachim Diepstraten, Thomas Ertl · 2003 · Proceedings of the 6th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '04)

    This paper presents a multi-sensor orientation assistant for blind users navigating unknown indoor environments. The system addresses three core problems identified by blind users: determining one's position, determining head direction and movement direction, and identifying…

    indoor navigation · blindness and low vision · object recognition · orientation and mobility · mobile technology

23 results.