Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

  • A Personalizable Mobile Sound Detector App Design for Deaf and Hard-of-Hearing Users

    Danielle Bragg, Nicholas Huynh, Richard E. Ladner · 2016 · Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '16)

    This paper presents the design and evaluation of a personalizable mobile phone app that detects sounds of interest to deaf and hard-of-hearing (DHH) users by learning from training examples recorded by the user themselves. Unlike existing commercial sound detection products —…

    deaf and hard of hearing · mobile accessibility · machine learning · sound detection · personalization

  • SlidePacer: A Presentation Delivery Tool for Instructors of Deaf and Hard of Hearing Students

    Alessandra Brandão, Hugo Nicolau, Shreya Tadas, Vicki L. Hanson · 2016 · ASSETS '16: Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility

    SlidePacer addresses a fundamental challenge for deaf and hard-of-hearing (DHH) students in mainstream classrooms: the cognitive overload caused by splitting attention between multiple visual sources—instructor, slides, and sign language interpreter. Unlike previous classroom…

    deaf and hard of hearing · sign language interpreting · cognitive load · multimedia learning · classroom accessibility

  • Closed ASL Interpreting for Online Videos

    Matthew Seita · 2016 · ASSETS '16: Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility

    This paper introduces "closed interpreting"—a concept analogous to closed captions where an ASL interpreter video can be toggled on and customized alongside online video content. The motivation is straightforward but often overlooked: many deaf and hard of hearing people rely on…

    deaf and hard of hearing · American Sign Language · video accessibility · closed interpreting · multimedia accessibility

  • Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications

    Kehuang Li, Zhengyu Zhou, Chin-Hui Lee · 2016 · ACM Transactions on Accessible Computing

    This paper presents a scalable framework for continuous sign language recognition (SLR) designed to work in real-world conditions using affordable hardware. The researchers address a fundamental challenge in SLR: modeling the transitions between signs. Unlike spoken language…

    sign language recognition · hidden Markov models · machine learning · deaf and hard of hearing · wearable technology

  • Isolated Sign Language Recognition with Grassmann Covariance Matrices

    Hanjie Wang, Xiujuan Chai, Xiaopeng Hong, Guoying Zhao, Xilin Chen · 2016 · ACM Transactions on Accessible Computing

    This paper proposes a novel method for isolated sign language recognition using Grassmann Covariance Matrices (GCM) to fuse multimodal features captured by Microsoft Kinect. With 360 million people worldwide affected by hearing loss—21 million in China alone—automatic sign…

    sign language recognition · Chinese sign language · computer vision · machine learning · deaf and hard of hearing

  • Evaluating Intelligibility and Battery Drain of Mobile Sign Language Video Transmitted at Low Frame Rates and Bit Rates

    Jessica J. Tran, Eve A. Riskin, Richard E. Ladner, Jacob O. Wobbrock · 2015 · ACM Transactions on Accessible Computing (TACCESS)

    This paper investigates the lower limits of intelligible sign language video on mobile devices, seeking to determine the minimum frame rate and bit rate that still allow deaf users to understand ASL video conversations. Mobile video communication is essential for deaf and…

    deaf and hard of hearing · sign language · video communication · mobile accessibility · bandwidth

  • A mean for communication between deaf and hearing pairs in inclusive educational settings: the Sessai app

    Soraia Silva Prietch, Emanuel José dos Santos, Lucia Vilela Leite Filgueiras · 2015 · Proceedings of the 12th International Web for All Conference (W4A)

    This extended abstract presents Sessai, an Android application designed to facilitate communication between Deaf or Hard of Hearing (D/HH) students and their hearing peers and teachers in inclusive classroom settings in Brazil. The app uses a WhatsApp-inspired chat interface…

    deaf and hard of hearing · sign language · inclusive education · mobile application · assistive technology

  • Evaluation of Real-time Captioning by Machine Recognition with Human Support

    Hironobu Takagi, Takashi Itoh, Kaoru Shinkawa · 2015 · Proceedings of the 12th International Web for All Conference (W4A)

    This paper from IBM Research Tokyo investigates a hybrid approach to real-time captioning that combines Automated Speech Recognition (ASR) with human correction to make workplace meetings accessible for deaf and hard of hearing (DHH) employees. Professional stenography services…

    real-time captioning · deaf and hard of hearing · automated speech recognition · workplace accessibility · Japanese

  • Responsive Design for Personalised Subtitles

    Chris J. Hughes, Mike Armstrong, Rhianne Jones, Michael Crabb · 2015 · Proceedings of the 12th International Web for All Conference (W4A)

    This paper from BBC R&D and the University of Dundee proposes applying responsive web design principles to subtitle display, moving away from the legacy Teletext format that has constrained subtitling since 1979. Traditional subtitles are pre-blocked into fixed 38-character-wide…

    subtitles · captions · responsive design · video accessibility · personalization

  • Tracked Speech-To-Text Display: Enhancing Accessibility and Readability of Real-Time Speech-To-Text

    Raja S. Kushalnagar, Gary W. Behm, Aaron W. Kelstone, Shareef Ali · 2015 · ASSETS '15: Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility

    This research addresses a subtle but significant barrier facing deaf and hard of hearing (DHH) students in educational settings: visual dispersion. While hearing students can simultaneously watch lecture visuals (slides, demonstrations, whiteboard) and listen to the speaker's…

    deaf and hard of hearing · speech-to-text · CART · captioning · education

  • Introducing Game Elements in Crowdsourced Video Captioning by Non-Experts

    Hernisa Kacorri, Kaoru Shinkawa, Shin Saito · 2014 · Proceedings of the 11th Web for All Conference (W4A)

    This paper from CUNY Graduate Center and IBM Research Tokyo presents a gamified crowdsourcing platform for video captioning that combines ASR output with non-expert human transcription to improve caption accuracy without monetary rewards. The system builds on the Collaborative…

    captioning · crowdsourcing · deaf and hard of hearing · gamification · automatic speech recognition

  • Helping students keep up with real-time captions by pausing and highlighting

    Walter S. Lasecki, Raja Kushalnagar, Jeffrey P. Bigham · 2014 · Proceedings of the 11th Web for All Conference (W4A)

    This paper addresses a fundamental problem with real-time captioning for deaf and hard of hearing (DHH) students: the mismatch between speaking rates (approximately 170 words per minute) and reading rates, which causes students to fall progressively behind the live content. The…

    deaf and hard of hearing · captioning · real-time captioning · education · inclusive classrooms

  • Accessibility Evaluation of Classroom Captions

    Raja S. Kushalnagar, Walter S. Lasecki, Jeffrey P. Bigham · 2014 · ACM Transactions on Accessible Computing

    This paper presents a comprehensive evaluation of real-time captioning approaches for classroom lectures, comparing Communication Access Realtime Translation (CART), Automatic Speech Recognition (ASR), and a novel collaborative captioning system called Legion:Scribe. The authors…

    real-time captioning · deaf and hard of hearing · classroom accessibility · crowdsourcing · eye tracking

  • Identifying Sign Language Videos in Video Sharing Sites

    Frank M. Shipman, Ricardo Gutierrez-Osuna, Caio D. D. Monteiro · 2014 · ACM Transactions on Accessible Computing

    This paper addresses the challenge of finding sign language videos within general video sharing platforms like YouTube. While these platforms contain growing libraries of sign language content created by deaf community members, locating this content is difficult because…

    sign language · ASL · video classification · machine learning · computer vision

  • Enhancing Caption Accessibility through Simultaneous Multimodal Information: Visual-Tactile Captions

    Raja S. Kushalnagar, Gary W. Behm, Joseph S. Stanislow, Vasu Gupta · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)

    This paper addresses a fundamental limitation of captions (subtitles) for deaf and hard of hearing (DHH) viewers: captions force viewers to split attention between reading text at the bottom of the screen and watching the visual action, inevitably causing them to miss…

    captioning · deaf and hard of hearing · haptic feedback · multimodal interaction · non-speech information

  • Real-Time Caption Challenge: C-Print

    Michael S. Stinson, Pamela Francis, Lisa B. Elliot, Donna Easton · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS '14)

    This demonstration paper presents C-Print, a typing-based real-time captioning system developed over 25 years by researchers at the National Technical Institute for the Deaf (NTID) at Rochester Institute of Technology. C-Print provides communication access for deaf and hard of…

    deaf and hard of hearing · real-time captioning · communication access · transcription · mobile accessibility

  • Real-Time Captioning with the Crowd

    Walter S. Lasecki, Jeffrey P. Bigham · 2014 · Interactions

    This article presents Scribe, a crowdsourced real-time captioning system that allows groups of non-expert typists to collectively produce captions at the speed of natural speech — a task that normally requires highly trained professional stenographers. The authors motivate the…

    real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation

  • Legion Scribe: Real-Time Captioning by Non-Experts

    Walter S. Lasecki, Raja Kushalnagar, Jeffrey P. Bigham · 2014 · ACM SIGACCESS Conference on Computers and Accessibility

    This demonstration paper presents Legion:Scribe, a crowd-powered captioning system that enables groups of 3-5 non-expert typists to collectively produce real-time captions with less than 5 seconds of latency. The system addresses the prohibitive cost of professional…

    real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation

  • Implementation and Evaluation of Animation Controls Sufficient for Conveying ASL Facial Expressions

    Hernisa Kacorri, Matt Huenerfauth · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS '14)

    This ASSETS 2014 short paper (2 pages) reports an infrastructure contribution to sign-language-animation research: the authors extended an existing virtual human character (Max, on the open-source EMBR animation platform) with a full set of MPEG-4 Facial Action Parameter (FAP)…

    american sign language · sign language animation · signing avatar · deaf and hard of hearing · facial expression

  • Legion Scribe: Real-Time Captioning by the Non-Experts

    Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility (W4A)

    This demo paper introduces Legion Scribe, a system that enables real-time captioning of speech by harnessing 3-5 ordinary typists working simultaneously, rather than relying on expensive professional stenographers. Real-time captioning provides text equivalents of spoken…

    captioning · deaf and hard of hearing · crowdsourcing · real-time captioning · communication accessibility

  • Warping Time for More Effective Real-Time Crowdsourcing

    Walter S. Lasecki, Christopher D. Miller, Jeffrey P. Bigham · 2013 · Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2013)

    This paper introduces TimeWarp, a technique that manipulates audio playback speed to improve crowd workers' performance on real-time speech captioning. The core problem is that non-expert typists cannot keep up with natural speaking rates of 150-225 words per minute, forcing them…

    real-time captioning · crowdsourcing · deaf and hard of hearing · human computation · speech accessibility

  • Real-Time Captioning by Non-Experts with Legion Scribe

    Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2013)

    This short paper introduces Legion Scribe (Scribe), a system that enables 3-5 non-expert typists to collectively caption speech in real time, achieving accuracy approaching that of a professional stenographer at 20-30% of the cost. The system addresses a critical accessibility…

    real-time captioning · deaf and hard of hearing · crowdsourcing · human computation · assistive technology

  • Adaptive Time Windows for Real-Time Crowd Captioning

    Matthew J. Murphy, Christopher D. Miller, Walter S. Lasecki, Jeffrey P. Bigham · 2013 · CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems

    This paper addresses a key barrier to real-time captioning access for deaf and hard of hearing people: the high cost of professional stenographers, who can charge up to $200 per hour. Building on the Legion:Scribe system, which demonstrated that groups of non-expert crowd…

    real-time captioning · crowdsourcing · deaf and hard of hearing · assistive technology · human computation

  • Crowd Caption Correction (CCC)

    Rebecca Perkins Harrington, Gregg C. Vanderheiden · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '13)

    This short paper presents Crowd Caption Correction (CCC), a feature that allows meeting participants or authorized third parties to correct errors in real-time captions during telecollaboration sessions. Captions are critical for deaf and hard of hearing people to participate in…

    captioning · deaf and hard of hearing · crowdsourcing · telecollaboration · real-time captioning

  • Enhancing Learning Accessibility through Fully Automatic Captioning

    Maria Federico, Marco Furini · 2012 · Proceedings of the International Cross-Disciplinary Conference on Web Accessibility (W4A)

    This paper proposes an architecture for automatically generating synchronized captions for video lectures using off-the-shelf automatic speech recognition (ASR) software, aimed at making educational content accessible to hearing-impaired students, dyslexic students, ESL (English…

    captioning · speech recognition · education accessibility · deaf and hard of hearing · automatic speech recognition