Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

A Personalizable Mobile Sound Detector App Design for Deaf and Hard-of-Hearing Users
Danielle Bragg, Nicholas Huynh, Richard E. Ladner · 2016 · Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '16)
This paper presents the design and evaluation of a personalizable mobile phone app that detects sounds of interest to deaf and hard-of-hearing (DHH) users by learning from training examples recorded by the user themselves. Unlike existing commercial sound detection products —…
deaf and hard of hearing · mobile accessibility · machine learning · sound detection · personalization
SlidePacer: A Presentation Delivery Tool for Instructors of Deaf and Hard of Hearing Students
Alessandra Brandão, Hugo Nicolau, Shreya Tadas, Vicki L. Hanson · 2016 · ASSETS '16: Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility
SlidePacer addresses a fundamental challenge for deaf and hard-of-hearing (DHH) students in mainstream classrooms: the cognitive overload caused by splitting attention between multiple visual sources—instructor, slides, and sign language interpreter. Unlike previous classroom…
deaf and hard of hearing · sign language interpreting · cognitive load · multimedia learning · classroom accessibility
Closed ASL Interpreting for Online Videos
Matthew Seita · 2016 · ASSETS '16: Proceedings of the 18th International ACM SIGACCESS Conference on Computers and Accessibility
This paper introduces "closed interpreting"—a concept analogous to closed captions where an ASL interpreter video can be toggled on and customized alongside online video content. The motivation is straightforward but often overlooked: many deaf and hard of hearing people rely on…
deaf and hard of hearing · American Sign Language · video accessibility · closed interpreting · multimedia accessibility
Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications
Kehuang Li, Zhengyu Zhou, Chin-Hui Lee · 2016 · ACM Transactions on Accessible Computing
This paper presents a scalable framework for continuous sign language recognition (SLR) designed to work in real-world conditions using affordable hardware. The researchers address a fundamental challenge in SLR: modeling the transitions between signs. Unlike spoken language…
sign language recognition · hidden Markov models · machine learning · deaf and hard of hearing · wearable technology
Isolated Sign Language Recognition with Grassmann Covariance Matrices
Hanjie Wang, Xiujuan Chai, Xiaopeng Hong, Guoying Zhao, Xilin Chen · 2016 · ACM Transactions on Accessible Computing
This paper proposes a novel method for isolated sign language recognition using Grassmann Covariance Matrices (GCM) to fuse multimodal features captured by Microsoft Kinect. With 360 million people worldwide affected by hearing loss—21 million in China alone—automatic sign…
sign language recognition · Chinese sign language · computer vision · machine learning · deaf and hard of hearing
Evaluating Intelligibility and Battery Drain of Mobile Sign Language Video Transmitted at Low Frame Rates and Bit Rates
Jessica J. Tran, Eve A. Riskin, Richard E. Ladner, Jacob O. Wobbrock · 2015 · ACM Transactions on Accessible Computing (TACCESS)
This paper investigates the lower limits of intelligible sign language video on mobile devices, seeking to determine the minimum frame rate and bit rate that still allow deaf users to understand ASL video conversations. Mobile video communication is essential for deaf and…
deaf and hard of hearing · sign language · video communication · mobile accessibility · bandwidth
A mean for communication between deaf and hearing pairs in inclusive educational settings: the Sessai app
Soraia Silva Prietch, Emanuel José dos Santos, Lucia Vilela Leite Filgueiras · 2015 · Proceedings of the 12th International Web for All Conference (W4A)
This extended abstract presents Sessai, an Android application designed to facilitate communication between Deaf or Hard of Hearing (D/HH) students and their hearing peers and teachers in inclusive classroom settings in Brazil. The app uses a WhatsApp-inspired chat interface…
deaf and hard of hearing · sign language · inclusive education · mobile application · assistive technology
Evaluation of Real-time Captioning by Machine Recognition with Human Support
Hironobu Takagi, Takashi Itoh, Kaoru Shinkawa · 2015 · Proceedings of the 12th International Web for All Conference (W4A)
This paper from IBM Research Tokyo investigates a hybrid approach to real-time captioning that combines Automated Speech Recognition (ASR) with human correction to make workplace meetings accessible for deaf and hard of hearing (DHH) employees. Professional stenography services…
real-time captioning · deaf and hard of hearing · automated speech recognition · workplace accessibility · Japanese
Responsive Design for Personalised Subtitles
Chris J. Hughes, Mike Armstrong, Rhianne Jones, Michael Crabb · 2015 · Proceedings of the 12th International Web for All Conference (W4A)
This paper from BBC R&D and the University of Dundee proposes applying responsive web design principles to subtitle display, moving away from the legacy Teletext format that has constrained subtitling since 1979. Traditional subtitles are pre-blocked into fixed 38-character-wide…
subtitles · captions · responsive design · video accessibility · personalization
Tracked Speech-To-Text Display: Enhancing Accessibility and Readability of Real-Time Speech-To-Text
Raja S. Kushalnagar, Gary W. Behm, Aaron W. Kelstone, Shareef Ali · 2015 · ASSETS '15: Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility
This research addresses a subtle but significant barrier facing deaf and hard of hearing (DHH) students in educational settings: visual dispersion. While hearing students can simultaneously watch lecture visuals (slides, demonstrations, whiteboard) and listen to the speaker's…
deaf and hard of hearing · speech-to-text · CART · captioning · education
Introducing Game Elements in Crowdsourced Video Captioning by Non-Experts
Hernisa Kacorri, Kaoru Shinkawa, Shin Saito · 2014 · Proceedings of the 11th Web for All Conference (W4A)
This paper from CUNY Graduate Center and IBM Research Tokyo presents a gamified crowdsourcing platform for video captioning that combines ASR output with non-expert human transcription to improve caption accuracy without monetary rewards. The system builds on the Collaborative…
captioning · crowdsourcing · deaf and hard of hearing · gamification · automatic speech recognition
Helping students keep up with real-time captions by pausing and highlighting
Walter S. Lasecki, Raja Kushalnagar, Jeffrey P. Bigham · 2014 · Proceedings of the 11th Web for All Conference (W4A)
This paper addresses a fundamental problem with real-time captioning for deaf and hard of hearing (DHH) students: the mismatch between speaking rates (approximately 170 words per minute) and reading rates, which causes students to fall progressively behind the live content. The…
deaf and hard of hearing · captioning · real-time captioning · education · inclusive classrooms
Accessibility Evaluation of Classroom Captions
Raja S. Kushalnagar, Walter S. Lasecki, Jeffrey P. Bigham · 2014 · ACM Transactions on Accessible Computing
This paper presents a comprehensive evaluation of real-time captioning approaches for classroom lectures, comparing Communication Access Realtime Translation (CART), Automatic Speech Recognition (ASR), and a novel collaborative captioning system called Legion:Scribe. The authors…
real-time captioning · deaf and hard of hearing · classroom accessibility · crowdsourcing · eye tracking
Identifying Sign Language Videos in Video Sharing Sites
Frank M. Shipman, Ricardo Gutierrez-Osuna, Caio D. D. Monteiro · 2014 · ACM Transactions on Accessible Computing
This paper addresses the challenge of finding sign language videos within general video sharing platforms like YouTube. While these platforms contain growing libraries of sign language content created by deaf community members, locating this content is difficult because…
sign language · ASL · video classification · machine learning · computer vision
Enhancing Caption Accessibility through Simultaneous Multimodal Information: Visual-Tactile Captions
Raja S. Kushalnagar, Gary W. Behm, Joseph S. Stanislow, Vasu Gupta · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS)
This paper addresses a fundamental limitation of captions (subtitles) for deaf and hard of hearing (DHH) viewers: captions force viewers to split attention between reading text at the bottom of the screen and watching the visual action, inevitably causing them to miss…
captioning · deaf and hard of hearing · haptic feedback · multimodal interaction · non-speech information
Real-Time Caption Challenge: C-Print
Michael S. Stinson, Pamela Francis, Lisa B. Elliot, Donna Easton · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS '14)
This demonstration paper presents C-Print, a typing-based real-time captioning system developed over 25 years by researchers at the National Technical Institute for the Deaf (NTID) at Rochester Institute of Technology. C-Print provides communication access for deaf and hard of…
deaf and hard of hearing · real-time captioning · communication access · transcription · mobile accessibility
Real-Time Captioning with the Crowd
Walter S. Lasecki, Jeffrey P. Bigham · 2014 · Interactions
This article presents Scribe, a crowdsourced real-time captioning system that allows groups of non-expert typists to collectively produce captions at the speed of natural speech — a task that normally requires highly trained professional stenographers. The authors motivate the…
real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation
Legion Scribe: Real-Time Captioning by Non-Experts
Walter S. Lasecki, Raja Kushalnagar, Jeffrey P. Bigham · 2014 · ACM SIGACCESS Conference on Computers and Accessibility
This demonstration paper presents Legion:Scribe, a crowd-powered captioning system that enables groups of 3-5 non-expert typists to collectively produce real-time captions with less than 5 seconds of latency. The system addresses the prohibitive cost of professional…
real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation
Implementation and Evaluation of Animation Controls Sufficient for Conveying ASL Facial Expressions
Hernisa Kacorri, Matt Huenerfauth · 2014 · Proceedings of the 16th International ACM SIGACCESS Conference on Computers & Accessibility (ASSETS '14)
This ASSETS 2014 short paper (2 pages) reports an infrastructure contribution to sign-language-animation research: the authors extended an existing virtual human character (Max, on the open-source EMBR animation platform) with a full set of MPEG-4 Facial Action Parameter (FAP)…
american sign language · sign language animation · signing avatar · deaf and hard of hearing · facial expression
Legion Scribe: Real-Time Captioning by the Non-Experts
Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility (W4A)
This demo paper introduces Legion Scribe, a system that enables real-time captioning of speech by harnessing 3-5 ordinary typists working simultaneously, rather than relying on expensive professional stenographers. Real-time captioning provides text equivalents of spoken…
captioning · deaf and hard of hearing · crowdsourcing · real-time captioning · communication accessibility
Warping Time for More Effective Real-Time Crowdsourcing
Walter S. Lasecki, Christopher D. Miller, Jeffrey P. Bigham · 2013 · Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2013)
This paper introduces TimeWarp, a technique that manipulates audio playback speed to improve crowd workers' performance on real-time speech captioning. The core problem is that non-expert typists cannot keep up with natural speaking rates of 150-225 words per minute, forcing them…
real-time captioning · crowdsourcing · deaf and hard of hearing · human computation · speech accessibility
Real-Time Captioning by Non-Experts with Legion Scribe
Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2013)
This short paper introduces Legion Scribe (Scribe), a system that enables 3-5 non-expert typists to collectively caption speech in real time, achieving accuracy approaching that of a professional stenographer at 20-30% of the cost. The system addresses a critical accessibility…
real-time captioning · deaf and hard of hearing · crowdsourcing · human computation · assistive technology
Adaptive Time Windows for Real-Time Crowd Captioning
Matthew J. Murphy, Christopher D. Miller, Walter S. Lasecki, Jeffrey P. Bigham · 2013 · CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems
This paper addresses a key barrier to real-time captioning access for deaf and hard of hearing people: the high cost of professional stenographers, who can charge up to $200 per hour. Building on the Legion:Scribe system, which demonstrated that groups of non-expert crowd…
real-time captioning · crowdsourcing · deaf and hard of hearing · assistive technology · human computation
Crowd Caption Correction (CCC)
Rebecca Perkins Harrington, Gregg C. Vanderheiden · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (Assets '13)
This short paper presents Crowd Caption Correction (CCC), a feature that allows meeting participants or authorized third parties to correct errors in real-time captions during telecollaboration sessions. Captions are critical for deaf and hard of hearing people to participate in…
captioning · deaf and hard of hearing · crowdsourcing · telecollaboration · real-time captioning
Enhancing Learning Accessibility through Fully Automatic Captioning
Maria Federico, Marco Furini · 2012 · Proceedings of the International Cross-Disciplinary Conference on Web Accessibility (W4A)
This paper proposes an architecture for automatically generating synchronized captions for video lectures using off-the-shelf automatic speech recognition (ASR) software, aimed at making educational content accessible to hearing impaired students, dyslexic students, ESL (English…
captioning · speech recognition · education accessibility · deaf and hard of hearing · automatic speech recognition
