Reviews

The literature-review database. Every paper Bob has reviewed (he has read many more), with a short summary, key findings, and tags. Browse, filter, search.

Search results

Towards More Robust Speech Interactions for Deaf and Hard of Hearing Users
Raymond Fok, Harmanpreet Kaur, Skanda Palani, Martez E. Mott, Walter S. Lasecki · 2018 · Proceedings of the 20th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2018)
This University of Michigan study addresses a largely overlooked accessibility gap: while much research has focused on providing deaf users access to spoken output (via captioning or sign language), almost no work has addressed improving deaf users' ability to provide speech…
deaf and hard of hearing · automatic speech recognition · deaf speech · crowdsourcing · speech intelligibility
Crowd-AI Camera Sensing in the Real World
Anhong Guo, Anuraag Jain, Shomiron Ghose, Gierad Laput, Chris Harrison, Jeffrey P. Bigham · 2018 · Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies
This paper presents Zensors++, a hybrid crowd-AI camera sensing system that allows users to point a networked camera at a scene, define a natural language question about it (such as "Is the coffee machine in use?" or "How many people are in the room?"), and receive continuous,…
crowdsourcing · computer vision · human computation · machine learning · smart environments
In-context Q&A to Support Blind People Using Smartphones
André Rodrigues, Kyle Montague, Hugo Nicolau, João Guerreiro, Tiago Guerreiro · 2017 · Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '17)
This paper addresses a pervasive but understudied problem: the daily challenges blind people face when using smartphone applications go far beyond the touchscreen gesture difficulties that most accessibility research focuses on. The researchers conducted workshops with 42 blind…
blindness · smartphone accessibility · screen readers · crowdsourcing · human computation
Scribe: Deep Integration of Human and Machine Intelligence to Caption Speech in Real Time
Walter S. Lasecki, Christopher D. Miller, Iftekhar Naim, Raja Kushalnagar, Adam Sadilek, Daniel Gildea, Jeffrey P. Bigham · 2017 · Communications of the ACM
Scribe is a system that provides on-demand, real-time captioning of live speech for deaf and hard of hearing (DHH) people by combining groups of non-expert human captionists with machine intelligence. The system addresses a critical accessibility gap: professional CART…
real-time captioning · deaf and hard of hearing · crowdsourcing · human computation · speech recognition
WearMail: On-the-Go Access to Information in Your Email with a Privacy-Preserving Human Computation Workflow
Saiganesh Swaminathan, Raymond Fok, Fanglin Chen, Ting-Hao (Kenneth) Huang, Irene Lin, Rohan Jadvani, Walter S. Lasecki, Jeffrey P. Bigham · 2017 · Proceedings of the 30th Annual ACM Symposium on User Interface Software and Technology (UIST 2017)
WearMail is a conversational system that extracts specific information from a user's email via voice queries on wearable devices (such as smartwatches), using a novel privacy-preserving human computation workflow. The system addresses the challenge that email functions as…
crowdsourcing · human computation · privacy · wearable technology · information extraction
The Effects of Automatic Speech Recognition Quality on Human Transcription Latency
Yashesh Gaur, Walter S. Lasecki, Florian Metze, Jeffrey P. Bigham · 2016 · Proceedings of the 13th International Web for All Conference (W4A)
This paper from Carnegie Mellon University and the University of Michigan empirically investigates when automatic speech recognition (ASR) output helps or hinders human transcriptionists producing captions for deaf and hard of hearing people. Manual transcription remains…
speech recognition · captioning · deaf and hard of hearing · crowdsourcing · human computation
WearWrite: Crowd-Assisted Writing from Smartwatches
Michael Nebeling, Alexandra To, Anhong Guo, Adrian A. de Freitas, Jaime Teevan, Steven P. Dow, Jeffrey P. Bigham · 2016 · Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI 2016)
WearWrite is a system that enables users to write documents from their smartwatches by leveraging crowd workers to translate ideas into text. The system addresses the fundamental limitation that smartwatches have severely constrained input/output — touch-based text input is…
crowdsourcing · wearable technology · human computation · collaborative writing · mobile accessibility
The Effects of Automatic Speech Recognition Quality on Human Transcription Latency
Yashesh Gaur · 2015 · ASSETS '15: Proceedings of the 17th International ACM SIGACCESS Conference on Computers & Accessibility
This paper investigates a practical question for accessibility: when does providing automatic speech recognition (ASR) output help human captionists work faster, and when does it slow them down? Converting speech to text is fundamental for making audio content accessible to deaf…
deaf · hard of hearing · automatic speech recognition · ASR · captioning
Guiding Novice Web Workers in Making Image Descriptions Using Templates
Valerie S. Morash, Yue-Ting Siu, Joshua A. Miele, Lucia Hasty, Steven Landau · 2015 · ACM Transactions on Accessible Computing (TACCESS)
This study compares two approaches for using non-expert crowdworkers to create accessible descriptions of STEM images (charts, graphs, diagrams) for people who are blind or have print-reading disabilities. The researchers tested Free-Response Image Description (FRID), where…
image description · alt text · crowdsourcing · human computation · STEM accessibility
Zensors: Adaptive, Rapidly Deployable, Human-Intelligent Sensor Feeds
Gierad Laput, Walter S. Lasecki, Jason Wiese, Robert Xiao, Jeffrey P. Bigham, Chris Harrison · 2015 · Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI 2015)
Zensors presents a novel sensing platform that combines real-time human intelligence from online crowd workers with machine learning to create adaptive, rapidly deployable intelligent sensors. The system addresses a fundamental gap in smart environment technology: traditional…
crowdsourcing · human computation · computer vision · smart environments · machine learning
Introducing Game Elements in Crowdsourced Video Captioning by Non-Experts
Hernisa Kacorri, Kaoru Shinkawa, Shin Saito · 2014 · Proceedings of the 11th Web for All Conference (W4A)
This paper from CUNY Graduate Center and IBM Research Tokyo presents a gamified crowdsourcing platform for video captioning that combines ASR output with non-expert human transcription to improve caption accuracy without monetary rewards. The system builds on the Collaborative…
captioning · crowdsourcing · deaf and hard of hearing · gamification · automatic speech recognition
Real-Time Captioning with the Crowd
Walter S. Lasecki, Jeffrey P. Bigham · 2014 · Interactions
This article presents Scribe, a crowdsourced real-time captioning system that allows groups of non-expert typists to collectively produce captions at the speed of natural speech — a task that normally requires highly trained professional stenographers. The authors motivate the…
real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation
Legion Scribe: Real-Time Captioning by Non-Experts
Walter S. Lasecki, Raja Kushalnagar, Jeffrey P. Bigham · 2014 · ACM SIGACCESS Conference on Computers and Accessibility
This demonstration paper presents Legion:Scribe, a crowd-powered captioning system that enables groups of 3-5 non-expert typists to collectively produce real-time captions with less than 5 seconds of latency. The system addresses the prohibitive cost of professional…
real-time captioning · crowdsourcing · deaf and hard of hearing · speech-to-text · human computation
Legion Scribe: Real-Time Captioning by the Non-Experts
Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility (W4A)
This demo paper introduces Legion Scribe, a system that enables real-time captioning of speech by harnessing 3-5 ordinary typists working simultaneously, rather than relying on expensive professional stenographers. Real-time captioning provides text equivalents of spoken…
captioning · deaf and hard of hearing · crowdsourcing · real-time captioning · communication accessibility
Warping Time for More Effective Real-Time Crowdsourcing
Walter S. Lasecki, Christopher D. Miller, Jeffrey P. Bigham · 2013 · Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI 2013)
This paper introduces TimeWarp, a technique that manipulates audio playback speed to improve crowd workers' performance on real-time speech captioning. The core problem is that non-expert typists cannot keep up with natural speaking rates of 150-225 words per minute, forcing them…
real-time captioning · crowdsourcing · deaf and hard of hearing · human computation · speech accessibility
Real-Time Captioning by Non-Experts with Legion Scribe
Walter S. Lasecki, Christopher D. Miller, Raja Kushalnagar, Jeffrey P. Bigham · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2013)
This short paper introduces Legion Scribe (Scribe), a system that enables 3-5 non-expert typists to collectively caption speech in real time, achieving accuracy approaching that of a professional stenographer at 20-30% of the cost. The system addresses a critical accessibility…
real-time captioning · deaf and hard of hearing · crowdsourcing · human computation · assistive technology
Real-Time Crowd Labeling for Deployable Activity Recognition
Walter S. Lasecki, Young Chol Song, Henry Kautz, Jeffrey P. Bigham · 2013 · Proceedings of the 2013 Conference on Computer Supported Cooperative Work (CSCW 2013)
Legion:AR is a system that provides deployable activity recognition by combining real-time crowd labeling with automatic recognition using Hidden Markov Models (HMMs). The system addresses a critical limitation of current activity recognition: automated systems must be trained…
activity recognition · crowdsourcing · human computation · machine learning · aging in place
Answering Visual Questions with Conversational Crowd Assistants
Walter S. Lasecki, Phyo Thiha, Yu Zhong, Erin Brady, Jeffrey P. Bigham · 2013 · Proceedings of the 15th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2013)
This paper introduces Chorus:View, a system that enables blind users to get visual questions answered through continuous conversational interaction with multiple crowd workers viewing a live video stream from the user's mobile device. The system addresses key limitations of…
blind and low vision · crowdsourcing · human computation · assistive technology · visual assistance
Adaptive Time Windows for Real-Time Crowd Captioning
Matthew J. Murphy, Christopher D. Miller, Walter S. Lasecki, Jeffrey P. Bigham · 2013 · CHI EA '13: CHI '13 Extended Abstracts on Human Factors in Computing Systems
This paper addresses a key barrier to real-time captioning access for deaf and hard of hearing people: the high cost of professional stenographers, who can charge up to $200 per hour. Building on the Legion:Scribe system, which demonstrated that groups of non-expert crowd…
real-time captioning · crowdsourcing · deaf and hard of hearing · assistive technology · human computation
Chorus: A Crowd-Powered Conversational Assistant
Walter S. Lasecki, Rachel Wesley, Jeffrey Nichols, Anand Kulkarni, James F. Allen, Jeffrey P. Bigham · 2013 · UIST '13: Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology
This paper presents Chorus, a crowd-powered conversational assistant that enables users to hold natural, continuous conversations with what appears to be a single partner, but is actually backed by multiple crowd workers operating in real-time. The system addresses a fundamental…
crowdsourcing · conversational assistants · human computation · dialog systems · real-time systems
Using Real-time Feedback to Improve Visual Question Answering
Yu Zhong, Phyo Thiha, Grant He, Walter Lasecki, Jeffrey Bigham · 2012 · CHI EA '12: CHI '12 Extended Abstracts on Human Factors in Computing Systems
This work-in-progress paper introduces Legion:View, a system that extends the VizWiz model of crowd-powered visual question answering by adding a real-time feedback loop between blind users and crowd workers. The original VizWiz allowed blind users to take a still photograph,…
visual question answering · crowdsourcing · blind users · real-time systems · assistive technology
Real-Time Captioning by Groups of Non-Experts
Walter Lasecki, Christopher Miller, Adam Sadilek, Andrew Abumoussa, Donato Borrello, Raja Kushalnagar, Jeffrey Bigham · 2012 · UIST '12: Proceedings of the 25th Annual ACM Symposium on User Interface Software and Technology
This paper presents Legion:Scribe, an end-to-end system that enables groups of non-expert typists to collectively produce real-time captions for deaf and hard of hearing (DHH) people, offering a cheaper and more available alternative to professional stenographers (CART, costing…
real-time captioning · crowdsourcing · deaf and hard of hearing · human computation · text alignment
Online Quality Control for Real-Time Crowd Captioning
Walter S. Lasecki, Jeffrey P. Bigham · 2012 · Proceedings of the 14th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2012)
This paper addresses quality control in Legion:Scribe, a system that provides real-time captioning by having multiple non-expert crowd workers simultaneously type what they hear, then automatically merging their partial transcriptions into a single caption stream. Real-time…
real-time captioning · crowdsourcing · deaf and hard of hearing · automatic speech recognition · human computation
The Design of Human-Powered Access Technology
Jeffrey P. Bigham, Richard E. Ladner, Yevgen Borodin · 2011 · Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2011)
This paper presents a comprehensive framework of 13 design dimensions for evaluating and comparing human-powered access technology — systems that use human assistance, facilitated by technology, to overcome accessibility barriers that automation alone cannot solve. The authors…
human computation · crowdsourcing · assistive technology · accessibility framework · design principles
VizWiz: Nearly Real-Time Answers to Visual Questions
Jeffrey P. Bigham, Chandrika Jayant, Hanjie Ji, Greg Little, Andrew Miller, Robert C. Miller, Aubrey Tatarowicz, Brandyn White, Samuel White, Tom Yeh · 2010 · Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A)
This paper introduces VizWiz, a pioneering mobile application that enables blind and low-vision people to get nearly real-time answers to visual questions by connecting their smartphone cameras to remote paid workers on Amazon Mechanical Turk. Users take a photo with their…
blind and low vision · crowdsourcing · assistive technology · mobile accessibility · human computation
