Exploring Collection of Sign Language Datasets: Privacy, Participation, and Model Performance
Danielle Bragg, Oscar Koller, Naomi Caselli, William Thies · 2020 · Proceedings of the 22nd International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2020)
This paper tackles a fundamental tension in building machine learning systems for marginalized communities: the need for large training datasets versus the privacy risks of collecting data from small, identifiable populations. The authors focus on sign language video collection,…
sign language · privacy · machine learning · data collection · Deaf culture