The Potential of a Visual Dialogue Agent In a Tandem Automated Audio Description System for Videos
Abigale Stangl, Shasta Ihorn, Yue-Ting Siu, Aditya Bodi, Mar Castanon, Lothar D Narins, Ilmi Yoon · 2023 · Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS 2023)
This paper presents and evaluates a tandem AI-based audio description (AD) system for videos that combines two complementary tools: NarrationBot, which delivers automated minimum viable descriptions (MVD) of video content, and InfoBot, a visual dialogue agent that allows users…
audio description · blind and low vision · visual question answering · visual dialogue · AI