Multi-Perspective Visual Contrastive Decoding for Reliable Assistance
Bocheng Pan, Hailong Shi, Xingyu Gao · 2026 · ACM Transactions on Internet of Things
This technical paper presents MPVCD (Multi-Perspective Visual Contrastive Decoding), a framework designed to address the reliability of AI-generated visual descriptions for people who are blind or have low vision (BLV). The core problem it tackles: when BLV users photograph…
blindness and low vision · multimodal AI · image captioning · visual hallucination · assistive technology