
Image Description

Also known as: Image Caption, Visual Description

A textual representation of an image's content that conveys objects, people, scenes, text, colors, spatial relationships, and other visual elements. Image descriptions are a primary means for blind and low vision (BLV) users to access visual content. They can be human-written (as in traditional alt text) or AI-generated (using computer vision or multimodal language models). AI-generated image descriptions have become increasingly detailed and context-aware, but they may contain errors, including fabrications, misinterpretations, and omissions. The quality and reliability of image descriptions directly affect BLV users' ability to understand their environment, make decisions, and participate in social and professional life.

Category: digital accessibility · artificial intelligence

Related: Alt Text · AI-Generated Alt Text · Multimodal Large Language Model · Visual Access Technology
