Keyframe
Also known as: Key Frame
A keyframe is a single representative frame selected from a video scene or shot that best captures the essential visual content of that segment. In automated audio description and video captioning systems, keyframe selection is a critical step — the chosen frame is analyzed by computer vision models to generate descriptions of the visual content. The quality and representativeness of keyframe selection directly affects the accuracy of the resulting audio descriptions, as the selected frame must capture the most salient objects, actions, and context within the scene.
Category: video processing · computer vision · video accessibility · multimedia
Related: Scene Segmentation · Image Captioning · Audio Description · Object Detection