Landmark Extraction

Also known as: Keypoint Detection, Skeletal Tracking

A computer vision technique that identifies and tracks specific anatomical points (landmarks, or keypoints) on the human body, hands, and face in images or video. In sign language technology, landmark extraction is a critical preprocessing step that converts raw video into structured coordinate data representing hand positions, finger configurations, body posture, and facial expressions. Frameworks like Google MediaPipe can extract hundreds of landmarks per frame in real time (its hand model alone outputs 21 per hand). Because recognition then operates on these coordinates rather than on pixels, an application does not need to store or transmit the original video, which addresses privacy concerns in webcam-based applications. The coordinate data is also far smaller than the video frames it is derived from, which reduces the computational cost of the recognition step.
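As an illustration of what "structured coordinate data" can look like, the following is a minimal sketch of turning one hand's landmarks into a numeric feature vector for a recognizer. It assumes the 21-point hand layout used by MediaPipe's hand model, with the wrist at index 0. The names Landmark and to_feature_vector, and the wrist-relative normalization itself, are hypothetical choices made for this example and are not part of any library. The extraction step is not shown; the landmarks are taken as given.

```python
from dataclasses import dataclass
from typing import List, Sequence

# Assumption: 21 landmarks per hand with the wrist at index 0,
# matching the layout of MediaPipe's hand landmark model.
HAND_LANDMARK_COUNT = 21
WRIST_INDEX = 0


@dataclass(frozen=True)
class Landmark:
    """One extracted keypoint (hypothetical container for this sketch)."""
    x: float
    y: float
    z: float


def to_feature_vector(hand: Sequence[Landmark]) -> List[float]:
    """Flatten one hand into wrist-relative, scale-normalized coordinates.

    Subtracting the wrist removes the hand's position in the frame;
    dividing by the largest absolute offset removes its apparent size.
    """
    if len(hand) != HAND_LANDMARK_COUNT:
        raise ValueError(
            f"expected {HAND_LANDMARK_COUNT} landmarks, got {len(hand)}"
        )

    wrist = hand[WRIST_INDEX]
    offsets = [(p.x - wrist.x, p.y - wrist.y, p.z - wrist.z) for p in hand]

    # Fall back to 1.0 when every point coincides with the wrist,
    # so a degenerate hand does not cause a division by zero.
    scale = max(abs(c) for triple in offsets for c in triple) or 1.0

    return [c / scale for triple in offsets for c in triple]


def make_synthetic_hand() -> List[Landmark]:
    """Build placeholder landmarks; real values would come from an extractor."""
    return [
        Landmark(0.5 + 0.01 * i, 0.5 - 0.005 * i, 0.0)
        for i in range(HAND_LANDMARK_COUNT)
    ]
```

Calling to_feature_vector(make_synthetic_hand()) yields a list of 63 floats (21 points, 3 coordinates each). Only a vector like this, not the video frame, would need to be passed to a recognition model.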

Category: technology · sign language · computer vision

Related: Pose Estimation · Sign Language Recognition · MediaPipe · Video-Based Sign Language Dictionary

Sources

https://developers.google.com/mediapipe/solutions/vision/hand_landmarker