Dynamic Hand Gesture Detector Using Python and Open CV
Dynamic Hand Gesture Detector Using Python and Open CV
*Lakshya Gaur
**Kapil Tomar
***Dr. Naveen Tyagi
Abstract 1. Introduction
Hand Gesture Recognition and Image Overlay
Using OpenCV and MediaPipe Hand gesture recognition is a crucial technology in
This research paper presents a method for real- the field of human-computer interaction. It allows
time hand gesture recognition and image overlay users to interact with digital devices using natural
using OpenCV and MediaPipe. The system hand movements, enhancing the user experience.
captures live video feed from a webcam, detects This project focuses on creating a hand gesture
hand gestures, and overlays corresponding images recognition system that identifies specific gestures
based on the detected gestures. The and overlays images accordingly. The system uses
implementation leverages the capabilities of Python, OpenCV, and MediaPipe, leveraging their
MediaPipe for hand tracking and OpenCV for powerful image processing and machine learning
image processing and display. The proposed capabilities.
method is efficient and runs in real-time, providing
immediate feedback on detected gestures. This The journey towards gesture-based interaction
paper discusses the system model, the underlying represents a departure from the conventional
algorithm, and the results obtained from the notion of computer interfaces as passive tools
implementation. controlled solely through manual manipulation.
Instead, it seeks to imbue technology with a
deeper understanding of human intention and
expression, enabling seamless interaction that
mirrors the fluidity of human communication.
At its core, gesture-based interaction draws
inspiration from the rich tapestry of human
movement, encompassing gestures, postures,
facial expressions, and even subtle nuances
of body language. By decoding and
interpreting these non-verbal cues, computers
can discern user intent and respond in a
manner that feels more natural and intuitive.
Images to Include
1. Development Environment:
Screenshot of the code editor with the
project open.
Photo of the setup with the webcam.
Hand Detection Process:
Raw webcam feed before processing.
Hand landmarks detection with
landmarks highlighted.
Gesture Recognition Output:
Examples of each gesture ("LIKE",
"DISLIKE", "NEUTRAL", "DYBALA
CELEBRATION") with the overlay
image.