Automated Media Player (Explanation of Code)

The document describes a Python script that utilizes OpenCV, MediaPipe, and PyAutoGUI to detect hand gestures via webcam and simulate keyboard inputs based on the number of fingers shown. It includes a function to count fingers based on hand landmarks and implements a loop to process video frames, detect hand gestures, and trigger corresponding keyboard actions. The script continuously captures video, recognizes gestures, and displays the processed frames until the user exits the program.


```python
import cv2               # OpenCV: webcam capture, image processing, display
import mediapipe as mp   # MediaPipe: hand-landmark detection
import pyautogui         # PyAutoGUI: simulated keyboard presses
import time              # standard library: timing for the gesture debounce
```

The code snippet above imports four Python libraries. Let's break down each one:

1. `cv2`: OpenCV (Open Source Computer Vision Library) is a popular library used for computer vision tasks such as image and video manipulation, object detection, and face recognition. Here it is imported under its usual module name, `cv2`.

2. `mediapipe`: MediaPipe is an open-source framework developed by Google for building cross-platform, real-time multimodal machine learning pipelines. It provides ready-to-use ML solutions for tasks such as hand tracking, pose detection, and face detection. Here it is imported with the alias `mp`.

3. `pyautogui`: PyAutoGUI is a Python library for automating keyboard and mouse input, useful for tasks like GUI automation and testing. It lets you control the mouse and keyboard programmatically. Here it is imported as `pyautogui`.

4. `time`: This standard Python library provides time-related functions for measuring intervals, adding delays, and so on. Here it is imported without an alias.

Taken together, these imports suggest a project that combines computer vision (MediaPipe for hand tracking or pose detection) with automated mouse and keyboard control (PyAutoGUI). The `time` library is likely used for introducing delays or timing operations within the code.
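
Before running such a script, it can help to confirm that the camera is actually reachable. Below is a minimal sanity check (a sketch added here for illustration; it is not part of the original script):

```python
import cv2

# Try to open the default camera and grab a single frame.
cap = cv2.VideoCapture(0)   # change the index if you have several cameras
ok, frame = cap.read()
print("Camera opened:", cap.isOpened(), "| frame captured:", ok)
cap.release()
```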
```python
def count_fingers(lst):
    cnt = 0

    # Half the vertical wrist-to-middle-knuckle distance, used as the
    # "finger is open" threshold (coordinates scaled from 0-1 to 0-100)
    thresh = (lst.landmark[0].y*100 - lst.landmark[9].y*100)/2

    # Index, middle, ring, pinky: tip sufficiently above its base joint
    if (lst.landmark[5].y*100 - lst.landmark[8].y*100) > thresh:
        cnt += 1

    if (lst.landmark[9].y*100 - lst.landmark[12].y*100) > thresh:
        cnt += 1

    if (lst.landmark[13].y*100 - lst.landmark[16].y*100) > thresh:
        cnt += 1

    if (lst.landmark[17].y*100 - lst.landmark[20].y*100) > thresh:
        cnt += 1

    # Thumb: tip far enough to the side of the index-finger base
    if (lst.landmark[5].x*100 - lst.landmark[4].x*100) > 6:
        cnt += 1

    return cnt
```

This Python function, `count_fingers(lst)`, counts the number of extended fingers in a detected hand pose. Let's break down the code:

1. **Function Definition**:
```python
def count_fingers(lst):
```
- This line defines a function named `count_fingers` that takes one argument, `lst`: a MediaPipe hand-landmarks object whose `.landmark` list holds 21 points with normalized `x` and `y` coordinates.

2. **Initialization**:
```python
cnt = 0
```
- Initializes a variable `cnt` to count the number of fingers detected.
It's set to 0 initially.

3. **Threshold Calculation**:
```python
thresh = (lst.landmark[0].y*100 - lst.landmark[9].y*100)/2
```
- Calculates a threshold as half the vertical distance between the wrist (landmark 0) and the base of the middle finger (landmark 9). A fingertip must rise at least this far above its base joint to count as open. The coordinates are multiplied by 100 because MediaPipe returns them normalized to the 0-1 range.

4. **Finger Counting**:
- The following `if` statements check certain conditions to determine
whether each finger is open or closed:
```python
if (lst.landmark[5].y*100 - lst.landmark[8].y*100) > thresh:
cnt += 1
```
- Checks whether the index fingertip (landmark 8) is higher in the frame than the index-finger base (landmark 5) by more than the threshold; note that smaller `y` values are higher up in image coordinates.
- The same logic is applied to the middle, ring, and pinky fingers using their respective tip and base landmarks.

5. **Thumb Position Check**:
```python
if (lst.landmark[5].x*100 - lst.landmark[4].x*100) > 6:
    cnt += 1
```
- Checks whether the thumb tip (landmark 4) sits far enough to the side of the index-finger base (landmark 5) along the x-axis: if the scaled horizontal gap exceeds 6, the thumb is treated as extended. This heuristic assumes a particular hand orientation in the (already flipped) frame.

6. **Return Count**:
```python
return cnt
```
- Returns the number of fingers judged to be open.

So, overall, this function counts open fingers by comparing each fingertip's position against its base joint, using a palm-height-based threshold for the four fingers and a fixed horizontal offset for the thumb.
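
A quick way to sanity-check `count_fingers` without a camera is to feed it a fake landmark object. This is a sketch: `fake_hand` is a hypothetical helper built with `SimpleNamespace` to mimic MediaPipe's landmark list (21 points with normalized `x`/`y`); it is not part of the original script:

```python
from types import SimpleNamespace

def fake_hand(raised_tips):
    # 21 landmarks, all near the bottom of the frame (a closed fist)
    pts = [SimpleNamespace(x=0.5, y=0.9) for _ in range(21)]
    pts[0].y = 0.95                  # wrist
    pts[9].y = 0.75                  # middle-finger base; makes thresh = 10
    for idx, y in raised_tips.items():
        pts[idx].y = y               # raise the chosen fingertips
    return SimpleNamespace(landmark=pts)

print(count_fingers(fake_hand({8: 0.4})))           # index only -> 1
print(count_fingers(fake_hand({8: 0.4, 12: 0.4})))  # index + middle -> 2
```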

```python
cap = cv2.VideoCapture(0)                # open the default webcam

drawing = mp.solutions.drawing_utils     # helpers for drawing landmarks
hands = mp.solutions.hands               # the MediaPipe Hands solution
hand_obj = hands.Hands(max_num_hands=1)  # track at most one hand
```

This code segment sets up webcam capture with OpenCV (`cv2`) and hand tracking with MediaPipe. Let's break down each part:

1. **Camera Capture Initialization**:
```python
cap = cv2.VideoCapture(0)
```
- This line initializes a video capture object named `cap` which will
capture video from the default camera (index 0). If you have multiple
cameras connected, you can specify a different index to capture from a
different camera.

2. **Mediapipe Setup**:
```python
drawing = mp.solutions.drawing_utils
hands = mp.solutions.hands
hand_obj = hands.Hands(max_num_hands=1)
```
- These lines grab the relevant submodules of the MediaPipe library (`mediapipe`) and set up a hand-tracking object.
- `mp.solutions.drawing_utils` provides utility functions to draw
landmarks and connections on the image.
- `mp.solutions.hands` provides a pre-trained model for hand
tracking.
- `hand_obj = hands.Hands(max_num_hands=1)` initializes the hand
tracking object, specifying that it should detect a maximum of 1 hand in
the frame.

Overall, this segment creates a video-capture object with OpenCV and initializes a MediaPipe hand-tracking object, which will then detect and track a hand in each frame captured from the webcam.
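
A slightly more defensive version of the same setup would verify that the camera actually opened and tune the detector's confidence thresholds. A sketch: `min_detection_confidence` and `min_tracking_confidence` are standard `Hands()` parameters, but the value 0.7 is illustrative, not taken from the original code:

```python
import cv2
import mediapipe as mp

cap = cv2.VideoCapture(0)
if not cap.isOpened():
    raise RuntimeError("Could not open webcam at index 0")

hands = mp.solutions.hands
hand_obj = hands.Hands(
    max_num_hands=1,
    min_detection_confidence=0.7,  # reject weak initial detections
    min_tracking_confidence=0.7,   # re-detect when tracking degrades
)
```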

```python
start_init = False
prev = -1

while True:
    end_time = time.time()
    _, frm = cap.read()
    frm = cv2.flip(frm, 1)                 # mirror the frame

    # MediaPipe expects RGB input; OpenCV captures BGR
    res = hand_obj.process(cv2.cvtColor(frm, cv2.COLOR_BGR2RGB))

    if res.multi_hand_landmarks:
        hand_keyPoints = res.multi_hand_landmarks[0]
        cnt = count_fingers(hand_keyPoints)

        if not(prev == cnt):
            if not(start_init):
                start_time = time.time()   # new count appeared; start timing
                start_init = True
            elif (end_time - start_time) > 0.2:
                # The new count has been held for 0.2 s; trigger the action
                if cnt == 1:
                    pyautogui.press("right")
                elif cnt == 2:
                    pyautogui.press("left")
                elif cnt == 3:
                    pyautogui.press("up")
                elif cnt == 4:
                    pyautogui.press("down")
                elif cnt == 5:
                    pyautogui.press("space")

                prev = cnt
                start_init = False

        drawing.draw_landmarks(frm, hand_keyPoints, hands.HAND_CONNECTIONS)

    cv2.imshow("window", frm)

    if cv2.waitKey(1) == 27:               # ESC: clean up and exit
        cv2.destroyAllWindows()
        cap.release()
        break
```

This is the main loop of the script: it simulates keyboard input based on hand gestures detected in real time from the webcam. Let's break it down:

1. **Variable Initialization**:
- `start_init` is a boolean variable initialized to `False`. It's used to
track whether the hand gesture recognition process has started.
- `prev` is a variable initialized to `-1`. It's used to track the previous
finger count.

2. **Main Loop**:
- `while True:` initiates an infinite loop to continuously process video
frames from the webcam.

3. **Capturing Video Frame**:
- `end_time = time.time()`: Records the current time at the top of this iteration; it is later compared against `start_time` to measure how long a gesture has been held.
- `_, frm = cap.read()`: Captures a frame from the webcam feed. The underscore `_` discards the boolean success flag returned by `cap.read()`, and the captured frame is stored in the variable `frm`.
- `frm = cv2.flip(frm, 1)`: Flips the frame horizontally, probably to
correct for the mirror effect of the webcam.

4. **Hand Detection and Gesture Recognition**:
- `res = hand_obj.process(cv2.cvtColor(frm, cv2.COLOR_BGR2RGB))`:
Processes the frame to detect hands using the hand tracking object
(`hand_obj`).
- `if res.multi_hand_landmarks:`: Checks if hand landmarks are
detected in the frame.
- `hand_keyPoints = res.multi_hand_landmarks[0]`: Retrieves the
landmarks of the first detected hand.
- `cnt = count_fingers(hand_keyPoints)`: Calculates the number of
fingers based on the detected hand landmarks using the `count_fingers`
function.

5. **Gesture Recognition and Keyboard Control**:
- Checks if the finger count has changed compared to the previous
count:
- If it has changed and `start_init` is `False`, it initializes the start
time (`start_time`) and sets `start_init` to `True`.
- If the time elapsed since `start_time` is greater than 0.2 seconds,
it recognizes the hand gesture based on the finger count (`cnt`) and
simulates keyboard inputs using `pyautogui.press()` accordingly. It then
updates `prev` with the current finger count and sets `start_init` back to
`False`.

6. **Drawing Hand Landmarks**:
- `drawing.draw_landmarks(frm, hand_keyPoints,
hands.HAND_CONNECTIONS)`: Draws landmarks and connections on the
frame to visualize hand tracking.

7. **Displaying the Frame**:
- `cv2.imshow("window", frm)`: Displays the processed frame in a
window named "window".

8. **Exiting the Program**:
- `if cv2.waitKey(1) == 27:`: Waits for a key press for 1 millisecond. If
the key pressed is the escape key (ESC, ASCII value 27), it destroys all
OpenCV windows, releases the webcam (`cap.release()`), and exits the
loop, terminating the program.

Overall, this code continuously captures video frames, detects hand gestures, recognizes specific gestures based on finger counts, and controls keyboard inputs accordingly. It uses OpenCV for webcam access, MediaPipe for hand tracking, and PyAutoGUI for simulating keyboard inputs.
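
The hold-for-0.2-seconds debounce and the count-to-key dispatch can also be isolated into a small helper, which makes the mapping easier to extend. This is a sketch of an equivalent formulation, not the original code: `GESTURE_KEYS` and `handle_gesture` are names introduced here, and the media-player meanings in the comments are typical bindings, not guaranteed:

```python
import time
import pyautogui

GESTURE_KEYS = {
    1: "right",   # e.g. seek forward in many media players (assumed binding)
    2: "left",    # seek backward
    3: "up",      # volume up
    4: "down",    # volume down
    5: "space",   # play / pause
}

prev = -1
start_init = False
start_time = 0.0

def handle_gesture(cnt):
    """Press a key only once the same new finger count is held for 0.2 s."""
    global prev, start_init, start_time
    if cnt == prev:
        return                         # nothing new; keep current state
    if not start_init:
        start_time = time.time()       # a new count appeared; start the timer
        start_init = True
    elif time.time() - start_time > 0.2:
        key = GESTURE_KEYS.get(cnt)
        if key:
            pyautogui.press(key)
        prev = cnt                     # remember the gesture we acted on
        start_init = False
```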
