Volume Report Final
Volume Report Final
BACHELOR OF ENGINEERING
In
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
BY
CHETHAN K (1RR20AI007)
K CHANDAN(1RR20AI011)
CERTIFICATE
“VOLUME CONTROL WITH HAND DETECTION”
Carried out by
CHETHAN K (1RR20AI007)
K CHANDAN(1RR20AI011)
The students of “Rajarajeswari College of Engineering” in partial fulfillment for the sixth
semester of Bachelor of Engineering in Artificial Intelligence and Machine Learning of the
Visvesvaraya Technological University, Belagavi during the year 2023. It is certified that all
corrections/suggestions indicated for Internal Assessment have been incorporated in the report
deposited in the departmental library. The mini project report has been approved as it satisfies
the academic requirements in respect of mini project work prescribed for the sixth semester.
……………………… ……………………………
External Viva-Voce
Name of the examiners: Signature with date:
1. 1.
2. 2.
ACKNOWLEDGEMENT
I also thank my parents who have encouraged me and supported me in every stage
of development of this project.
Last but not the least, my wishes to the entire Artificial Intelligence and Machine
Learning for their help and guidance, encouragement, inspiration and co-
operation at all stages of the development of this project without which this
project wouldn’t have been a success.
CHETHAN K(1RR20AI007)
K CHANDAN(1RR20AI011)
RAJARAJESWARI COLLEGE OF ENGINEERING
MYSORE ROAD, BANGALORE-560074
DECLARATION
I, Chethan k and K Chandan the student of sixth semester B.E, Department of Artificial
Intelligence and Machine learning, Rajarajeswari college of Engineering, Mysore Road,
Bengaluru, declare, that the Project Work entitled “VOLUME CONTROL WITH
HAND DETECTION” has been carried out by and submitted in partial fulfillment of the
requirement of 6th semester 2023. The matter embodied in this report has been submitted
to any university or institute for the award of any other degree or diploma.
Date:
ABSTRACT
The “Volume Control With Hand Detection Using Digital Image Processing
with Python" project aims to develop volume controlling of the device with the
help of the camera using the tips for our thumb and index finger. This project
leverages the power of digital image processing techniques implemented using
the Python programming language.
We can use our hand gestures to control the basic operation of a computer like
increasing and decreasing volume. Therefore, people will not have to learn
machine-skills which are a burden most of the time.
The purpose of this project is to discuss a volume control using hand gesture
recognition system based on detections of hand gestures .
The main aim is to create a system which can identify the human hand gestures
and use same input as the information for controlling the device and by using
real time gesture.
1. INTRODUCTION 1
2. RELATED WORK 2
3. LITERATURE STUDY 2
4. SOFTWARE REQUIREMENT 3
5. SYSTEM ARCHITECTURE 4
8. GUIDING PRINCIPLES 8
9. SNAPSHOT 9
11. CONCLUSION 11
12. REFERENCES 12
INTRODUCTION
Dept Of AIML,RRCE 1
RELATED WORK
In vision community hand gesture is an active area of research, for the purpose of
sign language recognition and human computer interaction. In this we have used
some algorithms and some modules to detect the gestures of the person and these
gestures are taken as the input in the system . Here, several modules are used like
opencv-python, mediapipe, numpy etc for the purpose of tracking the gestures.
After capturing the input from the user the image is used in the hand tracking
system to check the dimensions and shape of the gesture which is received in the
system.
Hand tracking module plays a important role in identifying the input recorded in
the system, after that classification and segmentation process is used to classify the
gestures in the system .Machine learning and deep learning is also used to identify
the training data from the system and identify it according to the requirement of
the system .After this the gestures are identified from the trained data and on the
basis of that data the gestures rae recognized and is used for processing of the the
system to implement the functions like increase and decrease in volume.
LITERATURE STUDY
As per various literature surveys it is found that for implementing this project the
basic concepts involved are:
1. Hand Detection and Localization
2. Image Preprocessing Techniques
3. Image Thresholding
4. Convex Hull Analysis
5. Finger Segmentation
6. Finger Tracking Algorithms
7. Real-time Video Processing
8. Python Libraries for Image Processing (OpenCV, NumPy, scikit-image)
Dept Of AIML,RRCE 2
SOFTWARE REQUIREMENTS
ANACONDA
It is a free and open source distribution of the Python for data science and machine
learning related applications (large-scale data processing, predictive analytics,
scientific computing), that aims to simplify package management and
deployment. Package versions are managed by the package management system.
The Anaconda distribution is used by over 6 million users, and it includes more
than 250 popular data science packages suitable for Windows, Linux, and
MacOS.
JUPYTER NOTEBOOK
The Jupyter Notebook is an open source web application that you can use to create
and share documents that contain live code, equations, visualizations and text.
Jupyter notebook is maintained by the people at project Jupyter. Jupyter
notebooks are a spin of project from the IPython project, which used to have an
IPython notebook project itself.
HARDWARE INTERFACES:
1. Processor: Intel CORE i7 processor with minimum 2.9 GHz speed.
2. RAM: Minimum 4 GB.
3. Hard Disk: Minimum 500 GB
SOFTWARE INTERFACES:
1. Python Programming Language
2. OpenCV framework
3. Windows 11
4. MediaPipe library
5. Time library
Dept Of AIML,RRCE 3
SYSTEM ARCHITECTURE
In this project we are using python technology to develop the project , the code
is written and designed in python language using Opencv and NumPy modules.
In this project firstly we import the libraries which are to be used for further
processing of the input and the output. The libraries which are used in this
project which needs to be imported are OpenCV, mediapipe, math, ctypes,
pycaw and numpy.We get video inputs from our primary camera.
Now, here mediapipe is used to detect the video as the input from our camera
and use mphand.hands module to detect the gesture .Then , in order to access the
speaker we have used the pycaw and we have provided the range of the volume
from minimum volume to maximum volume.
Next step is to convert the input image to rgb image to complete the processing
of the input captured. Then its turn to specify the points of thumb in input and
fingers.
Volume range id processed using the hand range in this process numpy is used
to convert this process and process the required output. NumPy package is
fundamental package for computing in Python language. It is consist of
several things like-
• powerful N-dimensional array
• object broadcasting
• tools to integrate C.
• Fourier transform, and random number capabilities
Dept Of AIML,RRCE 4
➢ System Architecture
Dept Of AIML,RRCE 5
THE LIBRARY AND PACKAGE
1. MEDIAPIPE
2. OPENCV
Dept Of AIML,RRCE 6
software under Apache License 2. Starting in 2011, OpenCV features GPU
acceleration for real-time operations.
3. NUMPY
Is the module of the Python. The numpy word basically shows Numerical Python
and it is utilized. This is the module which is basically written in c language and is
said as expansion module . Numpy guarantee remarkable execution speed. Numpy
is mostly used for performing calculations, tasks using certain functions it provides
like multiply, divide, power etc.
ADVANTAGES:
• Easy to use
• Hassle free
• Fun to use
• More interactive
DISADVANTAGES:
Dept Of AIML,RRCE 7
GUIDING PRINCIPLES
Dept Of AIML,RRCE 8
SNAPSHOT
Dept Of AIML,RRCE 9
FUTURE SCOPE
While the current implementation of the finger counter detection system offers
valuable advantages and capabilities, there are several avenues for future
development and expansion such as:
Dept Of AIML,RRCE 10
CONCLUSION
Dept Of AIML,RRCE 11
REFERENCES
Dept Of AIML,RRCE 12