
Research Article
Volume-1 | Issue-1 | Jan-Jun-2024
Journal of Image Processing and Image Restoration
Double Blind Peer Reviewed Journal
DOI: https://doi.org/10.48001/JoIPIR

Voice Conversion and Hand Gesture Recognition for Aphonic People

Monith M¹, Punith Kumar N¹, Naveen Kumar R¹, Lokesh B S¹, Raghunath B H¹*
¹Department of Electronics and Communication Engineering, Acharya Institute of Technology, Bengaluru, Karnataka, India
*Corresponding Author's Email: [email protected]

ARTICLE HISTORY:
Received: 14th Dec, 2023
Revised: 18th Jan, 2024
Accepted: 28th Jan, 2024
Published: 9th Feb, 2024

KEYWORDS: Aphonic people, Communication, Database, Hand gesture, Voice conversion

ABSTRACT: Aphonia, a condition resulting in the loss of voice, presents significant challenges in interpersonal interactions. This project proposes a dual-pronged approach involving hand gesture recognition and voice conversion techniques to facilitate effective communication for aphonic individuals. The integration of real-time hand gesture recognition provides an alternative means of expressing ideas and emotions: by capturing hand gestures and translating them into textual or auditory output, this approach offers a versatile mode of communication. Additionally, voice conversion algorithms are employed to synthesize natural and intelligible speech from typed or selected text. This coupling of technologies empowers aphonic individuals to engage in fluid conversations, fostering improved social interactions and enhancing their overall quality of life. A webcam is used to communicate with deaf and aphonic people. When modalities of communication such as speech are unavailable, the human hand is the preferred option: hand gestures that convey concepts through diverse shapes and finger alignments enable human-machine interaction. The purpose of this work is to develop a hand gesture detection model and translate its results into text and audio formats. The model also responds to user voice commands and displays hand signs from the database.

1. INTRODUCTION

This work creates a system that allows aphonic people, who rely heavily on sign language, to communicate with others. It is extremely difficult for aphonic persons to convey their message to non-aphonic people, because most hearing people are never taught sign language, and expressing a message during an emergency is especially hard. The solution, therefore, is to transform sign language into audible speech and text. In this project, machine learning is used to train on hand gesture photos, and the trained model is then used to predict those learned hand gestures from a camera. The main goal of the system is to offer deaf and mute persons a more regular existence. A system of this type also allows visually impaired people to readily understand language, and people who are deaf or hard of hearing can communicate their message using text and gestures. Deaf persons can interpret other people's speech from the text shown, which also enables them to live more autonomous lives (Al-Obodi et al., 2020).
The ability to perceive, listen, talk, and respond to situations is one of the most valuable gifts a human being can have, yet some unfortunate people are denied it. It is difficult to create a single compact model for those with visual, hearing, and vocal disabilities, and communication between deaf-mute and hearing people has always been difficult. This project presents a communication system for deaf and mute individuals in a single compact model. We present a method for a blind person to read text by taking an image with a camera and translating the text to speech (TTS). The system enables deaf people to read others' speech as text using speech-to-text (STT) conversion technology, and it includes a method for mute people to use text-to-voice conversion. Blind individuals can read words using PyTesseract OCR (Optical Character Recognition). A laptop is used to carry out all these tasks (Amrutha & Prabu, 2021).
2. OBJECTIVES AND METHODOLOGY

The proposed hand gesture recognition and voice conversion system for deaf and aphonic people, together with the strategies for accomplishing its objectives, is as follows:

• To recognize distinct hand gestures.
• To convert hand gestures to text format.
• To convert hand gestures to speech format.
• To convert speech to text format.

Objective 1: To Recognize Different Hand Gestures

First, images of the hand gestures are taken with a web camera and stored in JPEG format. Using the MediaPipe algorithm, landmarks are detected on the palm of the hand and their coordinates are obtained. There are 21 landmarks available, and each hand sign has its own unique landmarks and coordinates. Each hand sign is labelled and stored in the dataset, and multiple images of different hand signs are captured and stored to train the model accurately, as given in Figure 1 (Liu et al., 2016).

Figure 1: Flowchart to Train the Model.

While running the model, an image is captured with the web camera and compared with the labelled information in the database. If the image matches the labelled information accurately, the hand gesture is recognized and the output is shown; if it does not match, the gesture is rejected, as shown in Figure 2.

Figure 2: Flowchart to Recognize the Hand Gesture.
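As a rough illustration of the capture step, the sketch below (ours, not the authors' code) reads webcam frames with cv2 and extracts the 21 palm landmarks with MediaPipe Hands; the gesture label "hello" and the file landmarks.csv are illustrative placeholders, not the project's actual dataset layout.

```python
# Minimal sketch of landmark capture with MediaPipe Hands; the label
# and CSV path are hypothetical placeholders for the labelled dataset.
import csv
import cv2
import mediapipe as mp

LABEL = "hello"            # hypothetical gesture label
OUT_CSV = "landmarks.csv"  # hypothetical dataset file

hands = mp.solutions.hands.Hands(static_image_mode=False, max_num_hands=1)
cap = cv2.VideoCapture(0)  # default web camera

with open(OUT_CSV, "a", newline="") as f:
    writer = csv.writer(f)
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB; OpenCV captures BGR.
        result = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if result.multi_hand_landmarks:
            lm = result.multi_hand_landmarks[0].landmark  # 21 landmarks
            # One labelled row of 63 values (x, y, z per landmark).
            writer.writerow([LABEL] + [c for p in lm for c in (p.x, p.y, p.z)])
        cv2.imshow("capture", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break

cap.release()
cv2.destroyAllWindows()
```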
Objective 2: To Convert Hand Gestures to Text Format

The image captured by the web camera is compared with the labelled information stored in the database. If the image matches the stored information accurately, the hand gesture is recognized. The recognized gesture is then rendered as a text message using PyTesseract (OCR) and displayed on the output screen.

A few features, such as eigenvalues and eigenvectors, are extracted and used in recognition. The linear discriminant analysis (LDA) algorithm is then applied to recognize gestures before they are converted to text and audio format. Noise is reduced as a result of the dimensionality reduction, and the system works with high accuracy, as given in Figure 3.

Figure 3: Flowchart to Convert Hand Gestures to Word Format.
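The paper does not give its recognition code; as a hedged sketch of the LDA step described here, scikit-learn's LinearDiscriminantAnalysis (our substitution, not the authors' implementation) can both project the 63-value landmark vectors into a lower-dimensional space, an eigendecomposition-based reduction that suppresses noise, and act as the classifier. The file name and helper below are illustrative.

```python
# Sketch of LDA-based gesture recognition over landmark feature
# vectors like those saved above; scikit-learn stands in for any
# custom code, and "landmarks.csv" is a hypothetical dataset file.
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Column 0 holds the gesture label; columns 1..63 hold coordinates.
data = np.loadtxt("landmarks.csv", delimiter=",", dtype=str)
X = data[:, 1:].astype(float)
y = data[:, 0]

# LDA projects to at most (n_classes - 1) components, reducing
# dimensionality (and noise) before classification.
lda = LinearDiscriminantAnalysis()
lda.fit(X, y)

def recognize(landmarks_63):
    """Return the predicted gesture label for one landmark vector."""
    return lda.predict(np.asarray(landmarks_63).reshape(1, -1))[0]
```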
Objective 3: To Convert Hand Gestures to Speech Format

The obtained text should also be converted to an audio file so that visually impaired individuals can understand the message being conveyed. The captured images are compared with the information in the database, and the output is shown in the form of text using PyTesseract (OCR). This text message is then converted to speech format using the eSpeak tool, a text-to-speech (TTS) converter (Sawant & Kumbhar, 2014). The resulting speech message is played through a laptop or computer, as given in Figure 4.

Figure 4: Flowchart to Convert Hand Gestures to Audio Format.
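A minimal sketch of this TTS step follows, using pyttsx3 (listed among the project's modules later in the paper), which drives the eSpeak engine on Linux; the recognized text is a placeholder for the gesture recognizer's output.

```python
# Sketch of text-to-speech playback via pyttsx3, a wrapper that uses
# the eSpeak engine on Linux; recognized_text is a placeholder.
import pyttsx3

engine = pyttsx3.init()
engine.setProperty("rate", 150)  # words per minute, a readable pace

recognized_text = "hello"  # would come from the gesture recognizer
engine.say(recognized_text)
engine.runAndWait()  # block until the audio finishes playing
```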
Objective 4: To Convert Speech to Text Format

Speech is given as input through a microphone on the laptop or computer. The system then recognizes the speech and checks whether the voice is audible and clear. If it is, the system converts the speech to text using a speech-to-text (STT) converter and displays the text on the screen; if it is not, the system shows an error stating that it did not capture the voice properly. Once the text is obtained, it is checked against the database and the corresponding hand-sign images are returned.
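A sketch of this flow is given below, assuming the SpeechRecognition package is what the paper's later mention of a "Google speech recognizer module" refers to (our assumption); microphone input additionally requires PyAudio. The error branches mirror the audible-and-clear check described above.

```python
# Sketch of the speech-to-text step with the SpeechRecognition
# package (assumed); needs PyAudio for microphone access.
import speech_recognition as sr

recognizer = sr.Recognizer()
with sr.Microphone() as source:
    recognizer.adjust_for_ambient_noise(source)  # calibrate for noise
    print("Speak now...")
    audio = recognizer.listen(source)

try:
    # Web-based recognizer; raises if the voice was not clear.
    text = recognizer.recognize_google(audio)
    print("You said:", text)  # then matched against the database
                              # to fetch the hand-sign images
except sr.UnknownValueError:
    print("Error: the system did not capture the voice properly.")
except sr.RequestError as e:
    print("Error: speech service unavailable:", e)
```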
3. BLOCK DIAGRAM

Figure 5: Block Diagram.

In this system, the image of the hand is taken from the web camera, and the captured images are pre-processed to eliminate noise. The features of the images are then extracted and compared with the feature dataset, the image is classified to its correct hand gesture, and it is recognized. With the use of the audio and text datasets, the recognized image is converted into text and speech format, as shown in Figure 5 (Vijayalakshmi & Aarthi, 2016).

In the proposed system, we make use of the MediaPipe algorithm, which classifies hand gestures using the 21 landmarks present in a person's palm. We make use of Python modules such as cv2 for image processing, NumPy to work on arrays, gTTS and pyttsx3 for text-to-speech conversion, pygame to play MP3 files, pytesseract for character recognition, and the Google speech recognizer module to recognize speech and convert it to text. A combined sketch of one frame's path through this pipeline is given below.
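In the sketch below (ours, under the same assumptions as the earlier fragments), recognize() is the hypothetical LDA helper defined above, and the noise-elimination step is shown as a simple Gaussian blur; the authors' actual preprocessing is not specified.

```python
# End-to-end sketch for a single frame: capture -> denoise ->
# landmarks -> classify -> text and speech. recognize() is the
# illustrative LDA helper from the earlier sketch, not real code
# from the paper.
import cv2
import mediapipe as mp
import pyttsx3

hands = mp.solutions.hands.Hands(max_num_hands=1)
engine = pyttsx3.init()
cap = cv2.VideoCapture(0)

ok, frame = cap.read()
if ok:
    # Pre-process: blur to suppress sensor noise before detection.
    frame = cv2.GaussianBlur(frame, (5, 5), 0)
    result = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if result.multi_hand_landmarks:
        lm = result.multi_hand_landmarks[0].landmark
        features = [c for p in lm for c in (p.x, p.y, p.z)]
        label = recognize(features)  # hypothetical LDA classifier
        print(label)                 # text output
        engine.say(label)            # speech output
        engine.runAndWait()
cap.release()
```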
4. CONCLUSION

The hand gesture recognition framework is a smart communication system for deaf and aphonic individuals. It helps them communicate with hearing people in emergency situations and narrows the gap between hearing people and deaf and aphonic people. The techniques aim to assist deaf and aphonic individuals by creating an interface that recognizes hand gestures and converts them into text and speech format, and that also converts voice and text input into hand gestures, making the proposed framework a bidirectional system. It is ordinarily difficult for deaf and aphonic individuals to communicate with other people in society, which can prevent them from realizing their dreams or reaching greater heights in their lives. This framework helps to reduce the communication gap and removes one barrier that deaf and aphonic people face on their journey to success.
REFERENCES

Al-Obodi, A. H., Al-Hanine, A. M., Al-Harbi, K. N., Al-Dawas, M. S., & Al-Shargabi, A. A. (2020). A Saudi sign language recognition system based on convolutional neural networks. Department of Information Technology, College of Computer, Qassim University, Buraydah, Saudi Arabia. https://dx.doi.org/10.37624/IJERT/13.11.2020.3328-3334

Amrutha, K., & Prabu, P. (2021, February). ML based sign language recognition system. In 2021 International Conference on Innovative Trends in Information Technology (ICITIIT) (pp. 1-6). IEEE. https://doi.org/10.1109/ICITIIT51526.2021.9399594

Liu, X., Sacks, J., Zhang, M., Richardson, A. G., Lucas, T. H., & Van der Spiegel, J. (2016). The virtual trackpad: An electromyography-based, wireless, real-time, low-power, embedded hand-gesture-recognition system using an event-driven artificial neural network. IEEE Transactions on Circuits and Systems II: Express Briefs, 64(11), 1257-1261. https://doi.org/10.1109/TCSII.2016.2635674

Sawant, S. N., & Kumbhar, M. S. (2014, May). Real time sign language recognition using PCA. In 2014 IEEE International Conference on Advanced Communications, Control and Computing Technologies (pp. 1412-1415). IEEE. https://doi.org/10.1109/ICACCCT.2014.7019333

Vijayalakshmi, P., & Aarthi, M. (2016, April). Sign language to speech conversion. In 2016 International Conference on Recent Trends in Information Technology (ICRTIT) (pp. 1-6). IEEE. https://doi.org/10.1109/ICRTIT.2016.756954
