(10 13) Voice
Research Article
Volume-1 | Issue-1 | Jan-Jun-2024
Journal of Image Processing and Image Restoration
Double Blind Peer Reviewed Journal
DOI: https://doi.org/10.48001/JoIPIR
Monith M1, Punith Kumar N1, Naveen Kumar R1, Lokesh B S1, Raghunath B H1*
1 Department of Electronics and Communication Engineering, Acharya Institute of Technology, Bengaluru, Karnataka, India
* Corresponding Author’s Email: [email protected]
ARTICLE HISTORY:
Received: 14th Dec, 2023
Revised: 18th Jan, 2024
Accepted: 28th Jan, 2024
Published: 9th Feb, 2024
KEYWORDS: Aphonic people, Communicate, Database, Hand gesture, Voice conversion
ABSTRACT: Aphonia, a condition resulting in the loss of voice, presents significant challenges in interpersonal interactions. This project proposes a dual-pronged approach involving hand gesture recognition and voice conversion techniques to facilitate effective communication for aphonic individuals. The integration of real-time hand gesture recognition provides an alternative means of expressing ideas and emotions. By capturing and translating hand gestures into textual or auditory output, this approach offers a versatile mode of communication. Additionally, advanced voice conversion algorithms are employed to synthesize natural and intelligible speech from typed or selected text. This innovative coupling of technologies empowers aphonic individuals to engage in fluid conversations, fostering improved social interactions and enhancing their overall quality of life. A webcam is used to communicate with deaf and aphonic people. When modalities of communication such as speech are unavailable, the human hand is the preferred option. Hand gestures, which convey concepts through different hand shapes and finger alignments, enable human-machine interaction. The purpose of this work is to develop a hand gesture detection model and translate the results to text and audio formats. The model also responds to user voice commands and displays hand signs from the database.
converted to text and audio format. Noise will be reduced as a result of dimensionality reduction, and the system will work with high accuracy, as shown in Figure 3.
Objective 4: To Convert Speech to Text Format
Speech is given as input through the microphone present on the laptop or computer. The system then recognizes the speech and checks whether the voice was audible and clear. If yes, it converts the speech to text using a speech-to-text (STT) converter and displays the text on the screen. If not, it displays an error stating that the system did not capture the voice properly. Once the text is obtained, the system checks it against the database and returns the corresponding hand-sign images.
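The flow described for Objective 4 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name speech_to_signs, the SIGN_DATABASE dictionary, and the image paths are hypothetical, and the STT converter is passed in as a plain callable because the paper does not name the engine used. The sketch assumes the database maps single words to hand-sign images.

```python
import string
from typing import Callable, Dict, Optional

# Hypothetical database: maps a lowercase word to the path of its hand-sign image.
SIGN_DATABASE: Dict[str, str] = {
    "hello": "signs/hello.png",
    "thank": "signs/thank.png",
    "you": "signs/you.png",
}


def speech_to_signs(
    audio: bytes,
    transcribe: Callable[[bytes], Optional[str]],
    database: Optional[Dict[str, str]] = None,
) -> dict:
    """Convert captured speech to text, then look up hand-sign images.

    `transcribe` stands in for the STT converter (an assumption); it should
    return the recognized text, or None/empty if the voice was not clear.
    """
    if database is None:
        database = SIGN_DATABASE

    text = transcribe(audio)

    # Voice not audible or not clear: report an error instead of continuing.
    if text is None or not text.strip():
        return {
            "text": None,
            "images": [],
            "error": "The system did not capture the voice properly.",
        }

    # Check each recognized word against the database, ignoring case
    # and surrounding punctuation.
    images = []
    for word in text.split():
        key = word.strip(string.punctuation).lower()
        if key in database:
            images.append(database[key])

    return {"text": text, "images": images, "error": None}


if __name__ == "__main__":
    # A fake transcriber replaces the microphone and STT engine for illustration.
    result = speech_to_signs(b"raw-audio", lambda audio: "Hello, thank you")
    print(result["text"])
    print(result["images"])
```

In a working system, the callable passed as transcribe would wrap the actual microphone capture and STT engine, and words missing from the database would likely need a fallback such as fingerspelling; both are outside what this sketch covers.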
3. BLOCK DIAGRAM