RRL
According to Sunita Chavan et al. (2023), the primary goal of the research is to help visually impaired
people recognize text. This goal is achieved by building a module that converts text into speech
and plays it through a connected speaker or headphones. The image is captured with the system's
webcam, and the text is extracted from it by an application built into the system. The extracted text
is then recognized word by word and spoken aloud through headphones or the system's audio output.
The Python Imaging Library (PIL), available for the Python programming language, is used to perform
basic image operations such as creating thumbnails, resizing, rotating, and converting between file formats.
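The four image operations named above can be illustrated with a minimal sketch. It assumes Pillow, the maintained fork of PIL, is installed; the helper name basic_operations, the blank test image, and the chosen sizes are hypothetical and are not taken from Chavan et al. (2023).

```python
from io import BytesIO

from PIL import Image  # assumption: Pillow, the maintained fork of PIL


def basic_operations(img):
    """Apply the four operations named in the text to one image."""
    # Thumbnail: shrinks in place and keeps the aspect ratio, so work on a copy.
    thumb = img.copy()
    thumb.thumbnail((64, 64))

    # Resize: returns a new image with exactly the requested size.
    resized = img.resize((320, 240))

    # Rotate: expand=True grows the canvas so the rotated image is not cropped.
    rotated = img.rotate(90, expand=True)

    # Format conversion: save as JPEG into memory, then reopen the result.
    buffer = BytesIO()
    img.convert("RGB").save(buffer, format="JPEG")
    buffer.seek(0)
    converted = Image.open(buffer)

    return thumb, resized, rotated, converted


# Hypothetical stand-in for a captured page: a blank 640x480 white image.
page = Image.new("RGB", (640, 480), "white")
thumb, resized, rotated, converted = basic_operations(page)
```

In a reading aid, steps like these would typically be used to normalize a captured frame before it is passed to text recognition.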
Image Processing Based on Optical Character Recognition with Text to Speech for Visually
Impaired
According to Vijayanarayanan et al. (2023), the system described in Image Processing Based on Optical
Character Recognition with Text to Speech for Visually Impaired allows users to hear the contents of
text images rather than read them. It combines optical character recognition (OCR) with a text-to-speech
(TTS) synthesizer and uses a camera as its input. The main problem that visually impaired people face
today is that they cannot recognize text on their own, so they must rely on others for daily tasks such as
reading newspapers, sending mail, and getting book recommendations. The project's ultimate goal is to
help blind and visually impaired readers identify text. When written text is placed in front of the webcam,
the device takes a picture, extracts the text from it, and reads the text aloud through PC speakers or
headphones. Text-to-speech (TTS) refers to a computer's capacity to generate spoken words from written
text. Stated differently, text-to-speech software is a voice synthesizer that voices text naturally in real time.
The paper explains the design, implementation, and test results of the device. The device has two
modules: an image processing module and a voice module.
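The flow described above (capture a picture, extract the text, read it aloud) can be sketched as a small pipeline. This is only an illustration of the two-module structure, not the implementation of Vijayanarayanan et al. (2023): the class ReadingDevice and its fields are hypothetical names, and the camera, OCR engine, and speech synthesizer are replaced by stand-in functions so the sketch runs without any hardware or third-party library.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ReadingDevice:
    """Hypothetical wiring of the image processing and voice modules."""

    capture: Callable[[], bytes]       # image module: grab a picture
    recognize: Callable[[bytes], str]  # image module: OCR on the picture
    speak: Callable[[str], None]       # voice module: text-to-speech output

    def read_aloud(self) -> str:
        """Capture one picture, extract its text, and speak it."""
        image = self.capture()
        text = self.recognize(image).strip()
        if text:  # stay silent when no text was found
            self.speak(text)
        return text


# Stand-ins (assumptions): a real device would call a webcam, an OCR
# engine, and a speech synthesizer here.
spoken = []
device = ReadingDevice(
    capture=lambda: b"fake-image-bytes",
    recognize=lambda image: "  Hello world \n",
    speak=spoken.append,
)
result = device.read_aloud()
```

Passing the three steps in as functions keeps the image processing and voice modules independent, so either one could be replaced or tested on its own.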