Dsaa Project Initial Proposal GROUP-11 Members: Vaishnavi NV, Syed Jahangir Peeran, V Tejkiran, V Sai Rathan, Srilekha N

The document proposes extracting text from images using optical character recognition and machine learning, then converting the text to speech. It would allow applications like reading bedtime stories aloud or speaking street names to blind users. Challenges include recognizing broken text and improving accuracy through machine learning and grammar/spelling checks. The input is a photo with text, which would undergo text detection, character segmentation/classification, word merging, correctness checking, and final text-to-speech output.

Uploaded by

Harshitha Reddy

DSAA PROJECT

INITIAL PROPOSAL
GROUP-11
Members: Vaishnavi NV, Syed Jahangir Peeran, V Tejkiran, V Sai Rathan, Srilekha N.

Aim
To extract meaningful text from a given image using Optical Character Recognition
(OCR) combined with machine learning, and then convert the extracted text into speech.

Applications
(i) Software that reads bedtime stories aloud or serves as a textbook reader for students.
(ii) An OCR-based app, with access to the camera and speech output, that speaks the
street name or door number to a blind person.

Challenges
(i) Recognising text even when it appears broken in the image.
(ii) Improving the accuracy of OCR by adding a machine learning algorithm.
(iii) Improving accuracy further by running the recognised text through a grammar/spelling
checker, so that each word and sentence is predicted better.
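
As an illustration of the spelling check in (iii), the following is a minimal sketch and not the group's implementation. It assumes the checker is a simple dictionary lookup built on Python's standard difflib module; the vocabulary, the function names and the 0.6 similarity cutoff are hypothetical choices made for this example.

```python
import difflib

# Hypothetical vocabulary for illustration; a real system would load a
# full dictionary or a domain-specific word list.
VOCABULARY = ["main", "street", "door", "number", "station", "road"]


def correct_word(word, vocabulary=VOCABULARY, cutoff=0.6):
    """Return the closest vocabulary word, or the word unchanged if none is close.

    The input is lower-cased before matching, so a corrected word is
    always returned in lower case.
    """
    matches = difflib.get_close_matches(word.lower(), vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else word


def correct_text(text):
    """Correct each whitespace-separated word of the OCR output independently."""
    return " ".join(correct_word(w) for w in text.split())


if __name__ == "__main__":
    # OCR often confuses visually similar characters such as "i" and "1".
    print(correct_text("ma1n stre3t"))
```

This word-by-word lookup ignores grammar and context; a sentence-level checker would be needed to cover the grammar part of the challenge.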

Input
A photo containing written text is sent as the input for processing.

Processing
(i) Detect only the text regions in the image using the sliding-window technique.
(ii) Segment the characters.
(iii) Classify the characters.
(iv) Combine the letters into words and merge the words into sentences.
(v) Check the correctness of the resulting text.
(vi) Convert the obtained text to speech.
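
Steps (i) and (ii) can be sketched as follows. This is a rough illustration under stated assumptions, not the proposed system: the image is assumed to be already binarised into a list of rows with 1 for ink and 0 for background, the "classifier" is a toy ink-fraction threshold standing in for a trained text/non-text model, and character segmentation by blank columns is one simple option that the proposal does not name. All function names and parameters are hypothetical.

```python
def sliding_windows(width, height, win_w, win_h, stride):
    """Yield (x, y, win_w, win_h) boxes, left to right and top to bottom."""
    for y in range(0, height - win_h + 1, stride):
        for x in range(0, width - win_w + 1, stride):
            yield (x, y, win_w, win_h)


def ink_fraction(image, box):
    """Toy stand-in for a trained classifier: share of ink pixels in the box."""
    x, y, w, h = box
    ink = sum(image[r][c] for r in range(y, y + h) for c in range(x, x + w))
    return ink / (w * h)


def detect_text_windows(image, win_w, win_h, stride, threshold=0.2):
    """Step (i): keep the windows whose score reaches the threshold."""
    height, width = len(image), len(image[0])
    return [
        box
        for box in sliding_windows(width, height, win_w, win_h, stride)
        if ink_fraction(image, box) >= threshold
    ]


def segment_characters(region):
    """Step (ii): split a region into (start, end) column spans at blank columns."""
    spans, start = [], None
    n_cols = len(region[0])
    for c in range(n_cols):
        has_ink = any(row[c] for row in region)
        if has_ink and start is None:
            start = c                  # a character begins at this column
        elif not has_ink and start is not None:
            spans.append((start, c))   # a blank column ends the character
            start = None
    if start is not None:              # character touching the right edge
        spans.append((start, n_cols))
    return spans


if __name__ == "__main__":
    # Two 2x2 "characters" separated by one blank column.
    image = [
        [0, 0, 0, 0, 0, 0, 0, 0],
        [0, 1, 1, 0, 1, 1, 0, 0],
        [0, 1, 1, 0, 1, 1, 0, 0],
        [0, 0, 0, 0, 0, 0, 0, 0],
    ]
    print(detect_text_windows(image, win_w=4, win_h=4, stride=2, threshold=0.3))
    print(segment_characters(image))
```

In a real pipeline the threshold test would be replaced by the learned classifier, and each column span would be cropped and passed to the character classifier of step (iii).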

Output
A voice signal that speaks the text produced by the processing steps above.

References
(i) Chen, Huizhong, et al. "Robust Text Detection in Natural Images with Edge-Enhanced
Maximally Stable Extremal Regions." 2011 18th IEEE International Conference on Image
Processing (ICIP). IEEE, 2011.
(ii) https://www.coursera.org/learn/machine-learning
