0% found this document useful (1 vote)

177 views17 pages

Optical Character Recognition Based Speech Synthesis: Project Report

This document provides information about an optical character recognition (OCR) based speech synthesis system developed using LabVIEW. The system uses an OCR algorithm to recognize text from scanned documents and then converts the recognized text to speech that can be listened to. It describes the objectives, introduction, VI components, methodology including the OCR and text-to-speech processes, the LabVIEW program, experiments conducted, applications, and conclusion.

Uploaded by

isoi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (1 vote)

177 views17 pages

Optical Character Recognition Based Speech Synthesis: Project Report

Uploaded by

isoi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 17

OPTICAL CHARACTER

RECOGNITION BASED
SPEECH SYNTHESIS

PROJECT REPORT

COURSE
VIRTUAL INSTRUMENTATION
(EEE4035)
GUIDIED BY:
PROF. ABHISHEK G

YASHKARAN SINGH 18BEI0122

APRIL 2019

1
CONTENTS

Serial Number Particulars Page Number

1 Objective 3
2 Introduction 4
3 VI Components 5
4 Methodology 6
5 LabVIEW Program 8
6 Result 9
7 Application 11
8 Conclusion 12
9 References 13

2
OBJECTIVE

Speech signal is more effective means of communication than text because blind and
visually impaired persons can also respond to sounds. Knowledge extraction by just
listening to sounds is a distinctive property. The OCR based speech synthesis system
will significantly improve the degree to which the visually impaired can interact with
their environment as that of a sighted person. This project aims to develop a cost
effective, and user friendly optical character recognition (OCR) based speech
synthesis system. The OCR based speech synthesis system has been developed using
Laboratory virtual instruments engineering workbench (LabVIEW).

Keywords: Optical character recognition, Speech, Synthesis, Recognition, LabVIEW

3
INTRODUCTION

Machine replication of human functions, like reading, is an ancient dream. However,

over the last few decades, machine reading has grown from a dream to reality. Text is
being present everywhere in our day to day life, either in the form of documents
(newspapers, books, mails, magazines etc.) or in the form of natural scenes (signs,
screen, schedules) which can be read by a normal person. Unfortunately, the blind
and visually impaired persons are deprived from such information, because their
vision troubles do not allow them to have access of this textual information which
limits their mobility in unconstrained environments. The OCR based speech synthesis
system will significantly improve the degree to which the visually impaired can
interact with their environment as that of a sighted person .
This project is related to existing research in text detection from general background
or video image, and Bangla optical character recognition (OCR) system. Some
researchers published their efforts on texture-based text detection also. OCR based
speech recognition system using LabVIEW utilizes a scanner to capture the images
of printed or handwritten text, recognize that text and translate the recognized text as
voice output message using Microsoft Speech SDK (Text To Speech).

4
VI COMPONENTS

1. Read soundinput

2. Spectral Measurements

3. Write soundoutput

4. Waveformgraph

5. Array to Clusterblock

6. Path

5
METHODOLOGY

FLOW CHART OF OCR SYSTEM:

7
METHODOLOGY

Optical Character Recognition

8
METHODOLOGY
Optical character recognition (OCR) is the mechanical or electronic translation of images of hand-
written or printed text into machine-editable text [12]. The OCR based system consists of following
process steps:
a) Image Acquisition
b) Image Pre-processing (Binarization)
c) Image Segmentation
d) Matching and Recognition

Image Acquisition: The image has been captured using a digital HP scanner. The flap of the scanner
had been kept open during the acquisition process in order toobtain a uniform black background.
The image had been acquired using the program developed in LabVIEW. The configuration of the
Image has been done with the help of Image create subvi function of LabVIEW. The configuration
of the image means selecting the image type and border size of the image as per the requirement. In
this work 8 bit image with border size of 3 has been used.

Image Pre-processing (Binarization): Binarization is the process of converting a gray scale image (0 to 255
pixel values) into binary image (0 to1 pixel values) by using a threshold value. The pixels lighter than the
threshold are turned to white and the remainder to black pixels. In this work a global thresholding with a
threshold value of 175 has been used to binarize the image i.e. the values of pixel which are from 175 to 255
has been converted to 1 while the of pixel which have gray scale value less than 175 have been converted to
0.

Image Segmentation: The segmentation process consists of line segmentation, word segmentation
9
METHODOLOGY
and finally character segmentation.
1. Line segmentation is the first step of the segmentation process. It takes the array of the image as an input
and scans the image horizontally to find first ON pixel and remember that coordinate as y1.
2. In the word segmentation process the line segmented images have been vertically scanned to find first ON
pixel. When this happen the system remember the coordinate of this point as x1. This is the starting
coordinate for the word.
3. Character segmentation has been performed by scanning the word segmented image vertically. This
process is different from the word segmentation in following two ways: i) Number of horizontal OFF pixels
between the different characters are less in comparison to number of OFF pixels between the words ii) Total
number of characters and their order in the word has been determined so as to reproduce the word correctly
during speech synthesis.

1
0
METHODOLOGY

Matching and Recognition: In this process, correlation between stored templates and segmented character
has been obtained by using correlation VI. The correlation VI determines the correlation between segmented
character and stored templates of each character. The value of the highest correlation recognizes a particular
character. In this way in order to recognize the character every segmented character has been compared with
the predefined data stored in the system. Since same font size has been used for recognition, a unique match
for the each character has been obtained. Figure 6 shows the LabVIEW program of correlation between two
images.

Text To Speech Synthesis:

In text to speech module text recognised by OCR system will be the inputs of speech synthesis
system which is to be converted into speech in .wav file format and creates a wave file named
output wav, which can be listen by using wave file player. Two steps are involved in text to speech
synthesis:

1
1
METHODOLOGY
i) Text to speech conversion:

ii) Play speech in wave file format:

1
2
LabVIEW PROGRAM

1. BLOCKDIAGRAM

2. FRONTPANEL

8
Experiments have been performed to test the proposed system developed using
LabVIEW 7.1 version. The developed OCR based speech synthesis system
has two steps:
a. Optical Character Recognition
b. Speech Synthesis

Step 1. The scanner scans the printed text and the system reads the image using
IMAQ ReadFile and display the image by using IMAQ WindDraw function
of the LabVIEW
Step 2. In this step binarization of the image has been done with a threshold of
175 and the resulting image.
Step-3.-In this step line segmentation of thresholded image has been done.
Step 4. In this step words have been segmented from the line.
Step 5. In this step character segmentation has been performed and all the chacter
in word image window have been segmentated. The segmenatation of first
three characters of word ”Optical”
Step 6. Finally the output of OCR system is in text format which has been stored
in a computer system. The result of recognized text can also be shown on
Front pane

A wave file output.wav is created containing text converted into speech which
10
can listen using wave file player. The waveform will vary according to the
different text from OCR output in the text box and can be listened on the
speaker.

11
APPLICATION

 The uses of OCR vary across different fields. One widely known OCR
application is in banking, where OCR is used to process checks without
human involvement. A check can be inserted into a machine, the writing
on it is scanned instantly, and the correct amount of money is transferred.
This technology has nearly been perfected for printed checks, and is fairly
accurate for handwritten checks as well, though it occasionally requires
manual confirmation. Overall, this reduces wait times in many banks.

 The blind source separation technique can be used to separate the source
from all other types of noise. This technique can also be used to extract
audio when we have no idea about the sources. For example, if three
people are talking at the same time, the audio signal of each person can be
extracted as separate audio signals.

 Signal separation using these blind techniques has found many

applications in acoustics, where different sound sources are recorded
simultaneously either with individual microphones or microphone arrays.

 It can also be used for Undetermined Blind Signal Separation, Blind

Source Separation for Noise Reduction in Mobile, Speech Separation with
Microphone Arrays, Blind Signal Separation using Particle Swarm
Optimization and Speech Separation and Beamforming

11
CONCLUSION

In this report, an OCR based speech synthesis system (which can be used as a good
mode of communication between people) has been discussed. The system has been
implemented on LabVIEW 7.1 platform. The developed system consists of OCR and
speech synthesis. In OCR printed or written character documents have been scanned
and image has been acquired by using IMAQ Vision for LabVIEW. The different
characters have been recognized using segmentation and correlation based methods
developed in LabVIEW. In second section recognized text has been converted into
speech using Microsoft Speech Object Library (Version 5.1). The developed OCR
based speech synthesis system is user friendly, cost effective and gives the result in
the real time. Moreover, the program has the required flexibility to be modified easily
if required.

12
REFERENCES

https://www.slideshare.net/BharatThakur1/ocr-speech-using-labview
https://www.nr.no/~eikvil/OCR.pdf
https://en.wikipedia.org/wiki/Optical_character_recognition
https://supplychainminded.com/3-practical-applications-ocr-technology-business-easier/

Android Text To Speech Documentation by Paige
No ratings yet
Android Text To Speech Documentation by Paige
27 pages
Devel Projevct
No ratings yet
Devel Projevct
59 pages
Advanced Image To Speech Conversion
No ratings yet
Advanced Image To Speech Conversion
46 pages
Ocr Gtts PDF
No ratings yet
Ocr Gtts PDF
53 pages
Dip PDF
No ratings yet
Dip PDF
30 pages
Learning OpenCV 3 Application Development
From Everand
Learning OpenCV 3 Application Development
Samyak Datta
No ratings yet
Semantic Computing
From Everand
Semantic Computing
Phillip C.-Y. Sheu
No ratings yet
Technical Data Sheet For: AUMA India Multi-Turn Worm Gearboxes
No ratings yet
Technical Data Sheet For: AUMA India Multi-Turn Worm Gearboxes
2 pages
Final PPT 8th Sem33
No ratings yet
Final PPT 8th Sem33
12 pages
5 Year Procurement Projection 30032023
No ratings yet
5 Year Procurement Projection 30032023
26 pages
Sign Board Reader
No ratings yet
Sign Board Reader
22 pages
Open Source Computer Vision
No ratings yet
Open Source Computer Vision
79 pages
On Text To Speech Conversion Using OCR
50% (2)
On Text To Speech Conversion Using OCR
26 pages
Presentation 4
No ratings yet
Presentation 4
17 pages
Spring 2.5 Aspect Oriented Programming
From Everand
Spring 2.5 Aspect Oriented Programming
Massimiliano DessÃ¬
No ratings yet
Leslie Mashonga T2082163F
No ratings yet
Leslie Mashonga T2082163F
9 pages
Ocr Gtts
No ratings yet
Ocr Gtts
49 pages
Lesson7 ASIAN-REGIONALISM
No ratings yet
Lesson7 ASIAN-REGIONALISM
18 pages
Speech Recognition Using Python
No ratings yet
Speech Recognition Using Python
49 pages
6.python Text To Speech
No ratings yet
6.python Text To Speech
2 pages
Text To Speech Conversion
No ratings yet
Text To Speech Conversion
75 pages
Review-3: Eee2004-Measurement & Instrumentation Dr. Gunabalan R
No ratings yet
Review-3: Eee2004-Measurement & Instrumentation Dr. Gunabalan R
23 pages
7143cem Portfolio November2024 Brief
No ratings yet
7143cem Portfolio November2024 Brief
12 pages
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
No ratings yet
Hand Written Character Recognition Using Neural Network: BACHELOR OF ENGINEERING (Computer Engineering)
46 pages
Raspberry Pi
No ratings yet
Raspberry Pi
21 pages
Katalog Siemens
No ratings yet
Katalog Siemens
80 pages
Learn OpenCV with Python by Examples
From Everand
Learn OpenCV with Python by Examples
James Chen
No ratings yet
SGS-Supplier Code of Conduct
No ratings yet
SGS-Supplier Code of Conduct
10 pages
A12REVIEW
No ratings yet
A12REVIEW
18 pages
Novellini KUADRAH13 Brochure
No ratings yet
Novellini KUADRAH13 Brochure
6 pages
Optical Character Recognition Project Report
No ratings yet
Optical Character Recognition Project Report
71 pages
Image To Speech Conversion in Multi Languages
No ratings yet
Image To Speech Conversion in Multi Languages
31 pages
First Review 1MS21LVS06
No ratings yet
First Review 1MS21LVS06
12 pages
Blind Reader: Project Guide:Dr. Jayanand Gawande
No ratings yet
Blind Reader: Project Guide:Dr. Jayanand Gawande
8 pages
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
No ratings yet
Optical Character Recognition (Ocr) : Karan Panjwani T.E - B, 68 Guided By: Prof. Shalini Wankhade
24 pages
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
No ratings yet
Jagruthi Institute of Engineering and Technology: Optical Character Recognition
28 pages
Math El
No ratings yet
Math El
17 pages
Text To Speech
No ratings yet
Text To Speech
9 pages
Optical Character Recognition Techniques
No ratings yet
Optical Character Recognition Techniques
6 pages
Text To Speech Using Labview
No ratings yet
Text To Speech Using Labview
12 pages
Text Extraction From Digital Images With Text To Speech Conversion and Language Translation
No ratings yet
Text Extraction From Digital Images With Text To Speech Conversion and Language Translation
3 pages
Tooooo
No ratings yet
Tooooo
92 pages
JEEMAINJAN AdmitCard PDF
No ratings yet
JEEMAINJAN AdmitCard PDF
1 page
Ocr & Cbir
No ratings yet
Ocr & Cbir
13 pages
OCR Presentation
No ratings yet
OCR Presentation
16 pages
Lit - Ch01 - Kimmel Et Al. 2013 - Ch13-2
No ratings yet
Lit - Ch01 - Kimmel Et Al. 2013 - Ch13-2
28 pages
Survey Paper Image Reader For Blind Pers
No ratings yet
Survey Paper Image Reader For Blind Pers
3 pages
Assignment of Trademark
No ratings yet
Assignment of Trademark
3 pages
Image To Speech Conversion PDF
No ratings yet
Image To Speech Conversion PDF
7 pages
Practical Assignment 01: OCR - Optical Character Recognition
No ratings yet
Practical Assignment 01: OCR - Optical Character Recognition
16 pages
Electric Bill
No ratings yet
Electric Bill
1 page
Dataflow and Reactive Programming Systems
From Everand
Dataflow and Reactive Programming Systems
Matt Carkci
No ratings yet
Real-Time Braille To Speech Conversion: Project Reference No.: 41S - Be - 1713
No ratings yet
Real-Time Braille To Speech Conversion: Project Reference No.: 41S - Be - 1713
3 pages
Erosion and Erosion-Corrosion of Metals: A.V. Levy
No ratings yet
Erosion and Erosion-Corrosion of Metals: A.V. Levy
12 pages
Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania
No ratings yet
Optical Character Recognition: Divyanshu Sagar Ahmed Zaid Faizee Vidyut Singhania
11 pages
Adapting A Tts System To A Reading Machine For The Blind
No ratings yet
Adapting A Tts System To A Reading Machine For The Blind
4 pages
AI Based Reading System For Blind Using OCR
No ratings yet
AI Based Reading System For Blind Using OCR
4 pages
KH
No ratings yet
KH
7 pages
Barton Liquid Level (Mechanical)
No ratings yet
Barton Liquid Level (Mechanical)
36 pages
Raspberry Pi Based Reader For Blind People
No ratings yet
Raspberry Pi Based Reader For Blind People
4 pages
SL NO. Name Usn Number Roll No
No ratings yet
SL NO. Name Usn Number Roll No
10 pages
Ballistic Limit Evaluation For Impact of Pistol Projectile 9 MM Luger On Aircraft Skin Metal Plate
No ratings yet
Ballistic Limit Evaluation For Impact of Pistol Projectile 9 MM Luger On Aircraft Skin Metal Plate
10 pages
GlowCorp Case
No ratings yet
GlowCorp Case
25 pages
Optical Character Recognizer: Team Member
No ratings yet
Optical Character Recognizer: Team Member
7 pages
Text To Speech Conversion
No ratings yet
Text To Speech Conversion
4 pages
PRE Synopsis
No ratings yet
PRE Synopsis
3 pages
A Smart Reader For Visually Impaired People Using Raspberry PI
No ratings yet
A Smart Reader For Visually Impaired People Using Raspberry PI
5 pages
ToneLab LE Manual
No ratings yet
ToneLab LE Manual
128 pages
"Camera Based Product Information Reading For Blind People": Priyanka Patil, Sonali Solat, Shital Hake
No ratings yet
"Camera Based Product Information Reading For Blind People": Priyanka Patil, Sonali Solat, Shital Hake
4 pages
Smart Reader For Blind People
No ratings yet
Smart Reader For Blind People
3 pages
Voice Assisted Text Reading System For Visually Impaired Persons
No ratings yet
Voice Assisted Text Reading System For Visually Impaired Persons
6 pages
Arquitectura de Computadores Varios
No ratings yet
Arquitectura de Computadores Varios
6 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
7 pages
HashiCorp Certified Terraform Associate (003) WhizCard
No ratings yet
HashiCorp Certified Terraform Associate (003) WhizCard
21 pages
Green and Black Minimalist Resume
No ratings yet
Green and Black Minimalist Resume
2 pages
100% Online: MSC Project Management Offered in Exclusive Partnership With Robert Kennedy College
No ratings yet
100% Online: MSC Project Management Offered in Exclusive Partnership With Robert Kennedy College
7 pages
Division 2 League Adm - Letter Mitunguu
No ratings yet
Division 2 League Adm - Letter Mitunguu
1 page
Vaporetto Lecoaspira 710
No ratings yet
Vaporetto Lecoaspira 710
15 pages
Glass-Lined Steel Process Equipment
100% (1)
Glass-Lined Steel Process Equipment
21 pages
Chan V. Honda Motor Co., Ltd. and Honda Phil.: Rights, Regulations and Remedies) in Relation To Sec 170
No ratings yet
Chan V. Honda Motor Co., Ltd. and Honda Phil.: Rights, Regulations and Remedies) in Relation To Sec 170
3 pages
MII Declaration Format
No ratings yet
MII Declaration Format
2 pages
Sample Whistle Blower Policy
No ratings yet
Sample Whistle Blower Policy
2 pages
Text To Speech Conversion Using Raspberry - PI
No ratings yet
Text To Speech Conversion Using Raspberry - PI
3 pages
Curriculum Reform
No ratings yet
Curriculum Reform
27 pages
Image To Text and Speech Conversion
No ratings yet
Image To Text and Speech Conversion
3 pages
ABB Azipod Brochure Lores
No ratings yet
ABB Azipod Brochure Lores
8 pages
Audio Visual Speech Recognition: Advancements, Applications, and Insights
From Everand
Audio Visual Speech Recognition: Advancements, Applications, and Insights
Fouad Sabry
No ratings yet
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
From Everand
Optical Character Recognition: Unlocking the Power of Computer Vision for Optical Character Recognition
Fouad Sabry
No ratings yet
Human Visual System Model: Understanding Perception and Processing
From Everand
Human Visual System Model: Understanding Perception and Processing
Fouad Sabry
No ratings yet
Optical Character Recognition: Fundamentals and Applications
From Everand
Optical Character Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet

Optical Character Recognition Based Speech Synthesis: Project Report

Uploaded by

Optical Character Recognition Based Speech Synthesis: Project Report

Uploaded by

OPTICAL CHARACTER

YASHKARAN SINGH 18BEI0122

Serial Number Particulars Page Number

Keywords: Optical character recognition, Speech, Synthesis, Recognition, LabVIEW

Machine replication of human functions, like reading, is an ancient dream. However,

FLOW CHART OF OCR SYSTEM:

Optical Character Recognition

Text To Speech Synthesis:

ii) Play speech in wave file format:

 Signal separation using these blind techniques has found many

 It can also be used for Undetermined Blind Signal Separation, Blind

You might also like