0% found this document useful (0 votes)
58 views32 pages

Mini Project Report

The document is a mini project report that describes a text-to-speech converter developed by a group of students. It includes an introduction that describes text-to-speech synthesis and the objectives of developing a simple application. It also includes sections on literature review and analysis, the proposed system, system design, implementation, testing and results, and conclusions. The group developed a useful text-to-speech synthesizer application that converts inputted text to synthesized speech and allows saving it as an mp3 file.

Uploaded by

as4848284
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
58 views32 pages

Mini Project Report

The document is a mini project report that describes a text-to-speech converter developed by a group of students. It includes an introduction that describes text-to-speech synthesis and the objectives of developing a simple application. It also includes sections on literature review and analysis, the proposed system, system design, implementation, testing and results, and conclusions. The group developed a useful text-to-speech synthesizer application that converts inputted text to synthesized speech and allows saving it as an mp3 file.

Uploaded by

as4848284
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 32

lOMoARcPSD|33212006

MINI Project Report

Engineering (Rashtrasant Tukadoji Maharaj Nagpur University)

Scan to open on Studocu

Studocu is not sponsored or endorsed by any college or university


Downloaded by Aman Sharma ([email protected])
lOMoARcPSD|33212006

MINI PROJECT REPORT


ON
TEXT TO SPEECH CONVERTER
VI SEMESTER

COMPUTER ENGINEERING
Submitted by

ANIKET AGHAM
DHAWAL MADANKAR
DIPALI KAMBALE
MANTHAN JADHAV

Under the guidance of

PROF. KOMAL JAISINGHANI

Academic Year 2021-2022


Department of Computer Engineering

ST. VINCENT PALLOTTI COLLEGE OF ENGINEERING AND


TECHNOLOGY
Wardha Road, Gavsi Manapur, Nagpur

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

ST. VINCENT PALLOTTI COLLEGE OF ENGINEERING


AND TECHNOLOGY
Wardha Road, Gavsi Manapur, Nagpur

Department of Computer Engineering

CERTIFICATE

This is to certify that Aniket Agham, Dhawal Madankar, Dipali Kambale and Manthan
Jadhav have completed mini project on “TEXT TO SPEEECH CONVERTER” under my
supervision for partial fulfillment of VI semester, Bachelor of Engineering in COMPUTER
ENGINEERING under RASHTRASANT TUKDOJI MAHARAJ NAGPUR
UNIVEERSITY, NAGPUR.

Dr. S. M. Wanjari Prof. Komal Jaisinghani


Associate Professor Assistant Professor
Head of the Department Project Guide
Computer Engineering Computer Engineering

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

ACKNOWLEDGEMENT

The Title of our Mini project is, “TEXT TO SPEECH CONVERTER”. Any project requires
a lot of hard work, sincerity and systematic work methodologies. We express our gratitude to
our Project Guide, Prof. Komal Jaisinghani, for guiding us in every step of the project.

We would also like to thank Prof. Sunil Wanjari, Head of the Department of Computer
Engineering and all our faculty members who evaluated our project and pointed out the
shortcomings in the projects. They also gave us important feedback for our project which
helped us along the way.

We are also grateful to the Principal, Management of College, Dr. Surendra V. Gole and
Vice-principal, Prof. R. B. Gowardhan for the support in providing us the facilities of
computer lab and other required infrastructure.

Group Members
Aniket Agham
Dhawal Madankar
Dipali Kambale
Manthan Jadhav

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CONTENTS

CHAPTER NO. TITLE PAGE NO


1. INTRODUCTION
1.1 INTRODUCTION
1.2 OBJECTIVES
1.3 PURPOSE
1.4 SCOPE

2. LITERATURE REVIEW & ANALYSIS


2.1 LITERATURE REVIEW
2.2 ANALYSIS

3. PROPOSED SYSTEM
3.1 PLANNING AND SCHEDULING

4. SYSTEM DESIGN
4.1 FLOW OF PROGRAM
4.2 USED MODULES

5. IMPLEMENTATION

6. TESTING & RESULT

7. CONCLUSION & FUTURE SCOPE

8. REFERENCES

PROJECT TEAM MEMBER & PROJECT GUIDE INFORMATION

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

ABSTRACT

A Text-to-speech synthesizer is an application that converts text into spoken word, by


analyzing and processing the text using Natural Language Processing (NLP) and then using
Digital Signal Processing (DSP) technology to convert this processed text into synthesized
speech representation of the text. Here, we developed a useful text-to-speech synthesizer
in the form of a simple application that converts inputted text into synthesized speech and
reads out to the user which can then be saved as an mp3 file. The development of a text to
speech synthesizer will be of great help to people with visual impairment and make making
through large volume of text easier.

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 1
INTRODUCTION

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

1.1 INTRODUCTION

Text-to-speech synthesis -TTS - is the automatic conversion of a text into speech that
resembles, as closely as possible, a native speaker of the language reading that text. Text-to
speech synthesizer (TTS) is the technology which lets computer speak to you. The TTS
system gets the text as the input and then a computer algorithm which called TTS engine
analyses the text, pre-processes the text and synthesizes the speech with some mathematical
models. The TTS engine usually generates sound data in an audio format as the output.

1.2 OBJECTIVES

 To convert text to speech

 To convert text from one language to another

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

1.3 PURPOSE

Our purpose is text to speech translation. The system is helpful for persons having learning
difficulties or visually challenged. It Prevents eye from strain, and user can sit and listen
comfortably, help avoiding an external human translator. And also avoid sharing of trade
secrets with other translators. It will help in widening the trade market and Travelling to
foreign countries and speaking their native language will be made easy mechanical attempts
dating to the eighteenth century.

1.4 SCOPE

 This application is designed to overcome the language barriers.

 The application is designed to overcome language difference.

 The application designed to help visually impaired people.

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 2

LITERATURE REVIEW AND ANALYSIS

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

2.1 LITERATURE REVIEW

Sr. No. Paper Title and its Author Details of Publication Findings

1 Assistive system for product label detection International Journal Texts written in
with voice output for blind users of Research in objects are converted
Purnima C. and Rubesh Kumar T Engineering & into audio.
Advanced Technology
2014

2 Text-to-speech conversion on a personal ER publication Text converted into


computer audio
El-Imam, Y.A.; Banat, K.
3 Text-to-speech conversion in Spanish a IEEE International Text to converted into
complete rule-based synthesis system," Conference Acoustics, audio in Spanish
Acoustics Santos, J. Nombela Speech, and Signal language
processing

4 English Text to Multilingual Speech Blue Eyes Intelligence Text converted into
Translator Engineering audio with three
Using Android Vivek Hanumante, Rubi & Sciences different language
Debnath, Disha Bhattacharjee, Deepti Publication Pvt. Ltd.
Tripathi, Sahadev Roy

2.2 ANALYSIS

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

2.2.1 FUNCTIONAL REQUIREMENTS


 User-definable lexicon

 Language switching within the text

 Support for multiple document types

 Switching among the multiple voices

 Ability to create many audio file at once

 Text spelling check

2.2.2 NON-FUNCTIONAL REQUIREMENTS

Non-functional requirement specifies how the system should behave and that it is a constraint
upon the system behavior. Non-functional requirements of our system are:

 Ease of use

 Flexibility of voice control (set pitch, timbre, pause in the speech)

 Ease of Mark-up Language setting

 Accessibility of technical support

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 3

PROPOSED SYSTEM

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

3.1 PLANNING AND SCHEDULING

TASK NAME START END DURATION (in days)


Project Selection 10-03-2022 11-03-2022 2
Requirement Analysis 12-03-2022 17-03-2022 6
Requirement 18-03-2022 25-03-2022 8
Gathering
Planning 26-03-2022 02-04-2022 8
Design 03-04-2022 10-04-2022 8
Coding 11-04-2022 30-04-2022 20
Testing 01-05-2022 08-05-2022 8
Dispatch 09-05-2022 10-05-2022 2

Table 3.3.1 Planning and Scheduling

Table 3.3.1 Planning and Scheduling

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 4
SYSTEM DESIGN

4.1 FLOW OF PROGRAM

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Below is mentioned the appropriate and overall flow of program. The flow of program starts
with taking input as text followed by selecting language, downloading module, text
processing and finally the output as voice will generate.

Taking Input

Selecting Language

Downloading
Module

Text Processing

Voice Output

4.2 USED MODULES

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

In programming, a module is a piece of software that has a specific functionality.


We have used two different types of modules in our project
1 - Text to text processing module.
2 – Text to voice processing module.
Firstly, the language is detected and the module is downloaded, initially the text is translated
from one language to another and then the text is converted to speech. The modules are taken
from ML kit.

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 5
IMPLEMENTATION

5.1 SOFTWARE AND HARDWARE USED

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

5.1.1 SOFTWARE USED


 Java JDK5 or latest version

 Java Runtime Environment (JRE) 6

 Android SDK

 Android Studio

 Eclipse IDE for Java Developers

 Android Development Tool kit (ADT kit) / Eclipse

5.1.2 HARDWARE USED

 Microsoft Windows 7/8/10 (32 or 64 bit)

 2 GB RAM minimum, 8 GB recommended

 2 GB of available disk space minimum, 4 GB recommended


(500 MB for

IDE + 1.5 GB for Android SDK and emulator system image)

 1280 x 800 minimum screen resolution

 Java Development Kit (JDK) 8

5.3 CODE SNIPPET

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 6
TESTING AND RESULTS

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

6.1 TESTING
The aim of the testing process was to determine all defects in our project. This
program was subjected to a set of test inputs and various observations were made and based
on these observations it will be decided whether the program behaves as expected or not. Our
project went through two levels of testing
6.1.1 Unit testing
6.1.2 Integration testing

6.2 Unit testing


Unit testing is undertaking when a module has been created and successfully
reviewed. In order to test a single module we need to provide a complete environment
i.e. besides the module we would require.

 The procedures belonging to other modules that the modules under the test calls.

 Non-local data structures that module accesses.

 A procedure to call functions of the module under test with appropriate parameters
unit testing was done on each and every module that is described under module
description.

6.3 Integration testing

In this type of testing we test various integration of the project module by providing the input.
The primary objective is to test the module interfaces in order to ensure that no errors are
occurring when one module invokes the other module.

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

6.4 White Box Testing

White box testing is a method of testing software that tests internal structure or working of an
application, as we know the internal working of our project code we use white box testing.

Output Screenshots:

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 7
CONCLUSION AND FUTURE SCOPE

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

7.1Conclusion

As per the goal of this project an attempt is made to show how the app speaks out the any
language text. Here the provision is provided to the user to input the text and he can listen to
his text. The “naturalness” of the synthetic speech needs to be improved for implementing the
expressions of the human beings. By developing such systems, relationship between human
and computer becomes much closer. Thus it helps in overcoming the problem of DIGITAL
DIVIDE.

7.2Future Scope

 Improvement of the smoothness of the sound.


 Inclusion of prosody and the naturalization of the voices like human expressions.
 Reading of special cases like date and number.
 Inclusion of different kinds of voices and graphical faces.
 Import and Export of documents.
 Controlling the reading speed.
 The system can be further extended to include more languages

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

CHAPTER 8
REFERENCES

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

References :

T. Rubesh Kumar, C. Purnima “Assistive System for Product Label Detection with Voice
Output For Blind Users” International Journal of Research in Engineering & Advanced
Technology 2014.
Santos J. Nombela, "Text-to-speech conversion in Spanish a complete rule-based synthesis
system", Acoustics, Speech, and Signal, Processing, IEEE
International Conference on ICASSP '82, vol.7, no.pp.1593-1596, May 1982.
Programming knowledge -> https://www.youtube.com/watch?v=_Q8wtPCyMdo

Project Guide

Downloaded by Aman Sharma ([email protected])


lOMoARcPSD|33212006

And
Project Team Member

1. Project Guide
Prof. Komal Jaisinghani
Assistant Professor

2. Project Team Members

Sr. Name Contact Email Roll No


No.
1 Aniket Agham 7083324685 [email protected]
2 Dhawal Madankar 9673534092 [email protected]
3 Dipali Kambale 9022032822 [email protected] 05
4 Manthan Jadhav 9765251265 [email protected]

Downloaded by Aman Sharma ([email protected])

You might also like