0% found this document useful (0 votes)
73 views5 pages

Speech Recognition: A Seminar Report On

This document is a seminar report on speech recognition submitted in partial fulfillment of a bachelor's degree in computer science and engineering. It includes an introduction to speech recognition, a literature review on speech recognition processes and system types, an analysis of speech recognition algorithms like hidden Markov models and neural networks. It discusses popular speech recognition software, the advantages and disadvantages of speech recognition technology, and its applications. The conclusion reflects on the progress made in speech recognition and the future scope for improvement.

Uploaded by

Nilesh Sharma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
73 views5 pages

Speech Recognition: A Seminar Report On

This document is a seminar report on speech recognition submitted in partial fulfillment of a bachelor's degree in computer science and engineering. It includes an introduction to speech recognition, a literature review on speech recognition processes and system types, an analysis of speech recognition algorithms like hidden Markov models and neural networks. It discusses popular speech recognition software, the advantages and disadvantages of speech recognition technology, and its applications. The conclusion reflects on the progress made in speech recognition and the future scope for improvement.

Uploaded by

Nilesh Sharma
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 5

A

Seminar Report
on
Speech Recognition
Submitted in partial fulfillment for the award of degree of
BACHELOR OF TECHNOLOGY
In
Computer Science & Engineering

Coordinator: Submitted By:


Mr. Loveleen Kumar ESHA JAIN
18EGJCS035

Submitted to:
Dr. S. S. Shekhawat
Head of Deptt.

Department of Computer Science & Engineering


GLOBAL INSTITUTE OF TECHNOLOGY
JAIPUR (RAJASTHAN)-302022
SESSION: 2021-22
Acknowledgement
I
I take this opportunity to express my deep sense of gratitude to my coordinator Mr.Loveleen Kumar,
Assistant Professor Department of Computer Science and Engineering, Global Institute of Technology,
Jaipur, for his valuable guidance and cooperation throughout the Practical Training work. He provided
constant encouragement and unceasing enthusiasm at every stage of the Practical Training work.
We are grateful to our respected Dr. I. C. Sharma, Principal GIT for guiding us during Practical
Training period
We express our indebtedness to Dr. S. S. Shekhawat, Head of Department of Computer Science and
Engineering, Global Institute of Technology, Jaipur for providing me ample support during my
Practical Training period.
Without their support and timely guidance, the completion of our Practical Training would have
seemed a farfetched dream. In this respect we find ourselves lucky to have mentors of such a great
potential.

Place: GIT, Jaipur

Esha Jain
18EGJCS035
B.Tech. VII Semester, IV Year, CSE

II
Abstract

Language is man's most important means of communication and speech its primary
medium. Spoken interaction both between human interlocutors and between humans and
machines is inescapably embedded in the laws and conditions of Communication, which
comprise the encoding and decoding of meaning as well as the mere transmission of
messages over an acoustical channel. Here we deal with this interaction between the man
and machine through synthesis and recognition applications.
Speech recognition, involves capturing and digitizing the sound waves, converting them to
basic language units or phonemes, constructing words from phonemes, and contextually
analyzing the words to ensure correct spelling for words that sound alike. Speech
Recognition is the ability of a computer to recognize general, naturally flowing
utterances from a wide variety of users. It recognizes the caller's answers to move
along the flow of the call.
Emphasis is given on the modeling of speech units and grammar on the basis of Hidden
Markov Model& Neural Networks. Speech Recognition allows you to provide input to an
application with your voice. The applications and limitations on this subject enlighten the
impact of speech processing in our modern technical field.
While there is still much room for improvement, current speech recognition systems have remarkable
performance. We are only humans, but as we develop this technology and build remarkable changes
we attain certain achievements. Rather than asking what is still deficient, we ask instead what should
be done to make it efficient.
Table Of Content
 Chapter 1: Introduction III
o 1.1 Introduction…………………………………………………………..
……..1
o 1.2 Speech Recognition…………………..…………...…….………………….1
 Chapter 2: Literature Survey
o 2.1 Speech Recognition Process…………………………...………….………..9
o 2.2 Structure of Standard Speech Recognition System….…...……….……...10
o 2.3 Types of Speech Recognition System……………………...…….………13
 Chapter 3: System Analysis
o 3.1 Speech Recognition Algorithms………………………...…………..……19
o 3.1.1 Dynamic Time Warping………………….…….……...……….….……19
o 3.1.2 Hidden Markov Model…………………………………..………….…..19
o 3.1.3 Neural Network………………………………………..………………..20
 Chapter 4: Discussion
o 4.1 Speech Recognition Softwares…………………………...……………….21
o 4.2 Advantages &Disadvantages……………………………………………...24
o 4.2.1 Advantages.……………………………………………..……………....24
o 4.2.2 Disadvantages…………………………………………...………………26
o 4.3 Applications………………………………………………...……………..29

 Chapter 5: Conclusion & Future Scope


o 5.1 Conclusion………………………………………………..……………....31
o 5.2 Future Scope……………………………………………......……….…….31
LIST OF FIGURES
IV

1.1 Speech Recognition ……………………………………………………………….….2


2.1 Typical Speech Recognition System …………………………………………….......4
2.2 Signal analysis converts raw speech to speech frames. …………………………….11
2.3 Acoustic models: template and state representations …………………………….…12
2.4 The alignment path with the best total score identifies the word sequence
and segmentation ……………………………………………………………….…...14
3.1 Simple HMM with two states & two output symbols..……………………….……..19
3.2 Unit activations for neural network ……………………………………………..…..13
4.1 Julius SR Engine Interface ………………………………………………………..…21
4.2 Google Now Interface ……………………………………………………………….22
4.3 Dragon Naturally Speaking Interface ………………………………………..….…..23
4.4 Windows Speech Recognition Interface……………………………………….……. 24

You might also like