0% found this document useful (0 votes)

81 views23 pages

Team: Mr. Rahul Kr. Singh MR - Hitesh Kumar It Vii Sem

This document discusses speech recognition technology. It provides an overview of what speech recognition is, how it works, challenges, applications, and key players in the market. Speech recognition involves converting speech to text using algorithms to analyze acoustic signals. It allows computers to understand and respond to spoken commands and questions. However, there are still weaknesses like environmental noise, determining word boundaries, and recognizing homonyms. The document also explores how speech recognition may enable applications like universal translation and hands-free computing in the future.

Uploaded by

Hitesh Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

81 views23 pages

Team: Mr. Rahul Kr. Singh MR - Hitesh Kumar It Vii Sem

Uploaded by

Hitesh Kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 23

TEAM: Mr. RAHUL KR.

SINGH
Mr.HITESH KUMAR
IT VII SEM
The Computer of the Future will TALK,
LISTEN, UNDERSTAND and RESPOND

One of them is the Apple Macintosh of today.

Apple’s Speech Recognition and Speech Synthesis
Technologies now give speech-savvy applications the
power to carry out your voice commands and even
speak back to you in plain English.
Speech recognition is the process of converting a
speech signal to a sequence of words, by means of an
algorithm implemented as a computer program.

Voice Verification or speaker recognition is a related

process that attempts to identify the person speaking .
 Speech recognition is the process of converting an
acoustic signal, captured by a microphone or a
telephone, to a set of words.

The recognized words can be the final results, as for

applications such as commands & control, data entry,
and document preparation.

They can also serve as the input to further linguistic

processing in order to achieve speech understanding.
This process is even more complicated for phrases and sentences -- the
system has to figure out where each word stops and starts. The classic
example is the phrase "recognize speech," which sounds a lot like
"wreck a nice beach" when you say it very quickly. The program has to
analyze the phonemes using the phrase that came before it in
order to get it right. Here's a breakdown of the two phrases:

r eh k ao g n ay z s p iy ch

"recognize speech“

r eh k ay n ay s b iy ch

"wreck a nice beach"
Who Can Benefit from Speech
Recognition?
Persons with mobility impairments or injuries
that prevent keyboard access
Persons who have or who are seeking to prevent
repetitive stress injuries
Persons with writing difficulties
Any person who want hands-free access to the
computer
Any persons who wants to increase their typing
speed
(reportedly up to 160 wpm)
WHAT IS REQUIRED TO USE SPEECH
RECOGNITION?

A Powerful Computer
Consistent Speech (not necessarily intelligible)
Fluid speech (i.e., not pausing between words)
desirable for use of continuous speech products
Patience
Basic knowledge of computers
Fairly high cognitive ability
Command recognition - Voice user interface
with the computer
Dictation
Interactive Voice Response
Automotive speech recognition
Medical Transcription
Pronunciation Teaching in computer-aided language
learning applications
Automatic Translation
Hands-free computing
 Discrete
Slower dictation process - better for persons
with difficulty in language processing or in fluid
speech
Word-by-word style, rather than phrases,
reflects the way beginning writers form
sentences
Continuous
Processes speech by phrase
Takes context into account
Is less accurate if phrases are interrupted
Advantages: Speed and accuracy (for most users)
ch
Lead to spee
pe
controlled ty
Identifica ation
tion writer, transl
& Recogn
i t i on system,
of Spea o rk plac e f or P-C.
ker w
Speech Analysis

WHO? What? How? Lie-D

etec
to r

Verification Identification Recognition Understanding

Reference storage:
Properties of
Learned Material

Problem
Speech Analysis:
Recognition:
SPEECH Parameters;
Comparison with
Response,
Reference,
Property Extraction
Decision

Special Chip Main Program

Recognized Speech
 A speaker-independent system can recognize with the
same reliability essentially fewer words than a speaker-
dependent system because the latter is TRAINED IN
ADVANCE. Training in advance means that there exists a
training phase for the speech recognition system, which
takes a half an hour.

Speaker-dependent recognition system can

recognize around 25,000 words, Speaker-
independent recognition system can recognize
around 500 words but with a worse recognition
rate .
The major players in the speech recognition market are
Dragon Systems, Lernout & Hauspie (L&H), and IBM.
 Dragon’s original product, Dragon Dictate, is currently
the only product that uses the discrete speech model

 The current L&H product line, called VoiceXpress,

includes a Standard, Advanced, and Professional edition.

 IBM has been a major player in speech recognition for

many years. Its discrete speech product, IBM Voice Type,
IBM has discontinued this product and is now focusing all
its efforts on developing continuous speech products. Its
current product line, IBM Via Voice Millenium, includes a
Standard, Web and Professional edition. The web edition
features natural language commands for Internet
Explorer, Netscape Communicator and America Online.
Room acoustic with existent environmental noise.
Overlapping of the primary sound wave.
Word boundary must be determined.
During comparison time normalization is necessary.
The same word can be spoken quickly or slowly
Speech Recognition: Weaknesses and Flaws

Low signal-to-noise ratio

Overlapping speech
Intensive use of computer power
Homonyms
Primary goal of the Speech Analysis is to
correctly determine individual words with.
probability ≤ 1
Environmental noise, room acoustics and a speaker’s
physical and psychological conditions play an
important role in determination.
Ex. let’s assume extremely bad individual words
recognition with a probability of 0.95. This means that
5% of the words are incorrectly recognized. If we have a
sentence with 3 words, the probability of
recognizing the sentence correctly is 0.95 × 0.95
× 0.95 = 0.857.
A universal translator
At some point in the future, speech recognition may
become speech understanding.
The Application of Speech Recognition Techniques to
Radar Target Doppler Recognition
universal translator
Multilingual Speech Processing, Edited by Tanja
Schultz and Katrin Kirchhoff, April 2006
Multimedia : COMPUTING ,COMMNICATIONS &
APPLICATIONS (By. RALF STEINMETZ & KLARA
NABRSTED)
www.software.ibm.com/speech/
www.dragonsys.com
http://cslu.cse.ogi.edu/HLTsurvey/ch1node5.html
http://www.apple.com/macosx/developertools
Enjoy Speech Recognition
Technology

Speech Recognition Seminar Report
87% (97)
Speech Recognition Seminar Report
32 pages
Reading and Writing Teaching Guide
100% (3)
Reading and Writing Teaching Guide
26 pages
English Studies Scheme of Work For Junior Secondary School JSS 1
100% (2)
English Studies Scheme of Work For Junior Secondary School JSS 1
23 pages
Mr. Sibananda Panda Mca 4 Semister
No ratings yet
Mr. Sibananda Panda Mca 4 Semister
18 pages
Speech Recognition
0% (1)
Speech Recognition
27 pages
Speech Recognition PPT F
100% (2)
Speech Recognition PPT F
16 pages
Speech Recognition Technology
No ratings yet
Speech Recognition Technology
14 pages
Speech Recognition: Prof. Ram Meghe Institute of Technology and Research, Badnera-Amravati
No ratings yet
Speech Recognition: Prof. Ram Meghe Institute of Technology and Research, Badnera-Amravati
13 pages
Speech Recognition: BY Charu Joshi
100% (2)
Speech Recognition: BY Charu Joshi
26 pages
Speech Recognition: BY Charu Joshi
No ratings yet
Speech Recognition: BY Charu Joshi
26 pages
Speech Recognition Report
100% (1)
Speech Recognition Report
20 pages
SPEECH
100% (1)
SPEECH
17 pages
Features: Digital Assistant
No ratings yet
Features: Digital Assistant
7 pages
An Introduction To Speech and Speaker Recognition
No ratings yet
An Introduction To Speech and Speaker Recognition
8 pages
Minor Project123
No ratings yet
Minor Project123
40 pages
Speech Technology
No ratings yet
Speech Technology
5 pages
Speech Recognition System Using Ic Hm2007
100% (1)
Speech Recognition System Using Ic Hm2007
21 pages
A Seminar Report On: R. H. Sapat College of Engineering, Management Studies and Research
No ratings yet
A Seminar Report On: R. H. Sapat College of Engineering, Management Studies and Research
32 pages
Features: Digital Assistant
No ratings yet
Features: Digital Assistant
8 pages
Ai in Speech Recognition
No ratings yet
Ai in Speech Recognition
24 pages
Artificial Intelligence: Presented By: A.Sowmya CH - Sushma
No ratings yet
Artificial Intelligence: Presented By: A.Sowmya CH - Sushma
10 pages
Speech Recognition Seminar
No ratings yet
Speech Recognition Seminar
19 pages
SPEECH
No ratings yet
SPEECH
8 pages
Speech Recognition Technology
No ratings yet
Speech Recognition Technology
22 pages
Speech Recognition Technology
No ratings yet
Speech Recognition Technology
24 pages
Voice Recognition System: Third Year Electronics, Third Year Electronics
No ratings yet
Voice Recognition System: Third Year Electronics, Third Year Electronics
14 pages
Key Application: - Audrey System - The First Speech Recognition System Introduced by Bell Laboratories in 1952
No ratings yet
Key Application: - Audrey System - The First Speech Recognition System Introduced by Bell Laboratories in 1952
8 pages
Ai Project Sona-1 (1) - 250630 - 194118
No ratings yet
Ai Project Sona-1 (1) - 250630 - 194118
10 pages
Speech Recognition Technology in A Ubiquitous Computing Environment
No ratings yet
Speech Recognition Technology in A Ubiquitous Computing Environment
24 pages
A Survey On Speech Recognition
No ratings yet
A Survey On Speech Recognition
2 pages
Voice Recognition System Report
No ratings yet
Voice Recognition System Report
17 pages
Speech Recognition Full Report
No ratings yet
Speech Recognition Full Report
11 pages
Speech Recognition Technology
No ratings yet
Speech Recognition Technology
23 pages
Speech Recognition: - Shetul Chothani
No ratings yet
Speech Recognition: - Shetul Chothani
19 pages
Speech Recognition Project
No ratings yet
Speech Recognition Project
33 pages
Speech Recognition Using Ic HM2007
100% (4)
Speech Recognition Using Ic HM2007
31 pages
Tan Pan Hassan VoiceRecognition
No ratings yet
Tan Pan Hassan VoiceRecognition
21 pages
Speech Recognition
No ratings yet
Speech Recognition
17 pages
(IJCST-V4I2P62) :Dr.V.Ajantha Devi, Ms.V.Suganya
No ratings yet
(IJCST-V4I2P62) :Dr.V.Ajantha Devi, Ms.V.Suganya
6 pages
Key Application: Automatic Speech Recognition or ASR, As It's
No ratings yet
Key Application: Automatic Speech Recognition or ASR, As It's
8 pages
Speech Recognition Technology: Applications & Future: Pankaj Pathak
No ratings yet
Speech Recognition Technology: Applications & Future: Pankaj Pathak
3 pages
Tan Pan Hassan VoiceRecognition
No ratings yet
Tan Pan Hassan VoiceRecognition
21 pages
Final Report
No ratings yet
Final Report
35 pages
The PC Interfaced Voice Recognition System Is To Implement A Password For Authentication
No ratings yet
The PC Interfaced Voice Recognition System Is To Implement A Password For Authentication
7 pages
A Report On
No ratings yet
A Report On
35 pages
Text and Speech CCS369-UNIT 5
No ratings yet
Text and Speech CCS369-UNIT 5
9 pages
Automatic Speech Recognition
No ratings yet
Automatic Speech Recognition
9 pages
Speech Recognition1
No ratings yet
Speech Recognition1
24 pages
Speech Recognition: SK - Rahil 1602-11-735-046
No ratings yet
Speech Recognition: SK - Rahil 1602-11-735-046
1 page
NLP 1.3.1 - Speed Recogmnition
No ratings yet
NLP 1.3.1 - Speed Recogmnition
20 pages
Vivek Kumar - 1613112052
No ratings yet
Vivek Kumar - 1613112052
7 pages
Work 3
No ratings yet
Work 3
22 pages
AI Speech Recognition Document
No ratings yet
AI Speech Recognition Document
26 pages
Tejaswini Group Report
No ratings yet
Tejaswini Group Report
18 pages
Seminar Presentation: Topic: Speech Recognition
No ratings yet
Seminar Presentation: Topic: Speech Recognition
26 pages
Jasmeet Seminar Report
No ratings yet
Jasmeet Seminar Report
24 pages
Ai For Speech Recognition
No ratings yet
Ai For Speech Recognition
27 pages
Automatic Speech
No ratings yet
Automatic Speech
14 pages
Voice Technology Seminar
100% (1)
Voice Technology Seminar
35 pages
Personal Voice Assistant in Python
100% (1)
Personal Voice Assistant in Python
30 pages
Speaker Recognition: Fundamentals and Applications
From Everand
Speaker Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Coding for Beginners: A Step-by-Step Guide to Learn Python, Java, SQL, C, C++, C#, HTML, and CSS from Scratch
From Everand
Coding for Beginners: A Step-by-Step Guide to Learn Python, Java, SQL, C, C++, C#, HTML, and CSS from Scratch
Vere salazar
No ratings yet
Aspects of Connected Speech
100% (2)
Aspects of Connected Speech
7 pages
ORAL COM Digital District ST
No ratings yet
ORAL COM Digital District ST
11 pages
As En6 Q2 W2 D2
No ratings yet
As En6 Q2 W2 D2
9 pages
Guide To Using The Phonics Tool: Choose A Voice
No ratings yet
Guide To Using The Phonics Tool: Choose A Voice
1 page
Vowels & Consonants 3rd Lecture
No ratings yet
Vowels & Consonants 3rd Lecture
79 pages
The Distinctive Features: Consonants
No ratings yet
The Distinctive Features: Consonants
25 pages
Vol.20 - SP - Ed.No.1-2018.-2322 Dunja Pavličević-Franić
No ratings yet
Vol.20 - SP - Ed.No.1-2018.-2322 Dunja Pavličević-Franić
22 pages
1st Speaking Marking Criteria
No ratings yet
1st Speaking Marking Criteria
3 pages
Segmentals
100% (1)
Segmentals
43 pages
English 3 Lamp
No ratings yet
English 3 Lamp
145 pages
Language As Chunks, Not Words: Ramesh Krishnamurthy
No ratings yet
Language As Chunks, Not Words: Ramesh Krishnamurthy
7 pages
Automatic Speech Recognition
No ratings yet
Automatic Speech Recognition
9 pages
Public Speaking Session 8 - Strategies For Final Delivery
No ratings yet
Public Speaking Session 8 - Strategies For Final Delivery
39 pages
c318 PDF
No ratings yet
c318 PDF
2 pages
Persuasive Presentation Rubrics
No ratings yet
Persuasive Presentation Rubrics
6 pages
11 Slides SP Skills II 2
No ratings yet
11 Slides SP Skills II 2
67 pages
Difference Between Phonetics and Phonology
100% (2)
Difference Between Phonetics and Phonology
2 pages
Business Proposal Talk To Hand
No ratings yet
Business Proposal Talk To Hand
33 pages
The Sound of Intellect
No ratings yet
The Sound of Intellect
16 pages
Lectures 1 Rabiner Speech Processing
No ratings yet
Lectures 1 Rabiner Speech Processing
77 pages
Proposal
No ratings yet
Proposal
22 pages
НМК Practical Phonetics
No ratings yet
НМК Practical Phonetics
129 pages
Public Speaking & Creative Writing Curriculum: Storytelling Voice & Fluency Conversation Skills
No ratings yet
Public Speaking & Creative Writing Curriculum: Storytelling Voice & Fluency Conversation Skills
1 page
Finn2008AHandbookonStuttering BookReview
No ratings yet
Finn2008AHandbookonStuttering BookReview
5 pages
Sample Lesson Plan
No ratings yet
Sample Lesson Plan
6 pages
Articulatory Phonetics
No ratings yet
Articulatory Phonetics
3 pages
Department of Education: Division Virtual Scilympics
No ratings yet
Department of Education: Division Virtual Scilympics
40 pages
2.1 General Phonetics, Cardinal Vowels & Consonants
No ratings yet
2.1 General Phonetics, Cardinal Vowels & Consonants
8 pages

Team: Mr. Rahul Kr. Singh MR - Hitesh Kumar It Vii Sem

Uploaded by

Team: Mr. Rahul Kr. Singh MR - Hitesh Kumar It Vii Sem

Uploaded by

TEAM: Mr. RAHUL KR.

One of them is the Apple Macintosh of today.

Voice Verification or speaker recognition is a related

The recognized words can be the final results, as for

They can also serve as the input to further linguistic

r eh k ao g n ay z s p iy ch

r eh k ay n ay s b iy ch

WHO? What? How? Lie-D

Verification Identification Recognition Understanding

Special Chip Main Program

Speaker-dependent recognition system can

 The current L&H product line, called VoiceXpress,

 IBM has been a major player in speech recognition for

Low signal-to-noise ratio

You might also like