Speech Recognition

The document discusses speech recognition, which involves mapping acoustic signals of spoken language to corresponding word sequences. It defines the input as a sequence of acoustic vectors and the output as a target sequence of words or characters. The goal is to develop an automatic speech recognition (ASR) function that accurately computes the most probable linguistic sequence from the acoustic input.

Uploaded by

devanand272003

Speech Recognition

Mr. Sivadasan E T
Associate Professor
Vidya Academy of Science and Technology, Thrissur
Speech recognition

The task of speech recognition is to map an acoustic signal containing a spoken natural language utterance into the corresponding sequence of words intended by the speaker.
Speech recognition

Let X = (x^(1), x^(2), ..., x^(T)) denote the sequence of acoustic input vectors (traditionally produced by splitting the audio into 20 ms frames).
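
The framing step mentioned above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `frame_signal`, the 16 kHz sample rate, and the synthetic sine-wave input are hypothetical, and the raw frames stand in for the acoustic vectors x^(t). Practical front ends usually use overlapping windows and compute spectral features from each frame.

```python
import math


def frame_signal(samples, sample_rate, frame_ms=20.0):
    """Split a list of audio samples into non-overlapping frames of frame_ms milliseconds."""
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = len(samples) // frame_len  # trailing samples that do not fill a frame are dropped
    return [samples[t * frame_len:(t + 1) * frame_len] for t in range(n_frames)]


# Assumed input: one second of a 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
samples = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]

X = frame_signal(samples, sample_rate)  # X[t] plays the role of x^(t+1)
print(len(X), len(X[0]))  # T = 50 frames of 320 samples each
```

With these assumed settings, one second of audio yields T = 50 frames, each holding 320 samples.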

Let y = (y_1, y_2, ..., y_N) denote the target output sequence (usually a sequence of words or characters).
Speech recognition
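The automatic speech recognition (ASR) task consists of creating a function f*_ASR that computes the most probable linguistic sequence y given the acoustic sequence X:

f*_ASR(X) = argmax_y P*(y | X),

where P* is the true conditional distribution relating the inputs X to the targets y.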
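
Choosing the most probable sequence y given X amounts to an argmax over candidate sequences. The Python sketch below shows this over a tiny, explicit candidate set. The names `f_asr` and `toy_cond_prob`, the candidate sentences, and their scores are all hypothetical and are used only for illustration. A real recognizer would score candidates with a learned model and search a far larger space.

```python
def f_asr(X, candidates, cond_prob):
    """Return the candidate sequence y with the highest score cond_prob(y, X)."""
    return max(candidates, key=lambda y: cond_prob(y, X))


# Hypothetical scores standing in for P(y | X); they are not produced by any model.
toy_scores = {
    ("recognize", "speech"): 0.6,
    ("wreck", "a", "nice", "beach"): 0.3,
    ("recognise", "peach"): 0.1,
}


def toy_cond_prob(y, X):
    """Toy stand-in for P(y | X) that ignores X and looks the score up in a table."""
    return toy_scores[y]


# Placeholder acoustic input: 50 frames of 320 zeros (assumed shape).
X = [[0.0] * 320 for _ in range(50)]

best = f_asr(X, list(toy_scores), toy_cond_prob)
print(best)
```

Enumerating every candidate is only feasible in a toy setting like this one, which is why the sketch keeps the candidate set explicit and small.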
Goal of ASR Models
Improving ASR Accuracy:
Thank You!
