The document discusses speech recognition, which involves mapping acoustic signals of spoken language to corresponding word sequences. It defines the input as a sequence of acoustic vectors and the output as a target sequence of words or characters. The goal is to develop an automatic speech recognition (ASR) function that accurately computes the most probable linguistic sequence from the acoustic input.
Speech Recognition
Mr. Sivadasan E T, Associate Professor, Vidya Academy of Science and Technology, Thrissur
The task of speech recognition is to map an acoustic signal containing a spoken natural language utterance into the corresponding sequence of words intended by the speaker.
Let X = (x(1), x(2), ..., x(T)) denote the sequence of acoustic input vectors (traditionally produced by splitting the audio into 20 ms frames).
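As a rough illustration of the framing step, the sketch below cuts a list of audio samples into consecutive 20 ms frames. The function name split_into_frames, the 16 kHz sample rate, and the use of non-overlapping frames of raw samples are assumptions made for this example only; the slides do not specify them, and practical systems typically compute a feature vector from each frame rather than using raw samples.

```python
def split_into_frames(samples, sample_rate, frame_ms=20):
    """Split a 1-D sequence of audio samples into consecutive frames.

    Hypothetical helper for illustration: frames do not overlap, and any
    trailing samples that do not fill a whole frame are dropped.
    """
    # Number of samples per frame, using integer arithmetic.
    frame_len = sample_rate * frame_ms // 1000
    if frame_len <= 0:
        raise ValueError("frame length must be at least one sample")
    n_frames = len(samples) // frame_len
    return [
        list(samples[t * frame_len:(t + 1) * frame_len])
        for t in range(n_frames)
    ]


# Assumed example input: one second of silence sampled at 16 kHz.
sample_rate = 16000
samples = [0.0] * sample_rate

# X plays the role of (x(1), ..., x(T)); here each x(t) is a raw frame.
X = split_into_frames(samples, sample_rate)
print(len(X), len(X[0]))  # T frames, samples per frame
```

With these assumed values, one second of audio gives T = 50 frames of 320 samples each.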
Let y = (y1, y2, ..., yN) denote the target output sequence (usually a sequence of words or characters).
The automatic speech recognition (ASR) task consists of creating a function f∗ASR that computes the most probable linguistic sequence y given the acoustic sequence X:
f∗ASR(X) = arg max_y P∗(y | X)
where P∗ is the true conditional distribution relating the inputs X to the targets y.
Goal of ASR Models
Improving ASR Accuracy:
Thank You!