
Vocoders: Nadeem Pasha

This document discusses vocoders, which are used to model speech so the most important features can be captured with few bits. It explains that vocoders either model the speech waveform over time or break it down into frequency components. The document also discusses that phase is not important for speech perception as long as the energy levels are correct. It provides details on how channel vocoders and formant vocoders work by separating speech into frequency bands and analyzing excitation and pitch.


Vocoders

Nadeem pasha
1st Sem, Mtech(SP)
Dept of ECE
Siddaganga Institute of Technology

Vocoders are voice coders. They cannot be usefully applied when other analog signals, such as modem signals, are in use.
Vocoders are concerned with modeling speech so that the salient features are captured in as few bits as possible.
They either use a model of the speech waveform in time (LPC (Linear Predictive Coding) vocoding), or
break down the signal into frequency components and model these (channel vocoders and formant vocoders).

Vocoder simulation of the voice is not very good yet.


Phase Insensitivity
A complete reconstruction of the speech waveform is perceptually unnecessary: all that is needed is for the amount of energy at any time to be about right, and the signal will sound about right.
Phase is a shift in the time argument inside a function of time.
Suppose we strike a piano key and generate a roughly sinusoidal sound $\cos(\omega t)$, with $\omega = 2\pi f$.
Now if we wait sufficient time to generate a phase shift $\pi/2$ and then strike another key, with sound $\cos(2\omega t + \pi/2)$, we generate a waveform like the solid line in the figure.
This waveform is the sum $\cos(\omega t) + \cos(2\omega t + \pi/2)$.


Figure: Solid line: superposition of two cosines, with a phase shift. Dashed line: no phase shift. The wave is very different, yet the sound is the same, perceptually.

If we did not wait before striking the second note, then our waveform would be $\cos(\omega t) + \cos(2\omega t)$. But perceptually, the two notes would sound the same, even though in actuality they would be shifted in phase.
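
The phase argument can be checked numerically. The sketch below is only an illustration under stated assumptions: the window length `N`, the cycle count `CYCLES`, and the helper names `two_tone` and `dft_magnitudes` are hypothetical choices, not part of the slides. It builds the shifted and unshifted two-note waveforms and compares their samples and their magnitude spectra (the energy at each frequency).

```python
import cmath
import math

def two_tone(n_samples, cycles, phase):
    """Samples of cos(wt) + cos(2wt + phase) over one analysis window."""
    out = []
    for n in range(n_samples):
        wt = 2 * math.pi * cycles * n / n_samples
        out.append(math.cos(wt) + math.cos(2 * wt + phase))
    return out

def dft_magnitudes(x):
    """Magnitude of each DFT bin up to Nyquist (naive O(N^2) DFT)."""
    n = len(x)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
        mags.append(abs(acc))
    return mags

# Assumed window: 64 samples holding exactly 4 cycles of the lower note.
N, CYCLES = 64, 4
shifted = two_tone(N, CYCLES, math.pi / 2)   # second key struck late
unshifted = two_tone(N, CYCLES, 0.0)         # both keys struck together

# The waveforms differ sample by sample...
max_wave_diff = max(abs(a - b) for a, b in zip(shifted, unshifted))
# ...but the energy per frequency bin is the same up to rounding error.
max_mag_diff = max(
    abs(a - b)
    for a, b in zip(dft_magnitudes(shifted), dft_magnitudes(unshifted))
)

print(f"largest sample difference:    {max_wave_diff:.3f}")
print(f"largest magnitude difference: {max_mag_diff:.2e}")
```

The sample difference is large while the magnitude difference is at rounding-error level, which is the sense in which only the energy, not the phase, needs to be preserved.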

Channel Vocoder
Vocoders can operate at low bit-rates, 1-2 kbps. To do
so, a channel vocoder first applies a filter bank to
separate out the different frequency components.

Figure: Channel Vocoder



Due to phase insensitivity (i.e. only the energy is important):
The waveform is rectified to its absolute value.
The filter bank derives relative power levels for each frequency range.
A subband coder would not rectify the signal, and would use wider frequency bands.
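
The analysis steps above (filter bank, rectification, per-band level) can be sketched as follows. This is a minimal illustration, not the coder from the slides: the band edges, the 8 kHz sample rate, the test frame, and the use of DFT masking as a stand-in for real band-pass filters are all assumptions, and `band_levels` is a hypothetical helper name.

```python
import cmath
import math

def dft(x):
    """Naive O(N^2) discrete Fourier transform (fine for short frames)."""
    n = len(x)
    return [sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
            for k in range(n)]

def idft_real(spectrum):
    """Inverse DFT, keeping the real part (input is conjugate-symmetric)."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * math.pi * k * i / n)
                 for k in range(n)) / n).real
            for i in range(n)]

def band_levels(frame, sample_rate, band_edges_hz):
    """Return one rectified average level per frequency band."""
    n = len(frame)
    spectrum = dft(frame)
    levels = []
    for lo, hi in zip(band_edges_hz[:-1], band_edges_hz[1:]):
        # Band-pass by zeroing bins outside [lo, hi): an assumed stand-in
        # for one filter of the filter bank.
        masked = []
        for k in range(n):
            freq = min(k, n - k) * sample_rate / n  # fold negative frequencies
            masked.append(spectrum[k] if lo <= freq < hi else 0j)
        band_signal = idft_real(masked)
        # Rectify to the absolute value, then average to get the band level.
        rectified = [abs(v) for v in band_signal]
        levels.append(sum(rectified) / n)
    return levels

# Hypothetical test frame: a strong 250 Hz tone plus a weaker 1500 Hz tone.
SAMPLE_RATE = 8000
FRAME = [0.8 * math.sin(2 * math.pi * 250 * n / SAMPLE_RATE)
         + 0.2 * math.sin(2 * math.pi * 1500 * n / SAMPLE_RATE)
         for n in range(256)]
BAND_EDGES_HZ = [0, 500, 1000, 2000, 4000]  # assumed band layout

levels = band_levels(FRAME, SAMPLE_RATE, BAND_EDGES_HZ)
for (lo, hi), level in zip(zip(BAND_EDGES_HZ, BAND_EDGES_HZ[1:]), levels):
    print(f"{lo:4d}-{hi:4d} Hz: {level:.3f}")
```

Only the list of per-band levels would need to be transmitted for each frame, which is what allows the low bit-rate.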

A channel vocoder also analyzes the signal to determine the general pitch of the speech (low, as for a bass voice, or high, as for a tenor), and also the excitation of the speech.
A channel vocoder applies a vocal tract transfer model to generate a vector of excitation parameters that describe a model of the sound, and also guesses whether the sound is voiced or unvoiced.
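
The slides do not say how the pitch and the voiced/unvoiced guess are obtained. One common approach, shown here purely as an assumed illustration, is to look for a strong peak in the autocorrelation of a frame: a clear peak suggests voiced speech, and its lag gives the pitch period. The function name `analyze_frame`, the pitch search range, and the 0.5 threshold are hypothetical choices.

```python
import math
import random

def autocorr(frame, lag):
    """Unnormalized autocorrelation of the frame at the given lag."""
    return sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))

def analyze_frame(frame, sample_rate, fmin=60.0, fmax=400.0, threshold=0.5):
    """Guess (voiced, pitch_hz) for one frame; pitch_hz is None if unvoiced."""
    energy = autocorr(frame, 0)
    if energy == 0.0:
        return False, None  # silent frame: treat as unvoiced
    # Search only lags that correspond to plausible speech pitch (assumed range).
    lo = int(sample_rate / fmax)
    hi = min(int(sample_rate / fmin), len(frame) - 1)
    best_lag = max(range(lo, hi + 1), key=lambda lag: autocorr(frame, lag))
    strength = autocorr(frame, best_lag) / energy
    if strength < threshold:
        return False, None  # no clear periodicity: guess unvoiced
    return True, sample_rate / best_lag

SAMPLE_RATE = 8000
# Hypothetical voiced frame: harmonics of a 125 Hz fundamental.
voiced_frame = [
    math.sin(2 * math.pi * 125 * n / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 250 * n / SAMPLE_RATE)
    + 0.25 * math.sin(2 * math.pi * 375 * n / SAMPLE_RATE)
    for n in range(384)
]
# Hypothetical unvoiced frame: white noise from a fixed seed.
rng = random.Random(0)
noise_frame = [rng.uniform(-1.0, 1.0) for _ in range(384)]

voiced_result = analyze_frame(voiced_frame, SAMPLE_RATE)
noise_result = analyze_frame(noise_frame, SAMPLE_RATE)
print("voiced frame:", voiced_result)
print("noise frame: ", noise_result)
```

A decoder could then drive its synthesis filters with a pulse train at the estimated pitch for voiced frames and with noise for unvoiced frames.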


Formant Vocoder
Formants: the salient frequency components that are
present in a sample of speech, as shown in Figure.

Figure: The solid line shows the frequencies present in the first 40 msec of a digital speech sample. The dashed line shows that while similar frequencies are still present one second later, these frequencies have shifted.
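
As a rough illustration of finding the salient frequency components in a 40 msec frame, the sketch below picks the strongest local peaks of a frame's magnitude spectrum. It is a simplification under stated assumptions: practical formant analysis usually works on a smoothed spectral envelope, and the synthetic frame, the 8 kHz sample rate, and the helper names `magnitude_spectrum` and `spectral_peaks` are hypothetical.

```python
import cmath
import math

def magnitude_spectrum(frame):
    """Magnitudes of DFT bins 0..N/2 (naive O(N^2) DFT)."""
    n = len(frame)
    return [abs(sum(frame[i] * cmath.exp(-2j * math.pi * k * i / n)
                    for i in range(n)))
            for k in range(n // 2 + 1)]

def spectral_peaks(frame, sample_rate, count=3, floor=0.1):
    """Frequencies (Hz) of the `count` strongest local spectral peaks."""
    mags = magnitude_spectrum(frame)
    n = len(frame)
    cutoff = floor * max(mags)  # ignore peaks far below the strongest one
    candidates = [
        k for k in range(1, len(mags) - 1)
        if mags[k] > cutoff and mags[k] > mags[k - 1] and mags[k] >= mags[k + 1]
    ]
    strongest = sorted(candidates, key=lambda k: mags[k], reverse=True)[:count]
    return sorted(k * sample_rate / n for k in strongest)

SAMPLE_RATE = 8000
FRAME_LEN = 320  # 40 msec at the assumed 8 kHz sample rate
# Hypothetical frame with three dominant components.
frame = [
    1.0 * math.sin(2 * math.pi * 500 * n / SAMPLE_RATE)
    + 0.6 * math.sin(2 * math.pi * 1500 * n / SAMPLE_RATE)
    + 0.3 * math.sin(2 * math.pi * 2500 * n / SAMPLE_RATE)
    for n in range(FRAME_LEN)
]

peaks = spectral_peaks(frame, SAMPLE_RATE)
print("peak frequencies (Hz):", peaks)
```

Running the same analysis on a later frame and comparing the peak lists would show the kind of shift the figure describes.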