Sp'module 4.pdf'
Sp'module 4.pdf'
Complex cepstrum involves the use of the complex logarithm and that the
cepstrum, as it has traditionally been defined, involves only the logarithm of
the magnitude of the Fourier transform; that is, the short-time cepstrum,c[n],
is given by
Computation Based on the z-Transform
HOMOMORPHIC FILTERING OF NATURAL SPEECH
• We are now in a position to apply the concepts of the
cepstrum and homomorphic filtering to a natural
speech signal.
• Recall that the model for speech production, as shown
in Figure 8.12, consists of a slowly time-varying linear
system excited by either a quasi-periodic impulse train
or by random noise.
• Thus, it is appropriate to think of a short segment of
voiced speech as having been taken from the steady-
state output of a linear time-invariant system excited
by a periodic impulse train.
• Similarly, a short segment of unvoiced speech can be
thought of as resulting from the excitation of a linear
time-invariant system by random noise.
• The purpose of this section is to demonstrate that
similar behavior results if short-time homomorphic
analysis methods are employed with natural speech
inputs.
A Model for Short-Time Cepstral Analysis of
Speech
• over the length(L) of the window, the speech signal s[n]
satisfies the convolution equation
For unvoiced speech, no such periodicity occurs in the logarithm of the DTFT of the
windowed unvoiced signal, and therefore no cepstral peaks occur.
Voiced Speech Analysis Using the DFT
• Figure 8.31, which shows a segment of speech selected by the window,
w[n], with the complex cepstrum computed of the input is selected by
what might be termed a “cepstrum window,” denoted l[n]. This type of
filtering is appropriately called “frequency-invariant linear filtering”
since multiplying the complex cepstrum l[n]by corresponds to
convolving its DTFT, L(e jω), X(eˆ jω), as in
with the complex logarithm,
Unvoiced Speech Analysis Using the DFT