Sp'module 4.pdf'

Uploaded by

Manoj Naik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views70 pages

Sp'module 4.pdf'

Uploaded by

Manoj Naik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 70

Module 4: The Cepstrum and

Homomorphic Speech Processing

• For discrete-time signals, a deﬁnition that captures the essential
features of the original deﬁnition is that thecepstrum of a signal is
the inverse discrete-time Fourier transform (IDTFT) of the logarithm
of the magnitude of the discrete-time Fourier transform (DTFT) of
the signal.
That is,
HOMOMORPHIC SYSTEMS FOR CONVOLUTION

• Oppenheim developed a new theory of systems that

was based on the mathematical theory of linear vector
spaces.
• The essence of this theory was that certain operations
of signal combination (convolution and multiplication
in particular) satisfy the same postulates as does
addition in the theory of linear vector spaces.
• From this observation, Oppenheim showed that
classes of non-linear systems could be defined on the
basis of a generalized principle of superposition. He
termed such systems homomorphic systems.
• Of particular importance for our present discussion is
the class of homomorphic systems for which the input
and output are combined by convolution.
• Ahomomorphic filter is simply a homomorphic system having the
property that one component (the desired component) passes
through the system essentially unaltered, while the other
component (the undesired component) is removed.
• In Eq. (8.5), for example, ifx2[n] were the undesirable component,
we would require that the output corresponding tox2[n] be a unit
sample, y2[n] = δ[n], while the output corresponding tox1[n] would
closely approximate x1[n] so that the output of the homomorphic
filter would bey[n] = x1[n] ∗ δ[n] = x1[n].
• This is entirely analogous to when a conventional linear system is
used to separate (filter) a desired signal from an additive
combination of the desired signal and noise. In this case the
desired result is that the output due to the noise is zero.
• Thus, the sequenceδ[n] plays the same role for convolution as is
played by the zero signal for additive combinations. Homomorphic
filters are of interest to us because our goal in speech processing
is to separate the convolved excitation and vocal tract
components of the speech model.
• An important aspect of the theory of homomorphic systems is that
any homomorphic system can be represented as a cascade of
three homomorphic systems, as depicted in Figure 8.3 for the case
of homomorphic systems for convolution.
The first system takes inputs combined by convolution and transforms them
into an additive combination of corresponding outputs.
The second system is a conventional linear system obeying the principle of
superposition as given in Eq. (8.3a). The third system is the inverse of the first
system; i.e., it transforms signals combined by addition back into signals
combined by convolution.
The importance of the existence of such a canonic form for homomorphic
systems is that the design of such systems reduces to the problem of the
design of the central linear system in Figure 8.3.
The systemD∗{·} is called the characteristic system for convolution and it is
fixed in the canonic form of Figure 8.3. Likewise, its inverse, called theinverse
characteristic system for convolution, and denoted D−1 ∗ {·}, is also a fixed
system.
• The characteristic system for convolution also obeys a
generalized principle of superposition where the input operation
is convolution and the output operation is ordinary addition.
The properties of the characteristic system are defined as
Representation by DTFTs
an appropriate definition of the complex logarithm is
Figure 8.7 illustrates the problem
that arises when one tries to
properly define the phase angle of
the DTFT. The principal value phase
has discontinuities of sizeπ2
because an angle in the complex
plane is always ambiguous to within
an integer multipleπ.ofThis
2
poses no problem for the complex
exponential
Minimum- and Maximum-Phase Signals
Homomorphic Analysis of the Speech Model
• Since the excitation and impulse response of a linear
time-invariant system are combined by convolution, the
problem of speech analysis can also be viewed as a
problem in separating the components of a convolution,
and therefore, homomorphic systems and the cepstrum
are useful tools for speech analysis.
• In the model of Figure 8.12, the pressure signal at the
lips,s[n], for a voiced section of speech is represented
as the convolution
s[n] = p[n] ∗ hV[n], .............(8.38a)
• where p[n] is the quasi-periodic voiced excitation signal,
• hV[n] represents the combined effect of the vocal tract
impulse responsev[n], the glottal pulse g[n], the radiation
load response at the lips, r[n], and the voiced gain, AV.
• The effective impulse response,hV[n], is itself the convolution
of g[n], v[n], and r[n], including scaling by the voiced section
gain control, AV; i.e.,
hV[n] = AV · g[n] ∗ v[n] ∗ r[n]........ (8.38b)
Homomorphic Analysis of the Model for Voiced Speech
Homomorphic Analysis of the Model for Unvoiced Speech
COMPUTING THE SHORT-TIME CEPSTRUM AND COMPLEX CEPSTRUM OF
SPEECH
The inverse characteristic system for convolution is needed for homomorphic
filtering
of speech. Following our approach above, we obtain this system from Figure
8.6 by simply replacing the DTFT operators by their corresponding DFT
computations.

Complex cepstrum involves the use of the complex logarithm and that the
cepstrum, as it has traditionally been defined, involves only the logarithm of
the magnitude of the Fourier transform; that is, the short-time cepstrum,c[n],
is given by
Computation Based on the z-Transform
HOMOMORPHIC FILTERING OF NATURAL SPEECH
• We are now in a position to apply the concepts of the
cepstrum and homomorphic filtering to a natural
speech signal.
• Recall that the model for speech production, as shown
in Figure 8.12, consists of a slowly time-varying linear
system excited by either a quasi-periodic impulse train
or by random noise.
• Thus, it is appropriate to think of a short segment of
voiced speech as having been taken from the steady-
state output of a linear time-invariant system excited
by a periodic impulse train.
• Similarly, a short segment of unvoiced speech can be
thought of as resulting from the excitation of a linear
time-invariant system by random noise.
• The purpose of this section is to demonstrate that
similar behavior results if short-time homomorphic
analysis methods are employed with natural speech
inputs.
A Model for Short-Time Cepstral Analysis of
Speech
• over the length(L) of the window, the speech signal s[n]
satisfies the convolution equation
For unvoiced speech, no such periodicity occurs in the logarithm of the DTFT of the
windowed unvoiced signal, and therefore no cepstral peaks occur.
Voiced Speech Analysis Using the DFT
• Figure 8.31, which shows a segment of speech selected by the window,
w[n], with the complex cepstrum computed of the input is selected by
what might be termed a “cepstrum window,” denoted l[n]. This type of
filtering is appropriately called “frequency-invariant linear filtering”
since multiplying the complex cepstrum l[n]by corresponds to
convolving its DTFT, L(e jω), X(eˆ jω), as in
with the complex logarithm,
Unvoiced Speech Analysis Using the DFT

• To complete the illustration of homomorphic analysis of

natural speech, consider the example of unvoiced
speech given in Figure 8.35. Figure 8.35a shows a
waveform segment of the fricative /SH/ multiplied by a
401-point Hamming window. The rapidly varying curve
plotted with the thin line in Figure 8.35b is the
corresponding log magnitude function X(e log |jω)|.
Figure 8.35c shows the corresponding cepstrum c[n].
CEPSTRUM ANALYSIS OF ALL-POLE
MODELS
CEPSTRUM DISTANCE MEASURES
Mel-Frequency Cepstrum
Coeﬃcients

(Thomas F. Quatieri) Discrete Time Speech Signal P (BookFi - Org) 2 PDF
100% (3)
(Thomas F. Quatieri) Discrete Time Speech Signal P (BookFi - Org) 2 PDF
800 pages
Rajesh Thesis
No ratings yet
Rajesh Thesis
86 pages
Homomorphic Filtering and Speech Processing Using Cepstrum Analysis
100% (2)
Homomorphic Filtering and Speech Processing Using Cepstrum Analysis
22 pages
Oppenheim1968 - Homomorphic Analysis of Speech
No ratings yet
Oppenheim1968 - Homomorphic Analysis of Speech
6 pages
Lecture 6 Convolution Using DFT
No ratings yet
Lecture 6 Convolution Using DFT
38 pages
Final Report On Speech Recognition Project
No ratings yet
Final Report On Speech Recognition Project
32 pages
Cepstrum Analysis
No ratings yet
Cepstrum Analysis
23 pages
Final Lesson No. 6 (Graphs of Sine and Cosine Functions)
No ratings yet
Final Lesson No. 6 (Graphs of Sine and Cosine Functions)
39 pages
DSP PDF
No ratings yet
DSP PDF
0 pages
Gessl Cepstrum
No ratings yet
Gessl Cepstrum
29 pages
Real and Complex Cepstrum
No ratings yet
Real and Complex Cepstrum
26 pages
Non-Linear Convolution: A New Approach For The Auralization of Distorting Systems
No ratings yet
Non-Linear Convolution: A New Approach For The Auralization of Distorting Systems
20 pages
l8 Ceps
No ratings yet
l8 Ceps
1 page
Group Delay Functions and Its Applications in Speech
No ratings yet
Group Delay Functions and Its Applications in Speech
38 pages
Cepstrum
No ratings yet
Cepstrum
5 pages
Am-Demodulation of Speech Spectra and Its Application To Noise Robust Speech Recognition
No ratings yet
Am-Demodulation of Speech Spectra and Its Application To Noise Robust Speech Recognition
4 pages
Ece6255 L16
No ratings yet
Ece6255 L16
71 pages
Cepstral Analysis: Appendix 3
No ratings yet
Cepstral Analysis: Appendix 3
3 pages
E9 261 - Speech Information Processing: Homework # 3 Due Date: May 2, 2021
No ratings yet
E9 261 - Speech Information Processing: Homework # 3 Due Date: May 2, 2021
4 pages
DSP Manual Exp1 9
No ratings yet
DSP Manual Exp1 9
34 pages
Speech Analisys
No ratings yet
Speech Analisys
56 pages
ps6 Soln Fall09
No ratings yet
ps6 Soln Fall09
12 pages
SP Question Bank
No ratings yet
SP Question Bank
2 pages
Cepstrum vs. LPC: A Comparative Study For Speech Formant Frequencies Estimation
No ratings yet
Cepstrum vs. LPC: A Comparative Study For Speech Formant Frequencies Estimation
16 pages
Homomorphic Speech Processing
No ratings yet
Homomorphic Speech Processing
32 pages
SP Question Bank
No ratings yet
SP Question Bank
2 pages
Padovani
No ratings yet
Padovani
4 pages
Group Delay
No ratings yet
Group Delay
38 pages
Linear Systems Theory
No ratings yet
Linear Systems Theory
11 pages
DSP 2 Fourier Analysis
No ratings yet
DSP 2 Fourier Analysis
54 pages
Scale Transform in Speech Analysis
No ratings yet
Scale Transform in Speech Analysis
6 pages
Algorithm For The Identification and Verification Phase
No ratings yet
Algorithm For The Identification and Verification Phase
9 pages
Module 3
No ratings yet
Module 3
12 pages
7.0 Speech Signals and Front-End Processing: References: 1. 3.3, 3.4 of Becchetti
No ratings yet
7.0 Speech Signals and Front-End Processing: References: 1. 3.3, 3.4 of Becchetti
50 pages
Speech Feature Extraction
No ratings yet
Speech Feature Extraction
9 pages
Cepstrum and Homomorphic Filtering
No ratings yet
Cepstrum and Homomorphic Filtering
24 pages
Module 3 Complete Notes
No ratings yet
Module 3 Complete Notes
123 pages
l4n JN Uhbh Hiunun Hbinun
No ratings yet
l4n JN Uhbh Hiunun Hbinun
36 pages
Advanced Training Course On FPGA Design and VHDL For Hardware Simulation and Synthesis
No ratings yet
Advanced Training Course On FPGA Design and VHDL For Hardware Simulation and Synthesis
20 pages
Lecture Notes 10 - Monday 7/10: Summary of Last Lecture
No ratings yet
Lecture Notes 10 - Monday 7/10: Summary of Last Lecture
5 pages
Speech Signal Processing: A Handbook of Phonetic Science
No ratings yet
Speech Signal Processing: A Handbook of Phonetic Science
24 pages
Cepstral Analysis: Appendix 3
No ratings yet
Cepstral Analysis: Appendix 3
3 pages
1.7A Rational Functions & End Behavior
No ratings yet
1.7A Rational Functions & End Behavior
6 pages
Igital Ignal Rocessing: Balochistan University of Information Technology, Engineering & Management Sciences-Quetta
No ratings yet
Igital Ignal Rocessing: Balochistan University of Information Technology, Engineering & Management Sciences-Quetta
10 pages
Important Questions
No ratings yet
Important Questions
4 pages
Time Frequency Representation of Digital Signals and Systems Based On Short Time
No ratings yet
Time Frequency Representation of Digital Signals and Systems Based On Short Time
15 pages
8 Cepstral Analysis
No ratings yet
8 Cepstral Analysis
24 pages
The Cepstrum Method
No ratings yet
The Cepstrum Method
31 pages
DSP Lecture 2
No ratings yet
DSP Lecture 2
77 pages
R Assumingp Q Q Ci,: Chapter 6 - Speech Analysis
No ratings yet
R Assumingp Q Q Ci,: Chapter 6 - Speech Analysis
6 pages
SP Module 5 PPT L4C
No ratings yet
SP Module 5 PPT L4C
2 pages
Untitled Document
No ratings yet
Untitled Document
7 pages
ps5 Soln Fall09 BK
No ratings yet
ps5 Soln Fall09 BK
21 pages
Discrete Time Processing of Speech Signa
No ratings yet
Discrete Time Processing of Speech Signa
12 pages
Speech Coding and Phoneme Classification Using Matlab and Neuralworks
No ratings yet
Speech Coding and Phoneme Classification Using Matlab and Neuralworks
4 pages
Abstract:: Text-Independent and Dependent Methods. in A Text
No ratings yet
Abstract:: Text-Independent and Dependent Methods. in A Text
11 pages
Pub Remedial Mathematics
No ratings yet
Pub Remedial Mathematics
603 pages
11 Maths Solution Converted 746
No ratings yet
11 Maths Solution Converted 746
111 pages
MATH 115: Lecture II Notes
No ratings yet
MATH 115: Lecture II Notes
3 pages
Digital Signal Processing "Speech Recognition": Paper Presentation On
No ratings yet
Digital Signal Processing "Speech Recognition": Paper Presentation On
12 pages
Analytic Functions: Book: A First Course in Complex Analysis With Applications by Dennis G. Zill and
100% (1)
Analytic Functions: Book: A First Course in Complex Analysis With Applications by Dennis G. Zill and
13 pages
21EC63
No ratings yet
21EC63
3 pages
Second Semester B.Tech University Examination, June-2019 Model Question Paper Mathematics-2
No ratings yet
Second Semester B.Tech University Examination, June-2019 Model Question Paper Mathematics-2
2 pages
4.06 Gradient Divergence Curl and Laplacian
No ratings yet
4.06 Gradient Divergence Curl and Laplacian
8 pages
UEM Sol To Exerc Chap 097
No ratings yet
UEM Sol To Exerc Chap 097
11 pages
Trigonometric Ratio & Identities - DPP 01 (Of Lec 04) - Arjuna JEE 2.0 2024
No ratings yet
Trigonometric Ratio & Identities - DPP 01 (Of Lec 04) - Arjuna JEE 2.0 2024
2 pages
Worksheet 4
No ratings yet
Worksheet 4
2 pages
Shanon Encoding and Fano Encoding, Theorem, Problems On Entropy
No ratings yet
Shanon Encoding and Fano Encoding, Theorem, Problems On Entropy
25 pages
Revision Test 1
No ratings yet
Revision Test 1
2 pages
MA1201 End+Mid Semester Question Paper
No ratings yet
MA1201 End+Mid Semester Question Paper
5 pages
Convergence of Fourier Series
No ratings yet
Convergence of Fourier Series
5 pages
Sample Final Exam SSCE1693 202320242
No ratings yet
Sample Final Exam SSCE1693 202320242
9 pages
Chapter 4.3 Part 1 Circular Functions PDF
No ratings yet
Chapter 4.3 Part 1 Circular Functions PDF
4 pages
MAT9004 Lecture Outline
No ratings yet
MAT9004 Lecture Outline
4 pages
Lecture Ba 01
No ratings yet
Lecture Ba 01
22 pages
Module 2
No ratings yet
Module 2
45 pages
01 - Mean Value Theorem - Typical
No ratings yet
01 - Mean Value Theorem - Typical
17 pages
LINEARIZATION
No ratings yet
LINEARIZATION
10 pages
Vector Differentiation
No ratings yet
Vector Differentiation
18 pages
Gba-Eixg-Ybq - 27 Mar 2024
No ratings yet
Gba-Eixg-Ybq - 27 Mar 2024
4 pages
On Quantum Channels.: Ab A A B
No ratings yet
On Quantum Channels.: Ab A A B
14 pages
Student Reg Form-2024
No ratings yet
Student Reg Form-2024
1 page
Elimination Using Matrices.
No ratings yet
Elimination Using Matrices.
8 pages
Function MCQ-1
No ratings yet
Function MCQ-1
3 pages
Functions L1 06
No ratings yet
Functions L1 06
7 pages
Differentiation Formulas
No ratings yet
Differentiation Formulas
5 pages
Lecture 9 1
No ratings yet
Lecture 9 1
5 pages
Berry (2001) Why Are Special Functions Special
No ratings yet
Berry (2001) Why Are Special Functions Special
3 pages
113-1 Math4018 HW5
No ratings yet
113-1 Math4018 HW5
1 page
Introduction to Partial Differential Equations: From Fourier Series to Boundary-Value Problems
From Everand
Introduction to Partial Differential Equations: From Fourier Series to Boundary-Value Problems
Arne Broman
2.5/5 (2)
Nonlinear Transformations of Random Processes
From Everand
Nonlinear Transformations of Random Processes
Ralph Deutsch
No ratings yet
Loop-shaping Robust Control
From Everand
Loop-shaping Robust Control
Philippe Feyel
No ratings yet
Some Case Studies on Signal, Audio and Image Processing Using Matlab
From Everand
Some Case Studies on Signal, Audio and Image Processing Using Matlab
Dr. Hedaya Mahmood Alasooly
No ratings yet
Filter Bank: Insights into Computer Vision's Filter Bank Techniques
From Everand
Filter Bank: Insights into Computer Vision's Filter Bank Techniques
Fouad Sabry
No ratings yet
Adaptive Filter: Enhancing Computer Vision Through Adaptive Filtering
From Everand
Adaptive Filter: Enhancing Computer Vision Through Adaptive Filtering
Fouad Sabry
No ratings yet

Sp'module 4.pdf'

Uploaded by

Sp'module 4.pdf'

Uploaded by

Module 4: The Cepstrum and

Homomorphic Speech Processing

• Oppenheim developed a new theory of systems that

• To complete the illustration of homomorphic analysis of

You might also like