FPP CV
FPP CV
YEAR/SEMESTER/DEPT : III/VI/ECE
DEPARTMENT : ECE
PREPARED BY APPROVED BY
SIGNATURE
DATE
To impart education that caters to the growing challenges of the industry and social
needs of our nation.
To constantly upgrade the standards of teaching and learning in the field of
engineering and technology while promoting a healthy research atmosphere.
To foster a healthy symbiosis with the industry through meaningful and dynamic
interactions.
DEPARTMENT VISION
DEPARTMENT MISSION
Constantly upgrade engineering pedagogy that caters to the growing challenges
of the industry.
Develop conceptual learning that leads towards critical and innovative thinking.
Establish good harmony with industry that fills the gap between academia and
the outside world enabling the students to prepare for diverse and competitive
career paths.
To endorse higher studies and pursue research in the ECE discipline with
sensitivity toward societal requirements.
Course Syllabus
Course Plan
Ref No.: RIT/IQAC/AcdAct/LP/I001/2.0
Proposed Books
Session Date/ Topics to be covered Delivery
Text/
Period Methods
Reference
Introduction, Course outcome, The
1 Human speech production T1 BB, PPT
mechanism
Discrete-Time model of speech T1 BB, PPT
production, Speech
2 perception - human auditory
system
Phonetics - articulatory phonetics, T1 BB, Video
acoustic phonetics, and
3 auditory phonetics
Categorization of speech sounds, T1 BB
4 Spectrographic analysis of speech
sounds
Pitch frequency, Pitch period T1 BB,PPT
5 measurement using spectral and
cepstral domain
6 Formants, T1 BB
Evaluation of Formants for voiced
and unvoiced speech.
No. of Periods allotted by the University: 6
Books
Proposed
Session Date/ Topics to be covered Text/
Period Delivery
Reference Methods
4 Speech transformations T2 BB
T1
Shaila D. Apte, Speech and Audio Processing, Wiley India (P) Ltd, New Delhi, 2012
T2 Philipos C. Loizou, Speech Enhancement Theory and Practice, Second Edition, CRC Press,
Inc., United States, 2013
ASSIGNMENT I
Assignment I
1. Record a speech waveform for different consonants like /t/, /s/, and /p/ using a .wav
file. Write a MATLAB program to read a wav file and plot it. (Hint: See sample file plots
in the text.)
2. Record a speech waveform for a sentence and take the V/UV decision for a speech
segment based on its energy. (Hint: See the plot for a sentence and calculate energy of
a segment of size 256.)
3. Write a MATLAB program for calculation of ZCR and measure it for .wav files of /a/ and
/h/. (Hint: Use a MATLAB function and calculate ZCR for a speech segment recorded.)
4. Write a MATLAB program for computing log energy for the voiced and unvoiced parts
of an utterance and compare the results. (Hint: Open the utterance file. Track the
voiced and unvoiced segments by observing the plot of signal. Find energy of each
segment and find its log. You will observe that log energy will be high for voiced
segment.)
Assignment II
1. Write a program to find the impulse response coefficients for a vocal tract. Execute it
and interpret your results.
2. Write a program to find the cepstral coefficients for a speech segment of size 512
samples. Execute it and interpret your results. Record a speech file with sampling
frequency of 11050 Hz.
3. Write a program to covert the frequency in Hz to frequency in mel scale and Bark scale.
Execute it and interpret your results.
4. Write a program to find the mel frequency cepstral coefficients for a speech segment of
size 512. Execute it and interpret your results. Record a speech file with sampling
frequency of 11050 Hz.
5. Write a program to find the PLP coefficients for a speech segment. Execute it and
interpret your results. Record a speech file with sampling frequency of 11050 Hz.
Assignment III
1. A speech signal has a root mean square amplitude of 3 V, and is to be coded using six
bits. Find the step size and the quantization noise power.
2. A speech signal has a root mean square amplitude of 2 V. The signal is to be coded using
six bits. The signal density is assumed to be Laplacian. Find the step size. Find
quantization noise power. Find the SNR.
3. Let the signal samples be given by 1.8, 2.4, 3.5, 5.1, 6.2, 9.7, 11.2, 13.2, 12.6, 10.2, 7.3,
5.9, 8, 7.4, 5.2, 3.1, and 1.8. Find the step size and quantization noise power for a signal
and for a difference signal if a four-bit quantizer is used.
4. Consider a sequence to be quantized having sample values given as 0.1, −0.2, 0.2, 0.1,
−0.3, 0.1, etc. Find the updated step sizes, output levels and quantized output values.
5. The signal samples have values in volts given by 0.2, 0.4, 0.55, 0.7, 0.75, 0.85, 0.9, 1.0,
0.9, 0.83, 0.77, 0.66, 0.5, 0.4, 0.29, and 0.01. Find the four-bit coded value for all 16
samples. Find the quantization noise power. 3. A speech signal has a root mean square
amplitude
Assignment IV
1. Draw the block schematic for the spectral subtraction method for wideband noise
removal and explain it. What is the modification used in the basic method for modified
spectral subtraction?
2. Explain the use of adaptive noise canceller for speech enhancement.
3. Record a sentence in your voice. Add any noise to it. (You may use a standard NOIZEUS
speech database.) Design a comb filter to eliminate the noise.
Assignment V
1. Record a sentence in your voice. Find the cepstrum for successive segments of size 128
samples and insert a logo consistin of a small binary image as a watermark.
2. Record a sentence in your voice. Try to find WT for successive segments of size, say, 128
samples and insert a logo consisting of a small binary image as a watermark in different
bands of wavelet decomposition and try to recover the watermark.