
The document outlines Homework 2 for a course on Machine Learning for Signal Processing, due on February 23, 2024. It includes four main problems focusing on white noise removal, DCT and PCA analysis, parallax techniques, and GMM for parallax, with specific instructions for data processing and submission format. Students are required to use Jupyter Notebook or MATLAB Livescript, avoid toolboxes, and provide comprehensive reports with visualizations and code submissions.


Machine Learning for Signal Processing (Residential)

(ENGR-E 511; CSCI-B 590)

Homework 2
Due date: Feb. 23, 2024, 23:59 (US Eastern)

Instructions
• Submission format: (Jupyter Notebook or MATLAB Livescript) + HTML
– Your notebook should be a comprehensive report, not just a code snippet. Mark-ups are
mandatory to answer the homework questions. You need to use LaTeX equations in the
markup if you’re asked.
– Google Colab is the best place to begin with if this is the first time using iPython
notebook. No need to use GPUs.
– Download your notebook as an .html version and submit it as well, so that the AIs can
check out the plots and audio. Here is how to convert to html in Google Colab.
– This means you need to embed an audio player in the notebook if you’re asked to submit an
audio file.
• Avoid using toolboxes.

P1: White Noise [3 points]

1. Have you ever wondered what “white” noise means? The name actually comes from light. When
light is the sum of all visible frequencies, it looks white to human eyes. If you pass that
light through a prism, you suddenly see the rainbow colors, the so-called “spectrum.”
Yes, the prism does an analogue version of the Fourier transform.
2. So, even if we don’t see the sound we listen to, if the signal consists of too many sinusoids
with different frequencies, it sounds “white.” I know, I know, it doesn’t make sense.
3. You may also want to note that the sample distribution of a white noise signal looks like a
Gaussian distribution, which is not news to us because we all know the central limit theorem.
4. x.wav is a speech signal contaminated by white noise. As I haven’t taught you guys how to
properly do speech enhancement yet, you’re not supposed to know a machine learning-based
solution to this problem (don’t worry I’ll cover it soon). Instead, you did learn how to do
STFT, so I want you to at least manually erase the white noise from this signal to recover
the clean speech source. For some reason, we know that the white noise added to the signal
doesn’t change its volume over time. So, what we’re going to do is to listen to the sound and
eyeball the spectrogram to find out the frames only with white noise. Then, we will build
our simple noise model, with which we will suppress the noise in the other speech-plus-noise
frames.
(Note: if you use librosa.load, don’t forget to turn off resampling by passing sr=None.)

5. First off, create a DFT matrix F using the equation shown in M02-L01-S11 and S12. You’ll
of course create an N × N complex matrix, but if you look at its real and imaginary parts
separately, you’ll see something like the ones in M02-L01-S14 (the ones in the slide are 20×20,
i.e. N = 20). For this problem let’s fix N = 1024.
6. Prepare your data matrix X. Extract the first frame of N samples from the input signal
and apply a Hann window¹. That is, from the definition of the Hann window, you
create a window of size N and element-wise multiply the window and your N audio samples.
Place the result as the first column vector of the data matrix X. Move by N/2 samples. Extract
another frame of N samples and apply the window. This goes to the second column vector of
X. Do the same for your third frame (which should start from the (N + 1)-th sample), and so on.
Since you move by just half of the frame size, your frames overlap each other by 50%.
(Note: this time it’s okay to use the toolbox to calculate Hann windows.)
7. Apply the DFT matrix to your data matrix, i.e. Y = F X. This is your spectrogram with
complex values. See what it looks like (by taking magnitudes and plotting). For example, you
can use imshow in matplotlib.

8. In this spectrogram, identify frames that contain only noise². For example, the ones at the
end of the signal would be a good choice. Take a sample mean of the chosen column vectors (the
original magnitudes, not the exponentiated ones), e.g.
$M = \frac{1}{|C_{\text{noise}}|} \sum_{i \in C_{\text{noise}}} |Y_{:,i}|$,
where $C_{\text{noise}}$ is the set of chosen frames and $|C_{\text{noise}}|$ is the number of
frames. This is your noise model.
9. Subtract M from all the magnitude spectra, |Y|. This will give you some residual
magnitudes with suppressed noise. Be careful with negative values: you don’t want them in
your “magnitude” spectra. One quick method to remove them is to turn them into zeros.
Get the original phase from the input spectrogram, i.e. Y/|Y| (element-wise division), and
multiply each of the phase values by the corresponding cleaned-up magnitude to recover the
complex-valued spectra of the estimated clean speech.

10. Multiply by the inverse DFT matrix, which you can also create by using the equation in S12.
Let’s call this F∗. Since it’s the inverse transform, F∗F ≈ I (you can check it, although the
off-diagonals might be very small numbers rather than exactly zero). You multiply this matrix by
your spectrogram, which now has suppressed white noise, to get back the recovered version
of your data matrix, X̂. In theory this should give you a real-valued matrix, but you’ll still
see some imaginary parts with very small values. Ignore them by just taking the real part.
Reverse the procedure in item 6 to get the time domain signal. Basically it must be a procedure
that transposes every column vector of X̂ and overlap-and-adds the right half of the t-th row
vector with the left half of the (t + 1)-th row vector, and so on. Listen to the signal to check
if the white noise is suppressed.
¹ https://en.wikipedia.org/wiki/Hann_function
https://docs.scipy.org/doc/scipy/reference/generated/scipy.signal.get_window.html
² Depending on the plotting function you use, it’s possible that you can’t really “see” the white noise. That’s because
your white noise is not loud enough. What you can do to better visualize this spectrogram is to exaggerate the small
magnitudes while suppressing the large ones. For example, I can visualize $|Y|^{0.5}$ instead of $|Y|$, where exponentiation
is element-wise. Don’t worry about this visualization issue if you can see the white noise-only frames in your
spectrogram.

11. Submit your code and the denoised audio file. Do NOT use any STFT functions you can find
in toolboxes.
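
The P1 steps above can be sketched end to end roughly as follows. This is a minimal illustration under stated assumptions, not the reference solution: it assumes an unnormalized forward DFT matrix with entries exp(-j2πfk/N) and takes its conjugate divided by N as the inverse (the slides may use a different scaling), it uses a periodic Hann window so that 50%-overlapped windows sum to one, and it runs on a synthetic signal in place of x.wav with a smaller N. The function names (dft_matrix, hann, frame_signal, overlap_add, denoise) and the chosen noise-only frame indices are hypothetical.

```python
import numpy as np

def dft_matrix(n):
    """n x n DFT matrix with entries exp(-j*2*pi*f*k/n) (assumed unnormalized)."""
    k = np.arange(n)
    return np.exp(-2j * np.pi * np.outer(k, k) / n)

def hann(n):
    """Periodic Hann window; two copies shifted by n/2 add up to one."""
    return 0.5 - 0.5 * np.cos(2 * np.pi * np.arange(n) / n)

def frame_signal(x, n):
    """Windowed frames with hop n/2, stacked as the columns of an n x T matrix."""
    hop = n // 2
    w = hann(n)
    starts = range(0, len(x) - n + 1, hop)
    return np.stack([w * x[s:s + n] for s in starts], axis=1)

def overlap_add(frames):
    """Undo frame_signal: add each column back at its hop position."""
    n, num_frames = frames.shape
    hop = n // 2
    out = np.zeros(hop * (num_frames - 1) + n)
    for t in range(num_frames):
        out[t * hop:t * hop + n] += frames[:, t]
    return out

def denoise(x, noise_frames, n=1024):
    """Subtract the mean noise magnitude, keep the original phase, resynthesize."""
    F = dft_matrix(n)
    F_inv = np.conj(F) / n                      # assumed inverse: F_inv @ F ~ I
    Y = F @ frame_signal(x, n)                  # complex spectrogram
    mag = np.abs(Y)
    M = mag[:, noise_frames].mean(axis=1, keepdims=True)   # noise model
    clean_mag = np.maximum(mag - M, 0.0)        # negative magnitudes become zero
    phase = Y / np.maximum(mag, 1e-12)          # guard against division by zero
    X_hat = np.real(F_inv @ (clean_mag * phase))
    return overlap_add(X_hat)

# Synthetic stand-in for x.wav: a sinusoid in the first half, white noise everywhere.
rng = np.random.default_rng(0)
n = 256
t = np.arange(4096)
x = 0.1 * rng.standard_normal(t.size)
x[:2048] += np.sin(2 * np.pi * 0.05 * t[:2048])

# Frames 20..30 start after sample 2048, so in this toy signal they hold noise only.
x_hat = denoise(x, noise_frames=np.arange(20, 31), n=n)
```

With the real recording, the noise-only frame indices would come from listening to the signal and inspecting the spectrogram, as described in item 8, and N would be 1024.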

P2: DCT and PCA [3 points]

1. s.wav is a recording of Prof. K’s voice. Load it. Randomly select 8 consecutive samples out
of the 5,000,000 samples. This is your first column vector of your data matrix X. Repeat
this procedure 10 times. Then, the size of X is 8 × 10.
2. Calculate the covariance matrix out of this, whose size must be 8 × 8. Do eigendecomposition
and extract 8 eigenvectors, each of which is with 8 dimensions. Yes, you just did PCA. Plot
your W ⊤ matrix and compare it to the DCT matrix shown in M02-L01-S21. Similar? Submit
your plot and code.
3. Create another data matrix with 100 samples, i.e. $X \in \mathbb{R}^{8 \times 100}$. Do PCA on this one. How
about 1,000 samples? Can you see your PCA is getting better with larger datasets? Why do
you feel that your PCA is getting better? Try to explain in comparison with the DCT matrix.

4. You just saw that PCA might be able to replace the pre-fixed DCT basis vectors. But, as
you can see in your matrices, they are not the same. Discuss the pros and cons of PCA and
DCT in your report.
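
The PCA-versus-DCT comparison in P2 could be set up along these lines. This is a hedged sketch: s.wav is replaced by a synthetic low-pass signal, the rows of the returned matrix are assumed to play the role of W⊤, and the DCT matrix is written as an orthonormal DCT-II, which may differ in scaling or ordering from the one in M02-L01-S21. The names random_frames, pca_basis and dct_matrix are hypothetical.

```python
import numpy as np

def random_frames(s, dim=8, count=10, rng=None):
    """Pick `count` random runs of `dim` consecutive samples as columns of X."""
    rng = np.random.default_rng() if rng is None else rng
    starts = rng.integers(0, len(s) - dim + 1, size=count)
    return np.stack([s[i:i + dim] for i in starts], axis=1)

def pca_basis(X):
    """Eigenvectors of the sample covariance, as rows, sorted by decreasing eigenvalue."""
    Xc = X - X.mean(axis=1, keepdims=True)        # center each dimension
    cov = Xc @ Xc.T / (X.shape[1] - 1)            # dim x dim covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)        # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]
    return eigvecs[:, order].T, eigvals[order]

def dct_matrix(n):
    """Orthonormal DCT-II matrix (an assumed convention) for comparison."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    D = np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    D[0] /= np.sqrt(2)
    return D * np.sqrt(2 / n)

# Synthetic stand-in for s.wav: white noise smoothed by a moving average.
rng = np.random.default_rng(0)
s = np.convolve(rng.standard_normal(50_000), np.ones(16) / 16, mode="valid")

D = dct_matrix(8)
for count in (10, 100, 1000):
    W, _ = pca_basis(random_frames(s, dim=8, count=count, rng=rng))
    # |W D^T| holds absolute inner products between PCA and DCT basis vectors.
    similarity = np.abs(W @ D.T)
```

One way to read `similarity` is that entries close to one mark a PCA vector that nearly coincides with a DCT vector up to sign; plotting W next to D, as the problem asks, gives the visual version of the same comparison.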

P3: Parallax [3 points]

1. You live on a planet far away from the earth. Your solar system belongs to a galaxy, which
is about to merge with another galaxy (this is not rare in outer space, but don’t worry, the
merger takes a few billion years). Anyhow, because of this merger, in your deep sky you
see lots of stars from your galaxy as well as stars from the neighboring galaxy.
Of course you don’t know which one is from which galaxy though.

2. You are going to use a technique called “parallax” to solve this problem. It’s actually very
similar to the computer vision algorithm called “stereo matching” that stereoscopic cameras
use to recover 3D depth information from a scene. That’s actually why we humans
can recognize the distance of a visual object (we have two eyes). See Figure 1 for an example.

3. Let’s get back to your remote planet. On your planet, parallax works by taking a picture of
the deep sky in June and another one in December (yes, you have 12 months there, too). If
you compare the two pictures of the deep sky, you see that the stars nearby (i.e. the ones in your
galaxy) change their positions much more between the two pictures, while the stars far away
(i.e. the ones in the neighboring galaxy) change their positions less. See Figures 2 and 3.

4. june.png and december.png are the two pictures you took for this parallax project. In
theory, you need to apply a computer vision technique, called “non-maximum suppression,”
with which you can identify the position of all the stars in the two pictures. In theory, for
each of the stars in june.png, you look for its position in december.png by scanning for a star
in the same row (because the stars always move to the right). But we will not do that here.

5. Instead, you get the (x, y) coordinates of all the stars in the two pictures. june.mat and
december.mat contain the positions of the stars in the two pictures. Each row has two
coordinates, x-coordinate and y-coordinate, and there are 2,700 such rows in each matrix,
each of which is for a particular star. You can load the .mat files in python with the
scipy.io.loadmat(filepath) function³ ⁴.

Figure 1 (panels: Left image, Right image): The tree is closer than the mountain. So, from the
left camera, the tree is located on the right hand side, while the right camera captures it on
the left hand side. On the contrary, the mountain in the back does not have this disparity.

Figure 2 (labels: Your planet in June; Your sun; A star in your galaxy; Your planet in Dec.;
The other stars in the neighboring galaxy): You take two pictures of the same area of the sky
in June and December, respectively. Because of the pretty big movement of your planet due to
its revolution, you can see that the close-by stars oscillate more in the two pictures than the
far-away ones, just like the tree and the mountain in Figure 1.

Figure 3 (panels: June, Dec): The oscillation of the close star (green) in the two pictures.
Note that the other stars (blue) don’t move much.

6. If you take the first column vectors of the two matrices (i.e. the x-coordinates of the stars),
you can subtract the ones in June from the ones in December to find out their disparity, or
the amount of oscillation, which is a vector of 2,700 elements. Draw a histogram of these
disparity values to gauge if you can see two clusters there (pay attention to the bin size).
Submit your histogram and answer in the report.
7. Write your own k-means clustering code, and apply it to this disparity dataset. Find out the
cluster means and report the values. Which one do you think corresponds to the stars in your
galaxy and which is for the other galaxy? Why do you think so? Justify your answer in the
report.
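
The disparity computation and the k-means step in P3 might look roughly like this. It is only a sketch under assumptions: the disparities are synthetic (two Gaussian clumps with made-up centers), since the layout of june.mat and december.mat is not given here; the int16 cast follows the overflow footnote and assumes the stored coordinates are small unsigned integers; and the initialization at the data minimum and maximum is one simple choice among many. The names disparity_from_columns and kmeans_1d are hypothetical.

```python
import numpy as np

def disparity_from_columns(june_x, dec_x):
    """December minus June, done in int16 so unsigned inputs cannot wrap around."""
    return np.asarray(dec_x).astype(np.int16) - np.asarray(june_x).astype(np.int16)

def kmeans_1d(x, k=2, iters=100):
    """Plain k-means on scalars, initialized evenly between min and max."""
    x = np.asarray(x, dtype=float)
    means = np.linspace(x.min(), x.max(), k)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest mean.
        labels = np.argmin(np.abs(x[:, None] - means[None, :]), axis=1)
        # Update step: each mean becomes the average of its points.
        new_means = np.array([
            x[labels == j].mean() if np.any(labels == j) else means[j]
            for j in range(k)
        ])
        if np.allclose(new_means, means):
            break
        means = new_means
    labels = np.argmin(np.abs(x[:, None] - means[None, :]), axis=1)
    return means, labels

# Why the cast matters: uint8 subtraction wraps around instead of going negative.
a = np.array([10], dtype=np.uint8)
b = np.array([5], dtype=np.uint8)
wrapped = b - a                          # wraps modulo 256
safe = disparity_from_columns(a, b)      # signed result

# Synthetic stand-in for the 2,700 disparity values.
rng = np.random.default_rng(0)
d = np.concatenate([rng.normal(2.0, 1.0, 1500), rng.normal(10.0, 1.0, 1200)])

counts, edges = np.histogram(d, bins=50)   # try several bin counts
means, labels = kmeans_1d(d, k=2)
```

The histogram counts can be passed to any plotting routine; changing `bins` shows how the bin size affects whether two clusters are visible.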

P4: GMM for Parallax [3 points]

1. Implement your own EM algorithm for GMM to propose an alternative solution to the parallax
problem. Report your means. Explain which one you prefer between the k-means solution
and the GMM results. Justify your answer.
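
An EM loop for a one-dimensional GMM, as asked for in P4, could be sketched as below. The assumptions are that each component has its own mean, variance and mixing weight, that the means start at evenly spaced points between the data minimum and maximum, and that the data are synthetic stand-ins for the real disparities. The name gmm_em_1d and the stopping tolerance are hypothetical choices.

```python
import numpy as np

def gmm_em_1d(x, k=2, iters=500, tol=1e-10):
    """EM for a 1-D Gaussian mixture; returns (weights, means, variances)."""
    x = np.asarray(x, dtype=float)
    means = np.linspace(x.min(), x.max(), k)
    variances = np.full(k, x.var())
    weights = np.full(k, 1.0 / k)
    prev_ll = -np.inf
    for _ in range(iters):
        # E-step: log of weight_j * N(x_i | mean_j, var_j), normalized per sample.
        diff = x[:, None] - means[None, :]
        log_joint = (np.log(weights)
                     - 0.5 * np.log(2 * np.pi * variances)
                     - 0.5 * diff ** 2 / variances)
        peak = log_joint.max(axis=1, keepdims=True)
        log_norm = peak + np.log(np.exp(log_joint - peak).sum(axis=1, keepdims=True))
        resp = np.exp(log_joint - log_norm)       # responsibilities, rows sum to one

        # M-step: responsibility-weighted updates of the parameters.
        nk = resp.sum(axis=0)
        weights = nk / x.size
        means = (resp * x[:, None]).sum(axis=0) / nk
        variances = (resp * (x[:, None] - means) ** 2).sum(axis=0) / nk + 1e-9

        # Stop when the log-likelihood (of the pre-update parameters) stalls.
        ll = log_norm.sum()
        if ll - prev_ll < tol:
            break
        prev_ll = ll
    return weights, means, variances

# Synthetic stand-in for the disparity data, with unequal cluster spreads.
rng = np.random.default_rng(0)
d = np.concatenate([rng.normal(2.0, 1.0, 1500), rng.normal(12.0, 1.5, 1200)])
weights, means, variances = gmm_em_1d(d, k=2)
```

Unlike k-means, this returns a variance and a mixing weight per cluster along with the means, which is one angle for comparing the two solutions.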

³ https://docs.scipy.org/doc/scipy/reference/generated/scipy.io.loadmat.html
⁴ Also, you might have to cast the values to int16 in python to avoid overflow.
