
Lab 6: Shazam Part II

Dinh Bao Dan-V202200850

1. Introduction

The Shazam Beta version implements a simplified version of the Shazam program: it pre-processes the
songs in the database, extracts their features, and builds a hash table. Then, for any clip we want
to identify, the Beta program follows the same procedure, looks the features up in the hash table,
and identifies the clip by finding the most similar features.

In Shazam Part II, we try to improve the Beta program so that it is more robust against noise and
more time efficient. For example, we can add filters in the preprocessing step and use a more
efficient data structure for the database.

2. Design Considerations

A. Time Performance:
Many algorithms have a training part and a test part. The training part only needs to run once and
can then be reused for every test. In our code, make_database creates a hash table that stores the
information of all songs, so it does not need to run every time the main file is called. Instead,
we save the hash table variable as a .mat file and load it.
IMPLEMENTATION:
make_database.m
function hashTable = make_database(gs,deltaTL,deltaTU,deltaF)
table = [];
for i = 1:50
    toRead = strcat('songDatabaseShort/', num2str(i), '.mat');
    tempTable = make_table(toRead,gs,deltaTL,deltaTU,deltaF,1);
    songID = i .* ones(size(tempTable,1),1);    % tag every feature row with its song ID
    table = [table; tempTable, songID];
end
hashTable = hash(table);                        % build the hash table once, after all songs are read
save('hashTable.mat','hashTable');              % save so later runs can simply load it
end

Main.m
%% Part I: Generate a data base
% hashTable = make_database(gs,deltaTL,deltaTU,deltaF);
% The make_database step is removed; the preprocessed database is loaded instead.
load('hashTable.mat');
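
A possible refinement (a sketch, not part of the submitted code) is to rebuild the database only when hashTable.mat does not exist yet, so the main file still works on a machine where the training step has never been run:

% Sketch: rebuild the database only if the saved file is missing
if isfile('hashTable.mat')
    load('hashTable.mat');                                  % reuse the precomputed hash table
else
    hashTable = make_database(gs,deltaTL,deltaTU,deltaF);   % one-time training step
    save('hashTable.mat','hashTable');
end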

Verification 1: I ran the verification on all 50 songs. This is the result without loading the
preprocessed database:
With the preprocessed database:

B. Preprocessing: Filters
For pre-processing we can consider filters. For this lab, some high-frequency Gaussian noise was
added to the first clip to obtain '1Noise.mat' in the folder 'songHighNoise'.

One possible way to reduce the high-frequency Gaussian noise is a low-pass filter. Of course, there
are many other filters we could use, such as a bandpass filter.

Listen to the noisy sound using the sound(y, Fs) command, then filter the sound and play it again.
Do you hear any improvement in the music?
IMPLEMENTATION: Adding a low-pass filter
lowpassfilter.m
function filter_clip = lowpassfilter(clip, Fs)
cutoff_frequency = 2000;                 % cutoff at 2 kHz
wn = cutoff_frequency / (Fs/2);          % normalize by the Nyquist frequency Fs/2
[B, A] = butter(2, wn, 'low');           % 2nd-order Butterworth low-pass filter
filter_clip = filter(B, A, clip);        % apply the filter to the clip
end
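
A quick listening test, as suggested above (a sketch; it assumes '1Noise.mat' stores the same y and Fs variables as the database songs):

% Sketch: compare the noisy clip and the filtered clip by ear
noisy = load('songHighNoise/1Noise.mat');       % assumed to contain y and Fs
sound(noisy.y, noisy.Fs);                       % play the noisy clip
pause(length(noisy.y)/noisy.Fs + 1);            % sound() does not block, so wait for playback
filtered = lowpassfilter(noisy.y, noisy.Fs);
sound(filtered, noisy.Fs);                      % play the filtered clip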

make_table.m
function table = make_table(song,gs,deltaTL,deltaTU,deltaF,mode)
myVars = {'y','Fs'};
clip = load(song, myVars{:});
% Step 1. Preprocessing
if (mode == 2)
    clip.y = lowpassfilter(clip.y, clip.Fs);   % filter only the clips, not the database songs
end
y = clip.y;
Fs = clip.Fs;
... with the rest kept the same
% make_table now has 2 modes: mode 1 builds the table for a database song, mode 2 builds
% the table for a clip, and only mode 2 applies the filtering step.

Verification 2: The noise in the high-noise files is indeed reduced, but some noise remains, so we
should try another filter. We tried a notch filter and clearly hear less noise.
notch_filter.m
function filteredSong = notch_filter(input, samplingFreq)
input = input(:);
N = length(input);

% Frequency axis for the shifted FFT, from -Fs/2 up to (but not including) +Fs/2
stepSize  = samplingFreq/N;
startFreq = -samplingFreq/2;
endFreq   = samplingFreq/2 - stepSize;
freq = startFreq:stepSize:endFreq;

X = fftshift(fft(input))/N;              % centered spectrum of the clip
X_magnitude = abs(X(:));

r = 0.985;                               % pole radius: controls the notch width
peakDist = N * 0.0005;                   % minimum distance between detected peaks
threshold = 1.4;                         % how dominant a peak must be to count as a noise tone

% Find the four largest spectral peaks
[peakValues, peakLocations] = findpeaks(X_magnitude, 'SortStr', 'descend', ...
    'MinPeakDistance', peakDist, 'NPeaks', 4);

% Ratio between the strongest and the third-strongest peak
if length(peakValues) >= 3
    ratio = peakValues(1) / peakValues(3);
else
    ratio = 0;
end

% First apply the same low-pass filter as before
cutoff_frequency = 2000;
wn = cutoff_frequency / (samplingFreq / 2);
[B, A] = butter(2, wn, 'low');
input = filter(B, A, input);

% If one tone clearly dominates the spectrum, notch it out
if ratio > threshold
    posPeak = freq(peakLocations(2));                     % frequency of the second-largest peak
    if posPeak > 1000
        filterFreq = (posPeak * 2 * pi) / samplingFreq;   % notch frequency in rad/sample
        a = [1, -2 * r * cos(filterFreq), r * r];         % poles just inside the unit circle
        b = [1, -2 * cos(filterFreq), 1];                 % zeros on the unit circle at the notch
        filteredSong = filter(b, a, input);
    else
        filteredSong = input;
    end
else
    filteredSong = input;
end
end
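
One way to use it (a sketch, not taken from the submitted code) is to swap the filter call in make_table's preprocessing step, so that clips go through the notch filter instead of the plain low-pass filter:

% Sketch: use the notch filter for clips in make_table (mode 2)
if (mode == 2)
    clip.y = notch_filter(clip.y, clip.Fs);   % notch_filter already applies the low-pass stage internally
end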

C. Spectrogram: Window Size and Overlap
When we apply the spectrogram to the signal, we fix the window size and the overlap. The larger
the window size, the less data we will have; likewise, the smaller the overlap, the less data
there will be. Less data improves the running time of the code, but it also reduces the matching
accuracy.

We want our algorithm to work well on shorter clips. The zip file songDataBaseShort.zip stores a
random 15-second clip from each song. Try your algorithm on these shorter clips to see how the
accuracy is affected.

Verification 3: Try different combinations of the window size and the overlap. Use the combination
that gives good accuracy while keeping the running time short. Do not forget to retrain your hash
table when you change the parameters. If it takes too long, you can reduce the number of songs
from 50 to 10.
Implementation:
Original window size and overlap:
window = 64 * 10^-3 * new_Fs;
noverlap = 32 * 10^-3 * new_Fs;
Best combination found:
window = round(64 * 10^-3 * new_Fs);
noverlap = round(8 * 10^-3 * new_Fs);
nfft = round(64 * 10^-3 * new_Fs);

The running time for 50 songs is reduced by 0.4 s, which is about a 14% improvement without
affecting accuracy.
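
For reference, these parameters feed MATLAB's spectrogram call in the feature-extraction step (a sketch; the exact call in make_table may differ, and new_Fs is assumed to be the sampling rate used there):

% Sketch: spectrogram with the tuned window / overlap / nfft values
window   = round(64 * 10^-3 * new_Fs);     % 64 ms window
noverlap = round(8  * 10^-3 * new_Fs);     % 8 ms overlap between adjacent windows
nfft     = round(64 * 10^-3 * new_Fs);
[S, F, T] = spectrogram(y, window, noverlap, nfft, new_Fs);
log_S = log10(abs(S) + eps);               % assumed log-magnitude step (section D works on a matrix named log_S)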
D. Feature Extraction: Spectrogram Local Peaks, Local Troughs and Window Size
Previously, we used the local peaks (local maxima) to construct the boolean matrix
localPeakLocation. An alternative is to use local troughs (local minima).
IMPLEMENTATION:
%% Step 3. Feature Extraction
array = -floor(gs/2):floor(gs/2);               % offsets covering a gs-by-gs neighbourhood
localPeakLocation = ones(size(log_S));

for i = 1:gs
    for j = 1:gs
        if (array(i) == 0 && array(j) == 0)
            continue;                           % skip the zero shift (the point itself)
        end
        CA = circshift(log_S, [array(i), array(j)]);
        % Keep only points smaller than this shifted neighbour (local troughs)
        localPeakLocation = (log_S - CA < 0) .* localPeakLocation;
    end
end

To extract troughs instead of peaks, we only need to change the “>” sign to a “<” sign in the
comparison above. With the original window size and noverlap we get the result:

This means that the speed increases a little when extracting troughs instead of peaks.
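
For clarity, the one-line difference between the two variants (shown as comments only):

% Peak version (local maxima): keep points larger than every shifted neighbour
%   localPeakLocation = (log_S - CA > 0) .* localPeakLocation;
% Trough version (local minima), used here: keep points smaller than every shifted neighbour
%   localPeakLocation = (log_S - CA < 0) .* localPeakLocation;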

E) Conclusion
In this lab, we tried several ways to speed up the algorithm that finds a song from a short clip.
First, we created a preprocessed database so that the song data does not have to be reprocessed
every time the program runs. Next, we tried a low-pass filter (and later a notch filter) to remove
noise from the clip, which makes the clip easier to analyse and match to its original song. Lastly,
we switched from extracting peaks to extracting troughs, which also gives a small improvement in
speed.

