Proiect ME

This document summarizes a process for analyzing music data using spectral analysis and k-means clustering. It first describes accessing a dataset of 10,000 songs from the Million Song Dataset in HDF5 format. It then explains performing feature extraction on the data using fast Fourier transforms (FFT) to project the high-dimensional input to a lower dimension. Finally, it outlines applying k-means clustering to group the features into K clusters, iterating until cluster centroids converge. Potential weaknesses of k-means clustering are also noted.

Uploaded by

Razvan Mazilu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

91 views11 pages

Proiect ME

Uploaded by

Razvan Mazilu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 11

Million Song Dataset

Feature extraction with Spectral Analysis

Classification with k-Means algorithm
Data Set used (1)

MillionSongSubset from https://labrosa.ee.columbia.edu. 10000 songs (1%

from Million Song Dataset) selected random.
Data are in HDF5 format, which is a dedicated format to organize big data
arrays.
I have used a Matlab wrapper in order access the from the HDF5 files. This
wrapper was found on https://labrosa.ee.columbia.edu also.
Data Set used (2)

• Data for each song is wrapped in a .h5 . It looks like in the bellow pictures:
• There are no audio signal data, only metadata like year,
artist…
Input set

1000 arrays like in picture with ascii code of songs

name
Feature extraction using Spectral Analysis

• Features extraction means to create a projection form a M dimensional

space of the input features to N dimensional space (N < M). The new
features from the N dimensional spaces shall be uncorrelated.
• Spectral Analysis can be done using FFT, which is already implemented in
MATLAB. The function for FFT is fft();
Apply fft to input data

• we observe that only the first element has a

significant value
• we are going to select only 1st element from
each row from the input data.
Classification using K-means algorithm

• Classification using K-means algorithm means to group the input features in K

clusters using an iterative method.
• Steps for K-means algorithm are next ones:
• Set randomly K centroids in input features spaces.
• Calculate distances from each features to the all centroids and assign the feature to the
closest one.
• Recalculate the centroids based on the features in each cluster.
• Repeat until convergence (there is no more features which change the cluster from they
appear)
K Means Clustering

http://rossfarrelly.blogspot.ro/2012/12/k-meansclustering.html
Weakness of K-means Algorithm

• It is not robust to outliners. Very far data from the centroid, will pull the centroid away from
the real one
• The result is circular cluster shape because is based on distance
• Sensitive to initial condition. Different initial condition may produce different result of
cluster. The algorithm may be trapped in the local optimum.
• When the numbers of data are not so many, initial groping will determine the cluster
significantly

http://people.revoledu.com/kardi/tutorial/kMean/Weakness.htm
Thank you!

76 Case Study
100% (1)
76 Case Study
7 pages
Robert Venturi - Idea of A Duck and A Decorated Shed
No ratings yet
Robert Venturi - Idea of A Duck and A Decorated Shed
7 pages
1 s2.0 S0031320319301608 Main
No ratings yet
1 s2.0 S0031320319301608 Main
18 pages
Machine Learning-4
No ratings yet
Machine Learning-4
73 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
Image Enhancement Image Filtering
No ratings yet
Image Enhancement Image Filtering
167 pages
Report 1
No ratings yet
Report 1
3 pages
20 ENG 016 Assignment 8
No ratings yet
20 ENG 016 Assignment 8
4 pages
A Tutorial On Clustering Algorithms
No ratings yet
A Tutorial On Clustering Algorithms
4 pages
02.1 K-Means Example
No ratings yet
02.1 K-Means Example
12 pages
K Means
No ratings yet
K Means
33 pages
Detecting Patterns With Unsupervised Learning
No ratings yet
Detecting Patterns With Unsupervised Learning
21 pages
Week 7 Kmeans
No ratings yet
Week 7 Kmeans
18 pages
Intro Data Science: Cluster Analysis
No ratings yet
Intro Data Science: Cluster Analysis
60 pages
Clustering
No ratings yet
Clustering
55 pages
K.means Clustering
No ratings yet
K.means Clustering
8 pages
K Means
100% (2)
K Means
329 pages
WS - Data Analytics Fundamental-R
No ratings yet
WS - Data Analytics Fundamental-R
51 pages
Kmeans R
No ratings yet
Kmeans R
2 pages
Session 18-Cluster Analysis
No ratings yet
Session 18-Cluster Analysis
20 pages
Medical Imabmnge Analysis
No ratings yet
Medical Imabmnge Analysis
41 pages
Clustering Part1
No ratings yet
Clustering Part1
84 pages
Algorithms New
No ratings yet
Algorithms New
8 pages
Clustering Fraud Detection
No ratings yet
Clustering Fraud Detection
45 pages
Week 9 Notes
No ratings yet
Week 9 Notes
6 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
SUMERA - Kmeans Clustering - Jupyter Notebook
No ratings yet
SUMERA - Kmeans Clustering - Jupyter Notebook
7 pages
Aam Unit 4 QB With Answer
No ratings yet
Aam Unit 4 QB With Answer
11 pages
Da Exp 10 66
No ratings yet
Da Exp 10 66
6 pages
K-Means Clustering - MATLAB Kmeans
No ratings yet
K-Means Clustering - MATLAB Kmeans
23 pages
K Means Clustering in R Example - Learn by Marketing
No ratings yet
K Means Clustering in R Example - Learn by Marketing
3 pages
JAVIER KMeans Clustering Jupyter Notebook
No ratings yet
JAVIER KMeans Clustering Jupyter Notebook
7 pages
Day 3
No ratings yet
Day 3
74 pages
ML Exercises 4 5 6 en
No ratings yet
ML Exercises 4 5 6 en
4 pages
FullMarks - Clustering StudentSolution 2
No ratings yet
FullMarks - Clustering StudentSolution 2
13 pages
K Mean
No ratings yet
K Mean
12 pages
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
No ratings yet
What Is Cluster Analysis?: - Cluster: A Collection of Data Objects
77 pages
Overview of Clustering:: UNIT-5
No ratings yet
Overview of Clustering:: UNIT-5
27 pages
CS-3035 (ML) - CS End April 2024
No ratings yet
CS-3035 (ML) - CS End April 2024
21 pages
Unit 5 ML
No ratings yet
Unit 5 ML
38 pages
AST2
No ratings yet
AST2
107 pages
Kmeans&Variants
No ratings yet
Kmeans&Variants
29 pages
ADL LAB Manual
No ratings yet
ADL LAB Manual
27 pages
Kernel K-Means, Spectral Clustering and Normalized Cuts: Inderjit S. Dhillon Yuqiang Guan Brian Kulis
No ratings yet
Kernel K-Means, Spectral Clustering and Normalized Cuts: Inderjit S. Dhillon Yuqiang Guan Brian Kulis
6 pages
Wk03 Machine Learning
No ratings yet
Wk03 Machine Learning
5 pages
Feature Extraction Techniques Using Support Vector Machines in Disease Prediction
No ratings yet
Feature Extraction Techniques Using Support Vector Machines in Disease Prediction
8 pages
Machine Learning Techniques
No ratings yet
Machine Learning Techniques
8 pages
Analysis and Study of K Means Clustering Algorithm IJERTV2IS70648
No ratings yet
Analysis and Study of K Means Clustering Algorithm IJERTV2IS70648
6 pages
R Cluster Analysis
No ratings yet
R Cluster Analysis
5 pages
U L D R: Nsupervised Earning and Imensionality Eduction
No ratings yet
U L D R: Nsupervised Earning and Imensionality Eduction
58 pages
K Means
No ratings yet
K Means
25 pages
Da Exp 10
No ratings yet
Da Exp 10
6 pages
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
No ratings yet
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
16 pages
Data Mining - Clustering
No ratings yet
Data Mining - Clustering
90 pages
3ML.03.Feature Reduction
No ratings yet
3ML.03.Feature Reduction
44 pages
ml2 1
No ratings yet
ml2 1
7 pages
ML - Unit - 2
No ratings yet
ML - Unit - 2
13 pages
Kernel Clustering
No ratings yet
Kernel Clustering
57 pages
K-Means Clustering Method For The Analysis of Log Data
No ratings yet
K-Means Clustering Method For The Analysis of Log Data
3 pages
Unit 4
No ratings yet
Unit 4
46 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
From Everand
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
Fouad Sabry
No ratings yet
Anish Goli - AMSCO 7.2 & 7.3 World War I Reading & G.O 20-21
No ratings yet
Anish Goli - AMSCO 7.2 & 7.3 World War I Reading & G.O 20-21
6 pages
Dow Theory: II. The Market Has Three Trends
No ratings yet
Dow Theory: II. The Market Has Three Trends
2 pages
Annexure A - Common Library Specifications Unified Payment Interface
No ratings yet
Annexure A - Common Library Specifications Unified Payment Interface
16 pages
RRB AlpTech CBT 2 Paper With Official Answer Key Trade Electrician
No ratings yet
RRB AlpTech CBT 2 Paper With Official Answer Key Trade Electrician
54 pages
Autoclave - Device
No ratings yet
Autoclave - Device
12 pages
Porter Five Forces Analysis
No ratings yet
Porter Five Forces Analysis
19 pages
Optimum PM and Reliability Centred Spares
100% (2)
Optimum PM and Reliability Centred Spares
79 pages
Ultrasonography In Obstetrics And Gynecology 5th Edition Peter W Callen Md pdf download
No ratings yet
Ultrasonography In Obstetrics And Gynecology 5th Edition Peter W Callen Md pdf download
30 pages
Ge 101
No ratings yet
Ge 101
10 pages
Math1 Functions
No ratings yet
Math1 Functions
3 pages
Glass PDF
No ratings yet
Glass PDF
5 pages
Chrysanthemums Propagation
No ratings yet
Chrysanthemums Propagation
8 pages
Force and Destiny Endless Vigil Sentinel Career PDF
No ratings yet
Force and Destiny Endless Vigil Sentinel Career PDF
99 pages
Uq Graduate School Thesis Preparation
100% (2)
Uq Graduate School Thesis Preparation
8 pages
The Challenges of Evidence-Based Medicine-Kulkarni2005
No ratings yet
The Challenges of Evidence-Based Medicine-Kulkarni2005
6 pages
EasyJet EM Failure Case Study
No ratings yet
EasyJet EM Failure Case Study
4 pages
Chapter-3: Coordinate Geometry
No ratings yet
Chapter-3: Coordinate Geometry
11 pages
Arima Garch 11 Modelling and Forecasting For A Ge Stock Price Using R
No ratings yet
Arima Garch 11 Modelling and Forecasting For A Ge Stock Price Using R
20 pages
4 Hofstede Summary
100% (2)
4 Hofstede Summary
6 pages
Chemistry 6th Edition Gilbert Solution Manual Unlocked Test Bank
No ratings yet
Chemistry 6th Edition Gilbert Solution Manual Unlocked Test Bank
332 pages
Project Proposal CG Final
100% (2)
Project Proposal CG Final
3 pages
The Internal Anatomy of A Frog
No ratings yet
The Internal Anatomy of A Frog
7 pages
Seminar Report On I-Vtec
60% (5)
Seminar Report On I-Vtec
31 pages
One - Sample & Independent Sample T-Tests
No ratings yet
One - Sample & Independent Sample T-Tests
4 pages
Inspired To Be GREEN - Vol 11
No ratings yet
Inspired To Be GREEN - Vol 11
48 pages
2023 Scenario 1 Worksheet - Org Goal & HR Planning
No ratings yet
2023 Scenario 1 Worksheet - Org Goal & HR Planning
4 pages
Fce Listening Practice Test 6
No ratings yet
Fce Listening Practice Test 6
4 pages
The Girl Who Can
100% (1)
The Girl Who Can
2 pages

Proiect ME

Uploaded by

Proiect ME

Uploaded by

Million Song Dataset

Feature extraction with Spectral Analysis

MillionSongSubset from https://labrosa.ee.columbia.edu. 10000 songs (1%

1000 arrays like in picture with ascii code of songs

• Features extraction means to create a projection form a M dimensional

• we observe that only the first element has a

• Classification using K-means algorithm means to group the input features in K

You might also like