
FOUNDATIONS OF MACHINE LEARNING
M.SC. IN DATA SCIENCES AND BUSINESS ANALYTICS
CENTRALESUPÉLEC

Lab 7: k-Means and Spectral Clustering Algorithms

Instructor: Fragkiskos Malliaros
TA: Benjamin Maheu
December 2, 2021

1 Description
In this lab, we will study unsupervised learning techniques, focusing on two well-known clustering
algorithms: (i) k-Means and (ii) Spectral Clustering. Initially, we discuss the basic characteristics of each
algorithm and then we examine how they can be applied to identify the underlying clustering structure
of a dataset.

2 k-Means Algorithm
k-means is one of the simplest unsupervised learning algorithms that solve the well-known clustering
problem. The goal of a clustering algorithm is to split the instances of the dataset into k clusters, where
each instance is assigned to the closest cluster as defined by a distance function. The main idea is to
define k centers (centroids), one for each cluster. The algorithm defines an iterative process in which the
following two steps take place at each iteration: (i) take each instance of the dataset and assign
it to the nearest centroid, and (ii) re-calculate the centroids of each of the k clusters. Thus, the k centroids
change their location step by step until no further changes occur.
More formally, suppose that we are given a dataset X = {x1, x2, . . . , xm}, where each xi ∈ Rn.
The goal of the k-means algorithm is to group the data into k cohesive clusters, where k is an input
parameter of the algorithm. Algorithm 1 gives the pseudocode of k-means.
In the algorithm above, k is a parameter of the algorithm and corresponds to the number of clusters
we want to find; the cluster centroids µj represent our current guesses for the positions of the centers of
the clusters. To initialize the cluster centroids (in step 1 of the algorithm), we could choose k training
examples randomly and set the cluster centroids to be equal to the values of these k examples. Of course,
other initialization methods are also possible, such as the k-means++ technique¹. To find the closest
centroid, a distance (or similarity) function should be defined; typically the Euclidean distance is used.
Based on this notion of similarity, the problem of clustering can be reduced to the problem of finding
appropriate centroids.

¹ Wikipedia's entry for k-means++: http://en.wikipedia.org/wiki/K-means++.

Algorithm 1 k-Means Clustering Algorithm
Input: Dataset X = {x1, x2, . . . , xm}, where each xi ∈ Rn, and parameter k
Output: Clusters C1, C2, . . . , Ck (i.e., cluster assignments of each instance, C = {c1, c2, . . . , cm})

1: Initialize cluster centroids µ1, µ2, . . . , µk by choosing k instances of X randomly
2: repeat
3:   Assign each instance xi ∈ X to the closest centroid, i.e., ci = arg min_j ‖xi − µj‖
4:   Re-compute the centroids µ1, µ2, . . . , µk of each cluster as µj = (1/nj) Σ_{x ∈ Cj} x, where Cj (j = 1, . . . , k) is the j-th cluster and nj is its size
5: until the centroids do not change (convergence)

This, in turn, can be expressed as the task of minimizing the following objective function:

E(k) = Σ_{j=1}^{k} Σ_{xi ∈ Cj} ‖xi − µj‖².   (1)

Thus, minimizing Eq. (1) amounts to determining suitable centroids µj such that, if the data is partitioned
into the corresponding clusters Cj, the distances between data points and their closest cluster centroid become as
small as possible.
The convergence of the k-means algorithm is highly dependent on the initialization of the centroids.
Although the algorithm converges, it may converge to a local minimum of the objective function of Eq. (1).
One way to overcome this problem is to execute the algorithm several times, with different
initializations of the centroids, and keep the solution with the lowest objective value.
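To make the steps above concrete, here is a minimal NumPy sketch of Algorithm 1. The function name kmeans_sketch, the random initialization and the convergence test are assumptions made for this illustration; it is a starting point, not the reference solution of the kmeans() function you will implement in Section 2.2.

import numpy as np

def kmeans_sketch(X, k, max_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # Step 1: initialize the centroids with k randomly chosen instances of X
    centroids = X[rng.choice(X.shape[0], size=k, replace=False)]
    for _ in range(max_iter):
        # Step 3: assign each instance to the closest centroid (Euclidean distance)
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 4: re-compute each centroid as the mean of its assigned instances
        new_centroids = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                                  else centroids[j] for j in range(k)])
        # Step 5: stop when the centroids no longer change
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return labels, centroids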
Another issue is how to set the parameter k, i.e., how to determine the number of clusters in the dataset.
Intuitively, increasing k without penalty will always reduce the amount of error in the resulting clustering,
down to the extreme case of zero error when each data point forms its own cluster (i.e., when k equals
the number of data points, m). A simple method for choosing k is known as the elbow rule². The idea is to examine and
compare the sum of squared error (SSE) given in Eq. (1) for a number of clustering solutions. In general,
as the number of clusters increases, the SSE should decrease because the clusters are, by definition, smaller.
A plot of the SSE against a series of increasing values of k can be helpful here.
An appropriate clustering solution can then be defined as the one after which the reduction in SSE slows
down dramatically, producing an "elbow" in the plot of SSE against the different values of k.
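For example, one possible way to apply the elbow rule is sketched below: run k-means for a range of values of k, record the SSE of Eq. (1) for each run, and look for the value of k after which the curve flattens. The sketch assumes a data matrix X, the kmeans_sketch() function above (or your own kmeans() implementation) and matplotlib for plotting.

import numpy as np
import matplotlib.pyplot as plt

def sse(X, labels, centroids):
    # Sum of squared errors, i.e., the objective of Eq. (1)
    return sum(np.sum((X[labels == j] - centroids[j]) ** 2) for j in range(len(centroids)))

ks = range(1, 11)
errors = []
for k in ks:
    labels, centroids = kmeans_sketch(X, k)   # or your own kmeans() implementation
    errors.append(sse(X, labels, centroids))

plt.plot(list(ks), errors, marker='o')
plt.xlabel('number of clusters k')
plt.ylabel('SSE')
plt.show()   # the "elbow" of the curve suggests a suitable value of k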

2.1 Pipeline of the task


Next we describe the pipeline of the task contained in the kmeans/main.py Python script. The goal here
is to apply k-means on two datasets. The first one is an artificial dataset where the data points form four
distinct clusters, similar to the one shown in Fig. 1. The second one is the MNIST handwritten digits
dataset that has also been used in the supervised learning labs. The basic difference here is that we do
not take into account the class labels. We use a modified version of the dataset (similar to the one used
in Lab 5): we have applied PCA on the data and we keep the first 8 principal components, and we also keep
a sample of the data consisting of 1000 instances. Thus, the size of dataset X is 1000 × 8. Our goal is to
apply k-means clustering on X.

[Figure 1: Example of an artificial dataset (1st dimension vs. 2nd dimension).]

² A description of the elbow rule can be found at http://www.mattpeeples.net/kmeans.html.

For both datasets, we initially load the data and, for illustration purposes, visualize them. For
example, in the case of the MNIST dataset we perform the steps shown below.
# Number of instances and number of principal components (features)
n_instances = 1000
pca_features = 8

# Get the labels of each digit
images, labels_mnist = read_dataset(n_instances, pca_features)

# Create the dataset (data_mnist) that will be used in clustering

# Load the PCA features of the test dataset
data_mnist = array(list(csv.reader(open("test_data.csv", "r"), delimiter=','))).astype('float')
data_mnist = data_mnist[:n_instances, :pca_features]  # only the first 8 features are kept

Then, we run the k-means algorithm for different values of k. We also plot the data (two dimensions) based
on the clustering results produced by the algorithm.

# Run the k-means algorithm for different values of k
k = 10
labels_pred_mnist = kmeans(data_mnist, k)
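One possible way to produce such a plot (assuming matplotlib and the variables from the snippets above) is the following.

import matplotlib.pyplot as plt

# Color each instance by its predicted cluster label, using the first two PCA dimensions
plt.scatter(data_mnist[:, 0], data_mnist[:, 1], c=labels_pred_mnist, s=10, cmap='tab10')
plt.xlabel('1st dimension')
plt.ylabel('2nd dimension')
plt.title('k-means clustering of the MNIST sample (k = 10)')
plt.show()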

2.2 Tasks to be done


• Fill in the code of the kmeans() function in the kmeans.py file, based on Algorithm 1.

• Run the k-means algorithm for different values of k for the two datasets and examine the quality
of the produced clusters.

• Use the elbow rule described above to determine the number of clusters k by examining the sum
of squared error (SSE) for the clusters produced for different values of k.

3 Spectral Clustering
Spectral clustering techniques make use of the spectrum (eigenvalues) of the similarity matrix of the
data to perform dimensionality reduction before clustering in fewer dimensions. The similarity matrix
is provided as an input and consists of a quantitative assessment of the relative similarity of each pair
of points in the dataset.
Given a set of data points x1 , . . . , xm , ∀xi ∈ Rn and some notion of similarity sij between all pairs of
data points xi and xj , the intuitive goal of clustering is to divide the data points into several groups such
that points in the same group are similar and points in different groups are dissimilar to each other. If
we have no more information than the pairwise similarities between data points, a natural way of representing the
data is in the form of a similarity graph G = (V, E). Each vertex vi in this graph represents a data point
xi . Two vertices are connected if the similarity sij between the corresponding data points xi and xj is
positive or larger than a certain threshold, and the edge is weighted by sij . The problem of clustering
can now be reformulated using the similarity graph: we want to find a partition of the graph such
that the edges between different groups have very low weights (which means that points in different
clusters are dissimilar from each other) and the edges within a group have high weights (which means
that points within the same cluster are similar to each other).

How to create a similarity graph

There are several popular constructions to transform a given set x1 , . . . , xm , ∀xi ∈ Rn of data points with
pairwise similarities sij or pairwise distances dij into a graph. When constructing similarity graphs the
goal is to model the local neighborhood relationships between the data points.

• k-Nearest Neighbors graph. Here the goal is to connect vertex vi with vertex vj if vj is among the
k-nearest neighbors of vi . However, this definition leads to a directed graph, as the neighborhood
relationship is not symmetric. The most common way to deal with this is to simply ignore the
directions of the edges; that is, we connect vi and vj with an undirected edge if vi is among the
k-nearest neighbors of vj or if vj is among the k-nearest neighbors of vi . The resulting graph is
what is usually called the k-nearest neighbors graph.

• The fully connected graph. Here we simply connect all points that have positive similarity with
each other, and we weight all edges by sij. As the graph should represent the local neighbor-
hood relationships, this construction is only useful if the similarity function itself models local
neighborhoods. An example of such a similarity function is the Gaussian similarity function
s(xi, xj) = exp(−‖xi − xj‖² / (2σ²)), where the parameter σ controls the width of the neighbor-
hoods. A minimal code sketch of both constructions is given after this list.
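As an illustration of the two constructions above, the following is a minimal NumPy sketch (not part of the lab files; the function names, the value of σ and the number of neighbors are assumptions made for the example).

import numpy as np

def gaussian_similarity_graph(X, sigma=1.0):
    # Fully connected graph: W[i, j] = exp(-||xi - xj||^2 / (2 sigma^2))
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dists / (2 * sigma ** 2))
    np.fill_diagonal(W, 0)  # no self-loops
    return W

def knn_graph(X, k=10):
    # Symmetric k-nearest-neighbors graph with binary (0/1) weights
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    np.fill_diagonal(sq_dists, np.inf)                 # exclude each point from its own neighbors
    neighbors = np.argsort(sq_dists, axis=1)[:, :k]    # indices of the k closest points
    W = np.zeros((X.shape[0], X.shape[0]))
    rows = np.repeat(np.arange(X.shape[0]), k)
    W[rows, neighbors.ravel()] = 1
    return np.maximum(W, W.T)                          # connect vi and vj if either is a k-NN of the other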

The algorithm

Next we describe the pseudocode of the spectral clustering algorithm.

Algorithm 2 Spectral Clustering


Input: Dataset X = {x1 , x2 , . . . , xm }, where each xi ∈ Rn and parameter k
Output: Clusters C1 , C2 , . . . , Ck (i.e., cluster assignments of each instance C = {c1 , c2 , . . . , cm })

1: Construct the similarity graph G using one of the ways described above. Let W be the adjacency
matrix of this graph.
2: Compute the Laplacian matrix L = D − W, where D is the diagonal degree matrix of graph G (i.e., the
degree of each node vi, equal to its number of neighbors, on the main diagonal).
3: Apply eigenvalue decomposition to the Laplacian matrix L and compute the eigenvectors that correspond
to the k smallest eigenvalues. Let U = [u1 | u2 | . . . | uk] ∈ Rm×k be the matrix containing these
eigenvectors as columns.
4: For i = 1, . . . , m, let yi ∈ Rk be the vector corresponding to the i-th row of U. Apply k-means to the
points (yi )i=1,...,m (i.e., the rows of U) and find clusters C1 , C2 , . . . , Ck .
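As a rough illustration of Algorithm 2 (and of the spectralClustering() function you will implement later), here is a minimal NumPy/SciPy sketch; it assumes the unnormalized Laplacian and reuses SciPy's kmeans2 for the final clustering step, so treat it as a starting point rather than the reference solution.

import numpy as np
from scipy.cluster.vq import kmeans2

def spectral_clustering_sketch(W, k):
    # Step 2: Laplacian matrix L = D - W, with D the diagonal degree matrix
    D = np.diag(W.sum(axis=1))
    L = D - W
    # Step 3: eigenvectors of the k smallest eigenvalues (eigh returns them in ascending order)
    eigvals, eigvecs = np.linalg.eigh(L)
    U = eigvecs[:, :k]
    # Step 4: cluster the rows of U with k-means
    centroids, labels = kmeans2(U, k, minit='++')
    return labels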

In spectral clustering, the data is projected into a lower-dimensional space (the spectral/eigenvector
domain) where the clusters are easily separable, e.g., using k-means. So, what is the reason to apply
spectral clustering (on the similarity matrix of the data) instead of applying k-means directly to the
initial data? Typically, the k-means algorithm is suited to finding compact clusters of convex shape,
while spectral clustering methods try to identify connectivity patterns in the similarity graph. In many
cases, we are interested in finding clusters that are non-convex, and in such cases the k-means algorithm
does not behave well. Figure 2 shows an example of a dataset where the "natural" clusters in R2 do not
correspond to convex compact regions. Applying k-means to this dataset will extract the clusters shown
in Fig. 3 (a). On the other hand, as shown in Fig. 3 (b), applying spectral clustering we are able to find
non-convex clusters with good connectivity properties.

[Figure 2: Example of a dataset (axes: x1 vs. x2).]

[Figure 3: Results using the k-means (a) and spectral clustering (b) algorithms (axes: x1 vs. x2).]
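If you want to reproduce a comparison in the spirit of Fig. 3 before filling in the lab code, the short sketch below does so on a synthetic two-ring dataset; it relies on scikit-learn (an assumption, not part of the lab files) for the data generation and for a reference spectral clustering implementation.

from sklearn.datasets import make_circles
from sklearn.cluster import KMeans, SpectralClustering

# Two concentric rings: the "natural" clusters are non-convex
X, _ = make_circles(n_samples=500, factor=0.4, noise=0.05, random_state=0)

# k-means partitions the plane into convex regions and mixes the two rings
labels_km = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Spectral clustering on a nearest-neighbors graph recovers the two rings
labels_sc = SpectralClustering(n_clusters=2, affinity='nearest_neighbors',
                               n_neighbors=10, random_state=0).fit_predict(X)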

3.1 Pipeline of the task


Next we describe the pipeline of the task contained in the spectral clustering/main.py file. The
goal is to apply spectral clustering on the artificial data shown in Fig. 2. Initially, we call the function
generateData() contained in the generateData.py file, to create the artificial data. Then, we use
a built-in Python implementation of the k-means algorithm (the kmeans2 function) to cluster this dataset (you can also
use your own implementation) and we plot the results.
# Number of clusters
k = 3

# Cluster using k-means
centroids, labels = kmeans2(data, k)

Then, we create the similarity graph, finding the N closest neighbors of each data instance, using the Eu-
clidean distance. Notice that, after finding the closest neighbors using the findClosestNeighbours()
function, we can directly form the adjacency matrix W of the graph. Here we create a binary (0 or 1)
adjacency matrix (if two points are neighbors, we add the corresponding edge with weight 1).
N = 10
closestNeighbours = findClosestNeighbours(data, N)

# Create the adjacency matrix
W = zeros((data.shape[0], data.shape[0]))
for i in range(data.shape[0]):
    for j in range(N):
        W[i, closestNeighbours[i, j]] = 1
        W[closestNeighbours[i, j], i] = 1

Having the similarity graph (described by the adjacency matrix W), we can apply the spectral clustering
algorithm, finding the underlying clustering structure.
# Perform spectral clustering
labels = spectralClustering(W, k)

3.2 Tasks to be done


• Fill in the code of the spectralClustering() function in the spectralClustering.py file
to implement the spectral clustering algorithm as described in Algorithm 2. Note that the adja-
cency matrix W (step 1 of the algorithm) has already been created.

• Run the algorithm and reproduce the clustering results shown in Fig. 3 (b).

