
Spectral clustering

One of the most common algorithm families that can manage non-convex clusters is spectral clustering. The main idea is to project the dataset X onto a space where the clusters can be captured by hyperspheres (for example, using K-means). This result can be achieved in different ways but, as the goal of the algorithm is to remove the concavities of generically shaped regions, the first step is always the representation of X as a graph G = {V, E}, where the vertices V ≡ X and the weighted edges represent the proximity of every pair of samples xi, xj ∈ X through the parameter wij ≥ 0. The resulting graph can be either complete (fully connected) or it can have edges only between some pairs of samples (that is, the weight of non-existing edges is set equal to zero). In the following diagram, there's an example of a partial graph:

Example of a graph: point x0 is the only one that is connected to x1
There are two main strategies that can be employed to determine the weights wij: KNN and the Radial Basis Function (RBF). The first one is based on the same algorithm discussed in the previous chapter. Considering a number k of neighbors, the dataset is represented as a ball-tree or kd-tree and, for each sample xi, the set kNN(xi) is computed. At this point, given another sample xj, the weight is computed as follows:

wij = 1 if xj ∈ kNN(xi), and wij = 0 otherwise

In this case, the graph doesn't contain any information about the actual distances and hence, considering the same distance function d(•) employed in KNN, it is preferable to represent wij as:

wij = d(xi, xj) if xj ∈ kNN(xi), and wij = 0 otherwise
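
As a practical illustration, both variants of the KNN-based weights can be obtained, for example, with scikit-learn's kneighbors_graph function (the dataset and the choice k = 10 in the following sketch are arbitrary):

import numpy as np

from sklearn.neighbors import kneighbors_graph

# Arbitrary bidimensional dataset used only for this illustration
X = np.random.uniform(-1.0, 1.0, size=(500, 2))

# Binary weights: w_ij = 1 if x_j belongs to kNN(x_i), 0 otherwise
W_conn = kneighbors_graph(X, n_neighbors=10, mode='connectivity')

# Distance-based weights: w_ij = d(x_i, x_j) for the same neighbor pairs
W_dist = kneighbors_graph(X, n_neighbors=10, mode='distance')

# The raw KNN graph is not symmetric in general, so it can be symmetrized explicitly
W_conn = 0.5 * (W_conn + W_conn.T)

Both matrices are returned in a sparse (CSR) format.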

This method is simple and rather reliable, but the resulting graph is not fully connected. Such a condition can be easily achieved by employing an RBF, defined as follows:

wij = e^(-γ‖xi - xj‖²)

In this way, all pairs are automatically weighted according to their distance. As the RBF is a Gaussian curve, it is equal to 1 when xi = xj and decreases exponentially with the squared distance d(xi, xj)² (represented as the squared norm of the difference). The parameter γ determines the amplitude of the half-bell curve (in general, the default value is γ = 1). When γ < 1, the amplitude increases, and vice versa. Therefore, γ < 1 implies a lower sensitivity to the distance, while with γ > 1, the RBF drops more quickly, as shown in the following screenshot:

Bidimensional RBFs as functions of the distance between x and 0, computed for γ = 0.1, 1.0, and 5.0

With γ = 0.1, x = 1 (with respect to 0.0) is weighted about 0.9. This value becomes about 0.37 for γ = 1.0 and almost zero for γ = 5.0. Hence, when tuning a spectral clustering model, it's extremely important to consider different values for γ and select the one that yields the best performance (for example, evaluated using the criteria discussed in Chapter 2, Clustering Fundamentals). Once the graph has been created, it can be represented using a symmetric affinity matrix W = {wij}. For KNN, W is generally sparse and can be efficiently stored and manipulated with specialized libraries. With RBF, instead, it is always dense and, if X ∈ ℜN × M, it needs to store N² values.
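
As a quick numerical check of these values and of the density of W, the RBF affinity matrix can be computed, for example, with scikit-learn's rbf_kernel function (the dataset in the following sketch is arbitrary):

import numpy as np

from sklearn.metrics.pairwise import rbf_kernel

# Arbitrary bidimensional dataset used only for this illustration
X = np.random.uniform(-1.0, 1.0, size=(500, 2))

# Dense, symmetric affinity matrix: W[i, j] = exp(-gamma * ||x_i - x_j||^2)
W = rbf_kernel(X, gamma=1.0)
print(W.shape)  # (500, 500): N^2 values must be stored

# Weight assigned to a pair of samples at distance 1 for different gamma values
for gamma in (0.1, 1.0, 5.0):
    print(gamma, np.exp(-gamma * 1.0 ** 2))

The last loop prints approximately 0.905, 0.368, and 0.007, in line with the values discussed previously.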

It's not difficult to prove that the procedure we have analyzed so far is equivalent to a segmentation of X into a number of cohesive regions. In fact, let's consider, for example, a graph G with an affinity matrix obtained with KNN. A connected component Ci is a subgraph where every pair of vertices xa, xb ∈ Ci is connected through a path of vertices belonging to Ci, and there are no edges connecting any vertex of Ci with a vertex not belonging to Ci. In other words, a connected component is a cohesive subset Ci ⊆ G that represents an optimal candidate for a cluster selection. In the following diagram, there's an example of a connected component extracted from a graph:
Example of a connected component extracted from a graph

In the original space, the points x0, x2, and x3 are connected to xn, xm, and xq through x1. This can represent a very simple non-convex geometry, such as a half-moon. In fact, in this case, the convexity assumption is no longer necessary for an optimal separation because, as we are going to see, these components are extracted and projected onto subspaces with flat geometries (easily manageable by algorithms such as K-means).
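
To make the connection with half-moon geometries concrete, the following sketch builds a KNN graph on scikit-learn's two-moons dataset and counts its connected components with SciPy (the noise level and k = 10 are arbitrary choices):

from scipy.sparse.csgraph import connected_components
from sklearn.datasets import make_moons
from sklearn.neighbors import kneighbors_graph

# Two interleaving half-moons (non-convex clusters)
X, _ = make_moons(n_samples=500, noise=0.05, random_state=1000)

# Sparse KNN connectivity graph, symmetrized to obtain an undirected graph
W = kneighbors_graph(X, n_neighbors=10, mode='connectivity')
W = 0.5 * (W + W.T)

# Number of connected components and the component label of each sample
n_components, labels = connected_components(W, directed=False)
print(n_components)  # With a low noise level, each half-moon is expected to form one component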

This process is more evident when KNN is employed but, in general, we can say that two regions can be merged when the inter-region distance (for example, the distance between the two closest points) is comparable to the average intra-region distance. One of the most common methods to solve this problem has been proposed by Shi and Malik (in Normalized Cuts and Image Segmentation, J. Shi and J. Malik, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, 08/2000) and it's called normalized cuts. The whole proof is beyond the scope of this book, but we can discuss the main concepts. Given a graph, it's possible to build the normalized graph Laplacian, defined as:

L = D⁻¹(D - W) = I - D⁻¹W

The diagonal matrix D is called the degree matrix, and each element dii is the sum of the weights of the corresponding row. It's possible to prove the following statements:

• After eigendecomposing L (it's easy to compute both eigenvalues and eigenvectors by considering the unnormalized graph Laplacian Lu = D - W and solving the generalized equation Luv = λDv), the null eigenvalue is always present with multiplicity p.
• If G is an undirected graph with non-negative weights (wij = wji ≥ 0 ∀ i, j), the number of connected components is equal to p (the multiplicity of the null eigenvalue).

• If A ⊆ ℜN and Θ is a countable set of vectors (X, for example, is countable because the number of samples is always finite), a vector v, with one component per element θi ∈ Θ, is called the indicator vector for Θ with respect to A if v(i) = 1 when θi ∈ A and v(i) = 0 otherwise. For example, if we have two vectors a = (1, 0) and b = (0, 0) (so Θ = {a, b}) and we consider A = {(1, n) where n ∈ [0, 10]}, the vector v = (1, 0) is an indicator vector, because a ∈ A and b ∉ A.
• The first p eigenvectors of L (corresponding to the null eigenvalue) are indicator vectors of the connected components C1, C2, ..., Cp; that is, the null eigenspace is spanned by these indicator vectors.
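
These statements can be verified numerically by continuing the previous sketch: the generalized problem Luv = λDv is solved with SciPy, and the multiplicity of the (numerically) null eigenvalue is compared with the number of connected components:

import numpy as np

from scipy.linalg import eigh

# Dense affinity matrix, degree matrix, and unnormalized Laplacian
W_d = W.toarray()
D = np.diag(W_d.sum(axis=1))
L_u = D - W_d

# Generalized eigenvalue problem L_u v = lambda D v (eigenvalues in ascending order)
eigenvalues, eigenvectors = eigh(L_u, D)

# Multiplicity of the null eigenvalue (up to numerical tolerance)
p = int(np.sum(np.isclose(eigenvalues, 0.0, atol=1e-9)))
print(p, n_components)  # The two values are expected to coincide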

Hence, if the dataset is made up of M samples xi ∈ ℜN, and the graph G is associated with an affinity matrix W ∈ ℜM × M, Shi and Malik propose building a matrix B ∈ ℜM × p containing the first p eigenvectors as columns, and clustering its rows using a simpler method, such as K-means. In fact, each row represents the projection of a sample onto a p-dimensional subspace, where the non-convexities are represented by subregions that can be enclosed in regular balls.
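
Continuing the same sketch, this procedure can be reproduced in a few lines (this is only an illustration of the idea, not scikit-learn's internal implementation):

from sklearn.cluster import KMeans

# Matrix B: the first p eigenvectors (smallest eigenvalues) as columns, one row per sample
p = 2
B = eigenvectors[:, :p]

# K-means on the rows of B assigns the final cluster labels
Y_pred = KMeans(n_clusters=p, random_state=1000).fit_predict(B)

In the projected space, each half-moon collapses onto an (approximately) constant direction, so a simple ball-based algorithm such as K-means can separate the two clusters.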

Let's now apply spectral clustering in order to separate a bidimensional sinusoidal dataset, generated with the following snippet:
import numpy as np

nb_samples = 2000

X0 = np.expand_dims(np.linspace(-2 * np.pi, 2 * np.pi, nb_samples), axis=1)
Y0 = -2.0 - np.cos(2.0 * X0) + np.random.uniform(0.0, 2.0, size=(nb_samples, 1))

X1 = np.expand_dims(np.linspace(-2 * np.pi, 2 * np.pi, nb_samples), axis=1)
Y1 = 2.0 - np.cos(2.0 * X1) + np.random.uniform(0.0, 2.0, size=(nb_samples, 1))

data_0 = np.concatenate([X0, Y0], axis=1)
data_1 = np.concatenate([X1, Y1], axis=1)
data = np.concatenate([data_0, data_1], axis=0)

The dataset is shown in the following screenshot:


A sinusoidal dataset for the spectral clustering example

We haven't specified any ground truth; however, the goal is to separate the two sinusoids (which are non-convex). It's easy to check that a ball capturing one sinusoid will also include many samples belonging to the other sinusoidal subset. In order to show the difference between pure K-means and spectral clustering (scikit-learn implements the Shi-Malik algorithm followed by K-means clustering), we are going to train both models, using for the latter an RBF affinity (the affinity parameter) with γ = 2.0 (the gamma parameter). Of course, I invite the reader to also test other values and the KNN affinity. The RBF-based solution is shown in the following snippet:
from sklearn.cluster import SpectralClustering, KMeans

km = KMeans(n_clusters=2, random_state=1000)
sc = SpectralClustering(n_clusters=2, affinity='rbf', gamma=2.0, random_state=1000)

Y_pred_km = km.fit_predict(data)
Y_pred_sc = sc.fit_predict(data)

The results are shown in the following screenshot:

Original dataset (left). Spectral clustering result (center). K-means result (right)

As you can see, K-means partitions the dataset with two balls along the x-axis, while spectral clustering succeeds in separating the two sinusoids correctly. This algorithm is very powerful whenever both the number of clusters and the dimensionality of X are not too large (when they are large, the eigendecomposition of the Laplacian can become computationally very expensive). Moreover, as the algorithm is based on a graph-cutting procedure, it's perfectly suited when the number of clusters is even.
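
As suggested previously, it's also worth testing the KNN affinity. For example, reusing data and nb_samples from the previous snippets, a possible check could be the following (the choice n_neighbors=20 is arbitrary, and the ground truth is reconstructed from the generation order of the dataset):

import numpy as np

from sklearn.cluster import SpectralClustering
from sklearn.metrics import adjusted_rand_score

# Pseudo ground truth: the first nb_samples points belong to the lower sinusoid
Y_true = np.concatenate([np.zeros(nb_samples), np.ones(nb_samples)])

# Spectral clustering with a KNN-based affinity (n_neighbors = 20 is arbitrary)
sc_knn = SpectralClustering(n_clusters=2, affinity='nearest_neighbors', n_neighbors=20, random_state=1000)
Y_pred_knn = sc_knn.fit_predict(data)

# Adjusted Rand Index: 1.0 corresponds to a perfect agreement with the ground truth
print(adjusted_rand_score(Y_true, Y_pred_knn))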
