
Spectral Clustering

Course: Cluster Analysis and Other Unsupervised Learning Methods (Stat 593 E)

Speakers: Rebecca Nugent¹, Larissa Stanberry²

Departments of ¹Statistics and ²Radiology, University of Washington
Outline
 What is spectral clustering?
 The clustering problem in graph theory
 On the nature of the affinity matrix
 Overview of available spectral clustering algorithms
 Iterative Algorithm: A Possible Alternative
Spectral Clustering
 Algorithms that cluster points using eigenvectors of matrices derived from the data
 Obtain a data representation in a low-dimensional space that can be easily clustered
 A variety of methods use the eigenvectors differently
[Diagram: data-driven matrix, from which Method 1 and Method 2 derive clusters]
Spectral Clustering
 Empirically very successful
 Authors disagree:
 Which eigenvectors to use
 How to derive clusters from these eigenvectors
 Two general methods
Method #1
 Partition using only one eigenvector at a time
 Use the procedure recursively
 Example: image segmentation
 Uses the second smallest eigenvector to define the optimal cut
 Recursively generates two clusters with each cut (see the sketch below)
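To make the one-eigenvector bisection step concrete, here is a minimal NumPy sketch. It uses the plain graph Laplacian L = D - A and a sign-based split, which is a simplification for illustration; Shi and Malik actually solve a generalized eigenproblem (see the comparison table later in the deck). The function name is my own.

```python
import numpy as np

def fiedler_bisect(A):
    """One bisection step: split vertices by the sign of the eigenvector
    belonging to the 2nd smallest eigenvalue of the Laplacian L = D - A.
    (Illustrative simplification of the recursive-cut idea.)"""
    L = np.diag(A.sum(axis=1)) - A
    _, eigvecs = np.linalg.eigh(L)   # eigenvalues in ascending order
    fiedler = eigvecs[:, 1]          # vector of the 2nd smallest eigenvalue
    return fiedler >= 0              # boolean mask: one side of the cut
```

Recursion then reapplies the same step to the submatrix of A restricted to each side of the cut.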
Method #2
 Use k eigenvectors (k chosen by user)
 Directly compute a k-way partitioning
 Experimentally has been seen to be "better"
Spectral Clustering Algorithm (Ng, Jordan, and Weiss)

Given a set of points S = {s_1, ..., s_n}:
 Form the affinity matrix A_ij = exp(-||s_i - s_j||² / (2σ²)) for i ≠ j, A_ii = 0
 Define the diagonal matrix D_ii = Σ_k A_ik
 Form the matrix L = D^(-1/2) A D^(-1/2)
 Stack the k largest eigenvectors x_1, x_2, ..., x_k of L as the columns of the new matrix X
 Renormalize each of X's rows to have unit length, giving Y, and cluster the rows of Y as points in R^k (a sketch follows below)
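The recipe above translates almost line for line into NumPy. The following is a minimal sketch, assuming a fixed σ supplied by the caller and using SciPy's kmeans2 for the final step; the function name njw_spectral_clustering is mine, not the authors'.

```python
import numpy as np
from scipy.cluster.vq import kmeans2

def njw_spectral_clustering(S, k, sigma=1.0, seed=0):
    """Cluster the rows of S (an n x l array) into k groups following
    the Ng-Jordan-Weiss steps summarized above."""
    # Affinity: A_ij = exp(-||s_i - s_j||^2 / (2 sigma^2)), zero diagonal
    sq_dists = np.sum((S[:, None, :] - S[None, :, :]) ** 2, axis=-1)
    A = np.exp(-sq_dists / (2.0 * sigma**2))
    np.fill_diagonal(A, 0.0)

    # D_ii = sum_k A_ik; form L = D^(-1/2) A D^(-1/2)
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    L = A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

    # Columns of X are the k largest eigenvectors of the symmetric L
    _, eigvecs = np.linalg.eigh(L)   # eigenvalues in ascending order
    X = eigvecs[:, -k:]

    # Renormalize rows of X to unit length, then K-means on the rows of Y
    Y = X / np.linalg.norm(X, axis=1, keepdims=True)
    _, labels = kmeans2(Y, k, minit="++", seed=seed)
    return labels
```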
Cluster analysis & graph theory
 Good old example: MST ↔ single linkage
 The minimal spanning tree (MST) is the graph of minimum total length connecting all data points. All single-linkage clusters can be obtained by deleting edges of the MST, starting from the longest one (see the sketch below).
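A sketch of that equivalence using SciPy (the helper name is mine): build the MST, delete its longest edges, and read the clusters off the connected components.

```python
import numpy as np
from scipy.spatial.distance import pdist, squareform
from scipy.sparse.csgraph import minimum_spanning_tree, connected_components

def mst_single_linkage(points, n_clusters):
    """Single-linkage clusters via the MST: delete the (n_clusters - 1)
    longest MST edges and label the resulting connected components."""
    mst = minimum_spanning_tree(squareform(pdist(points))).toarray()
    edges = np.argwhere(mst > 0)
    weights = mst[edges[:, 0], edges[:, 1]]
    if n_clusters > 1:
        for i, j in edges[np.argsort(weights)[-(n_clusters - 1):]]:
            mst[i, j] = 0.0          # remove one of the longest edges
    _, labels = connected_components(mst, directed=False)
    return labels
```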
Cluster analysis & graph theory II
 Graph formulation
 View the data set as a set of vertices V = {1, 2, ..., n}
 The similarity between objects i and j is viewed as the weight A_ij of the edge connecting these vertices; A is called the affinity matrix
 We get a weighted undirected graph G = (V, A)
 Clustering (segmentation) is equivalent to partitioning G into disjoint subsets, which can be achieved by simply removing connecting edges
Nature of the Affinity Matrix

A_ij = exp(-(s_i - s_j)² / (2σ²)),  A_ii = 0

 Weight as a function of distance: "closer" vertices get larger weight
Simple Example
 Consider two 2-dimensional, slightly overlapping Gaussian clouds, each containing 100 points (generated in the sketch below).
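A sketch of generating such data (the means and spread are illustrative choices of mine); the resulting matrix S can be fed to the njw_spectral_clustering sketch above.

```python
import numpy as np

rng = np.random.default_rng(0)
# Two slightly overlapping 2-D Gaussian clouds, 100 points each
cloud1 = rng.normal(loc=(0.0, 0.0), scale=0.5, size=(100, 2))
cloud2 = rng.normal(loc=(1.5, 0.0), scale=0.5, size=(100, 2))
S = np.vstack([cloud1, cloud2])      # 200 x 2 data matrix
```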
Simple Example cont'd I
Simple Example cont'd II
Magic σ

A_ij = exp(-||s_i - s_j||² / (2σ²))

 Affinities grow as σ grows
 How does the choice of the σ value affect the results?
 What would be the optimal choice for σ? (see the numerical illustration below)
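A quick numerical illustration of the first bullet; the pair distance d = 1 is an arbitrary choice of mine.

```python
import numpy as np

d = 1.0  # fixed distance between a pair of points
for sigma in (0.1, 0.5, 1.0, 2.0, 10.0):
    affinity = np.exp(-d**2 / (2 * sigma**2))
    print(f"sigma = {sigma:5.1f}  ->  affinity = {affinity:.4f}")
# The affinity sweeps from ~0 toward 1 as sigma grows
```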
Example 2 (not so simple)
Example 2 cont'd I
Example 2 cont'd II
Example 2 cont'd III
Example 2 cont'd IV
Spectral Clustering Algorithm (Ng, Jordan, and Weiss)
 Motivation
 Given a set of points S = {s_1, ..., s_n} ⊂ R^l
 We would like to cluster them into k subsets
Algorithm
 Form the affinity matrix A ∈ R^(n×n)
 Define A_ij = exp(-||s_i - s_j||² / (2σ²)) if i ≠ j, and A_ii = 0
 The scaling parameter σ² is chosen by the user
 Define D, a diagonal matrix whose (i, i) element is the sum of A's row i
Algorithm
 Form the matrix L = D^(-1/2) A D^(-1/2)
 Find x_1, x_2, ..., x_k, the k largest eigenvectors of L
 These form the columns of the new matrix X

Note: we have reduced the dimension from n×n to n×k
Algorithm
 Form the matrix Y by renormalizing each of X's rows to have unit length:
 Y_ij = X_ij / (Σ_j X_ij²)^(1/2)
 Y ∈ R^(n×k)
 Treat each row of Y as a point in R^k
 Cluster into k clusters via K-means
Algorithm
 Final cluster assignment
 Assign point s_i to cluster j iff row i of Y was assigned to cluster j
Why?
 If we eventually use K-means, why not just apply K-means to the original data?
 This method allows us to cluster non-convex regions (demonstrated in the sketch below)
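A hedged demonstration using scikit-learn's related spectral clustering implementation (not the authors' exact code): on two concentric rings, K-means on the raw coordinates splits each ring in half, while spectral clustering recovers the rings. The gamma value is an illustrative choice.

```python
import numpy as np
from sklearn.datasets import make_circles
from sklearn.cluster import KMeans, SpectralClustering

# Two concentric rings: a non-convex clustering problem
X, truth = make_circles(n_samples=400, factor=0.5, noise=0.05, random_state=0)

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
sc = SpectralClustering(n_clusters=2, affinity="rbf", gamma=20.0,
                        random_state=0).fit_predict(X)

for name, labels in (("k-means", km), ("spectral", sc)):
    # Cluster labels are arbitrary up to permutation, so score both matchings
    agreement = max(np.mean(labels == truth), np.mean(labels != truth))
    print(f"{name}: agreement with the true rings = {agreement:.2f}")
```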
User’s Prerogative
 Choice of k, the number of clusters

 Choice of scaling factor



 Realistically, search over2
and
pick value that gives the tightest
clusters

 Choice of clustering method


Comparison of Methods

 Perona/Freeman. Matrix used: affinity A. Procedure: 1st eigenvector x of Ax = λx; recursive procedure.
 Shi/Malik. Matrix used: D - A, with D the degree matrix, D(i,i) = Σ_j A(i,j). Procedure: 2nd smallest generalized eigenvector of (D - A)x = λDx; also recursive.
 Scott/Longuet-Higgins. Matrix used: affinity A; user inputs k. Procedure: finds k eigenvectors of A, forms V, normalizes the rows of V, forms Q = VV'. Segments by Q: Q(i,j) = 1 means same cluster (a sketch of this step follows below).
 Ng/Jordan/Weiss. Matrix used: affinity A; user inputs k. Procedure: normalizes A into L = D^(-1/2) A D^(-1/2), finds the k largest eigenvectors, normalizes rows, and clusters them via K-means (as described above).
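A minimal sketch of the Scott/Longuet-Higgins construction from the table (the function name is mine; thresholding Q into clusters is left to the caller):

```python
import numpy as np

def slh_q_matrix(A, k):
    """Form Q = V V' from the k leading eigenvectors of the affinity A,
    after normalizing the rows of V; Q(i, j) near 1 suggests that points
    i and j belong to the same cluster."""
    eigvals, eigvecs = np.linalg.eigh(A)           # ascending eigenvalues
    V = eigvecs[:, np.argsort(eigvals)[::-1][:k]]  # k leading eigenvectors
    V = V / np.linalg.norm(V, axis=1, keepdims=True)  # unit-length rows
    return V @ V.T
```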
Advantages/Disadvantages
 Perona/Freeman
 For block-diagonal affinity matrices, the first eigenvector finds points in the "dominant" cluster; not very consistent
 Shi/Malik
 The 2nd generalized eigenvector minimizes the affinity between groups relative to the affinity within each group; no guarantees, constraints
Advantages/Disadvantages
 Scott/Longuet-Higgins
 Depends largely on the choice of k
 Good results
 Ng/Jordan/Weiss
 Again depends on the choice of k
 Claim: effectively handles clusters whose overlap or connectedness varies across clusters
[Figure slides: for each example, panels show the affinity matrix alongside the Perona/Freeman 1st eigenvector, the Shi/Malik 2nd generalized eigenvector, and the Scott/Longuet-Higgins Q matrix]
Inherent Weakness
 At some point, a clustering method
is chosen.
 Each clustering method has its
strengths and weaknesses
 Some methods also require a priori
knowledge of k.
One tempting alternative
The Polarization Theorem (Brand & Huang)
 Consider the eigenvalue decomposition of the affinity matrix: VΛV^T = A
 Define X = Λ^(1/2) V^T
 Let X_(d) = X(1:d, :) be the top d rows of X: the d principal eigenvectors, scaled by the square roots of the corresponding eigenvalues
 A_d = X_(d)^T X_(d) is the best rank-d approximation to A with respect to the Frobenius norm (||A||_F² = Σ a_ij²)
The Polarization Theorem II
 Build Y_(d) by normalizing the columns of X_(d) to unit length
 Let θ_ij be the angle between x_i and x_j, columns of X_(d)
 Claim: as A is projected to successively lower ranks A_(N-1), A_(N-2), ..., A_(d), ..., A_(2), A_(1), the sum of squared angle-cosines Σ (cos θ_ij)² is strictly increasing (see the sketch below)
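A sketch of the quantities in the claim, for a symmetric affinity A (the np.abs guards against tiny negative eigenvalues in floating point; per the theorem, the returned value should grow as d decreases):

```python
import numpy as np

def sum_squared_cosines(A, d):
    """Sum of (cos theta_ij)^2 over column pairs of the rank-d embedding
    X_(d) = Lambda_(d)^(1/2) V_(d)^T, with columns normalized to unit length."""
    eigvals, V = np.linalg.eigh(A)                 # ascending eigenvalues
    top = np.argsort(eigvals)[::-1][:d]            # d principal eigenpairs
    X_d = np.sqrt(np.abs(eigvals[top]))[:, None] * V[:, top].T  # d x n
    Y_d = X_d / np.linalg.norm(X_d, axis=0, keepdims=True)      # unit columns
    cosines = Y_d.T @ Y_d                          # cos(theta_ij) for all pairs
    return float(np.sum(cosines**2))
```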
Brand-Huang algorithm
 Basic strategy: two alternating projections:
 Projection to low rank
 Projection to the set of zero-diagonal doubly stochastic matrices (a doubly stochastic matrix has all rows and columns summing to unity); one way to realize this step is sketched below
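One standard way to realize the second projection is Sinkhorn-style alternating normalization. This is a hedged sketch of that idea, not necessarily Brand and Huang's exact operator.

```python
import numpy as np

def toward_doubly_stochastic(A, iters=200):
    """Push a nonnegative matrix toward the zero-diagonal doubly
    stochastic set by alternately rescaling rows and columns.
    (Sinkhorn-style heuristic; an assumption, not Brand & Huang's code.)"""
    P = A.astype(float).copy()
    np.fill_diagonal(P, 0.0)                  # enforce the zero diagonal
    for _ in range(iters):
        P /= P.sum(axis=1, keepdims=True)     # rows sum to unity
        P /= P.sum(axis=0, keepdims=True)     # columns sum to unity
    return P
```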
Brand-Huang algorithm II
 While {number of unit eigenvalues} < 2 do
 A → P → A_(d) → P → A_(d) → ...
 Projection is done by suppressing the negative eigenvalues and the unity eigenvalue
 The presence of two or more stochastic (unit) eigenvalues implies reducibility of the resulting P matrix
 A reducible matrix can be row- and column-permuted into block-diagonal form
Brand-Huang algorithm III
References
 Alpert et al. Spectral partitioning with multiple eigenvectors.
 Brand & Huang. A unifying theorem for spectral embedding and clustering.
 Belkin & Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation.
 Blatt et al. Data clustering using a model granular magnet.
 Buhmann. Data clustering and learning.
 Fowlkes et al. Spectral grouping using the Nyström method.
 Meila & Shi. A random walks view of spectral segmentation.
 Ng et al. On spectral clustering: analysis and an algorithm.
 Shi & Malik. Normalized cuts and image segmentation.
 Weiss. Segmentation using eigenvectors: a unifying view.
