
APPLICATIONS OF GRAPH LAPLACEANS: Clustering: Background

• Details on clustering
• K-means
• Similarity graphs, kNN graphs
• Edge cuts, ratio cuts, etc.
• Application: segmentation

CLUSTERING

➤ Problem: we are given n data items x1, x2, · · · , xn. We would like to ‘cluster’ them, i.e., group them so that each group or cluster contains items that are similar in some sense.

➤ Example: materials [figure: materials grouped into classes such as Photovoltaic, Superhard, Superconductors, Ferromagnetic, Catalytic, Multi-ferroics, Thermo-electric]

➤ Example: digits [figure: PCA projection of handwritten digits 5, 6, 7]

➤ Refer to each group as a ‘cluster’ or a ‘class’

➤ A basic method: K-means


A basic method: K-means

➤ A basic algorithm that uses the Euclidean distance

1. Select p initial centers c1, c2, . . . , cp for classes 1, 2, · · · , p
2. For each xi: determine the class of xi as argmin_k ‖xi − ck‖
3. Redefine each ck to be the centroid of class k
4. Repeat until convergence

[figure: 2-D data points grouped around three centers c1, c2, c3]

➤ Simple algorithm

➤ Works well (gives good results) but can be slow

➤ Performance depends on initialization

Methods based on similarity graphs

➤ Class of methods that perform clustering by exploiting a graph that describes the similarities between any two items in the data.

➤ Need to:

1. decide which nodes are in the neighborhood of a given node
2. quantify their similarities, by assigning a weight to any pair of nodes.

Example: For text data one can decide that any columns i and j with a cosine greater than 0.95 are ‘similar’ and assign that cosine value to wij.
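A minimal NumPy sketch of the K-means iteration described above (steps 1–4); the function name and the random initialization are illustrative choices, not from the slides:

```python
import numpy as np

def kmeans(X, p, n_iter=100, seed=0):
    """Basic K-means on the rows of X (n points, d features)."""
    rng = np.random.default_rng(seed)
    # 1. select p initial centers (here: random data points)
    centers = X[rng.choice(len(X), size=p, replace=False)]
    for _ in range(n_iter):
        # 2. assign each x_i to the class argmin_k ||x_i - c_k||
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3. redefine each c_k as the centroid of class k
        new_centers = np.array([X[labels == k].mean(axis=0) if np.any(labels == k)
                                else centers[k] for k in range(p)])
        # 4. repeat until convergence (centers stop moving)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels, centers
```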


First task: build a ‘similarity’ graph

➤ Goal: to build a similarity graph, i.e., a graph that captures the similarity between any two items

[figure: two nodes i and j joined by an edge of weight w(i,j) = ?]

➤ Two methods: K-nearest neighbor graphs, or use a Gaussian (‘heat’) kernel

K-nearest neighbor graphs

➤ Given: a set of n data points X = {x1, . . . , xn} → vertices

➤ Given: a proximity measure between two data points xi and xj, as measured by a quantity dist(xi, xj)

➤ Want: for each point xi, a list of the ‘nearest neighbors’ of xi (edges between xi and these nodes).

➤ Note: the graph will usually be directed → need to symmetrize


Nearest neighbor graphs

➤ For each node, get a few of the nearest neighbors → graph

[figure: a cloud of data points turned into a nearest-neighbor graph]

➤ Problem: how to build a nearest-neighbor graph from given data

➤ We will revisit this later.

Two types of nearest neighbor graph often used:

ε-graph: edges consist of pairs (xi, xj) such that ρ(xi, xj) ≤ ε

kNN graph: nodes adjacent to xi are those nodes xℓ with the k smallest distances ρ(xi, xℓ).

➤ The ε-graph is undirected and is geometrically motivated. Issues: 1) it may result in disconnected components; 2) what ε?

➤ kNN graphs are directed in general (this can be trivially fixed).

➤ kNN graphs are especially useful in practice.

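A small dense NumPy sketch of both constructions (brute-force pairwise distances; the function names are illustrative and this is only practical for modest n):

```python
import numpy as np

def knn_graph(X, k):
    """kNN graph on the rows of X: connect x_i to its k nearest neighbors, then symmetrize."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)   # pairwise distances
    np.fill_diagonal(D, np.inf)                                  # no self-loops
    A = np.zeros_like(D)
    nbrs = np.argsort(D, axis=1)[:, :k]                          # k smallest distances per row
    A[np.repeat(np.arange(len(X)), k), nbrs.ravel()] = 1.0       # directed edges i -> neighbor
    return np.maximum(A, A.T)                                    # symmetrize the directed graph

def eps_graph(X, eps):
    """Epsilon-graph: connect x_i and x_j whenever dist(x_i, x_j) <= eps (undirected)."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    A = (D <= eps).astype(float)
    np.fill_diagonal(A, 0.0)
    return A
```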


Similarity graphs: Using ‘heat kernels’

Define the weight between i and j as:

    wij = fij × exp(−‖xi − xj‖² / σ_X²)   if ‖xi − xj‖ < r
    wij = 0                               otherwise

➤ Note: ‖xi − xj‖ could be any measure of distance...

➤ fij = optional = some measure of similarity, other than distance

➤ Only nearby points are kept.

➤ Sparsity depends on the parameters

Edge cuts, ratio cuts, normalized cuts, ...

➤ Assume now that we have built a ‘similarity graph’

➤ The setting is identical with that of graph partitioning.

➤ Need a graph Laplacean: L = D − W with wii = 0, wij ≥ 0, and D = diag(W ∗ ones(n, 1)) [in matlab notation]

➤ Partition the vertex set V into two sets A and B with

    A ∪ B = V,   A ∩ B = ∅

➤ Define

    cut(A, B) = Σ_{u∈A, v∈B} w(u, v)

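A dense NumPy sketch of these three ingredients (illustrative names; a real implementation would use sparse matrices):

```python
import numpy as np

def heat_kernel_weights(X, sigma, r, F=None):
    """w_ij = f_ij * exp(-||x_i - x_j||^2 / sigma^2) for nearby points only (else 0)."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    W = np.exp(-D**2 / sigma**2)
    if F is not None:            # optional extra similarity factor f_ij
        W = W * F
    W[D >= r] = 0.0              # only nearby points are kept -> sparsity
    np.fill_diagonal(W, 0.0)     # w_ii = 0
    return W

def laplacian(W):
    """Graph Laplacean L = D - W with D = diag(W @ ones(n))."""
    return np.diag(W.sum(axis=1)) - W

def cut_value(W, A):
    """cut(A, B): total weight of edges between A (boolean mask) and its complement B."""
    A = np.asarray(A, dtype=bool)
    return W[np.ix_(A, ~A)].sum()
```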

➤ First (naive) approach: use this measure to partition the graph, i.e., find A and B that minimize cut(A, B).

➤ Issue: small sets, isolated nodes, big imbalances

[figure: a graph in which the minimum cuts (‘Min-cut 1’, ‘Min-cut 2’) isolate one or two nodes, while a more balanced ‘Better cut’ is preferable]

Ratio-cuts

➤ Standard graph partitioning approach: find A, B by solving

    Minimize cut(A, B)   subject to   |A| = |B|

➤ The condition |A| = |B| is not too meaningful in some applications, and too restrictive in others.

➤ Minimum Ratio Cut approach. Find A, B by solving:

    Minimize cut(A, B) / (|A| · |B|)

➤ Difficult to find a solution (the original paper [Wei-Cheng ’91] proposes several heuristics)

➤ Approximate solution: spectral.



Theorem [Hagen-Kahng ’91]: If λ2 is the 2nd smallest eigenvalue of L, then a lower bound for the cost c of the optimal ratio cut partition is:

    c ≥ λ2 / n.

Proof: Consider an optimal partition A, B and let p = |A|/n, q = |B|/n. Note that p + q = 1. Let x be the vector with coordinates

    xi = q    if i ∈ A
    xi = −p   if i ∈ B

Note that x ⊥ 1. Also, if (i, j) is a cut edge then |xi − xj| = |q − (−p)| = |q + p| = 1; otherwise xi − xj = 0. Therefore,

    xᵀ L x = Σ_{(i,j)∈E} wij (xi − xj)² = w(A, B).

In addition:

    ‖x‖² = p q² n + q p² n = pq (p + q) n = pq n = |A| · |B| / n.

Therefore, by the Courant-Fischer theorem:

    λ2 ≤ (Lx, x) / (x, x) = n × w(A, B) / (|A| · |B|) = n × c.

Hence the result.

➤ Idea: use the eigenvector associated with λ2 to determine the partition, e.g., based on the sign of its entries. Use the ratio-cut measure to actually determine where to split.

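A NumPy sketch of this spectral heuristic; one common way to realize "use the ratio-cut measure to determine where to split" is to sweep along the sorted second eigenvector (the function name and the dense eigensolver are illustrative):

```python
import numpy as np

def ratio_cut_partition(W):
    """Split on the eigenvector of the 2nd smallest eigenvalue of L = D - W,
    choosing the split point along the sorted vector that minimizes cut(A,B)/(|A||B|)."""
    n = W.shape[0]
    L = np.diag(W.sum(axis=1)) - W
    _, vecs = np.linalg.eigh(L)              # eigenvalues in ascending order
    fiedler = vecs[:, 1]                     # eigenvector associated with lambda_2
    order = np.argsort(fiedler)
    best_score, best_A = np.inf, None
    for m in range(1, n):                    # sweep over candidate split points
        A = np.zeros(n, dtype=bool)
        A[order[:m]] = True
        score = W[np.ix_(A, ~A)].sum() / (m * (n - m))   # ratio-cut objective
        if score < best_score:
            best_score, best_A = score, A
    return best_A, best_score
```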

Normalized cuts [Shi-Malik, 2000]

➤ Recall the notation w(X, Y) = Σ_{x∈X, y∈Y} w(x, y) - then define:

    ncut(A, B) = cut(A, B) / w(A, V) + cut(A, B) / w(B, V)

➤ Goal is to avoid small sets A, B

Exercise 1: What is w(A, V) in the case when wij ≡ 1?

➤ Let x be an indicator vector:

    xi = 1   if i ∈ A
    xi = 0   if i ∈ B

➤ Recall that: xᵀ L x = Σ_{(i,j)∈E} wij |xi − xj|²   (note: each edge counted once)

➤ Therefore:

    cut(A, B) = Σ_{xi=1, xj=0} wij = xᵀ L x

    w(A, V) = Σ_{xi=1} di = xᵀ W 1 = xᵀ D 1

    w(B, V) = Σ_{xj=0} dj = (1 − x)ᵀ W 1 = (1 − x)ᵀ D 1

➤ Goal now: to minimize ncut

    min_{A,B} ncut(A, B) = min_{xi∈{0,1}}  (xᵀ L x)/(xᵀ D x) + (xᵀ L x)/((1 − x)ᵀ D (1 − x))



➤ Let

    β = w(A, V) / w(B, V) = (xᵀ D 1) / ((1 − x)ᵀ D 1)    and    y = x − β (1 − x)

➤ Then we need to solve:

    min_{yi ∈ {1, −β}}  (yᵀ L y)/(yᵀ D y)    subject to   yᵀ D 1 = 0

➤ Relax → need to solve the generalized eigenvalue problem

    L y = λ D y

➤ y1 = 1 is the eigenvector associated with the eigenvalue λ1 = 0

➤ y2, associated with the second smallest eigenvalue, solves the problem.

A few properties

Exercise 2: Show that

    ncut(A, B) = σ × cut(A, B) / (w(A, V) × w(B, V))

where σ is a constant.

Exercise 3: How do ratio-cuts and normalized cuts compare when the graph is d-regular (same degree for each node)?

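A sketch of the relaxed normalized cut using SciPy's symmetric-definite generalized eigensolver; it assumes every node has positive degree (so D is positive definite) and uses the simplest possible split, the sign of y2:

```python
import numpy as np
from scipy.linalg import eigh

def normalized_cut_partition(W):
    """Relaxation: solve L y = lambda * D y and split on the sign of y2."""
    d = W.sum(axis=1)
    D = np.diag(d)                  # assumes d > 0 everywhere (D positive definite)
    L = D - W
    _, vecs = eigh(L, D)            # generalized eigenproblem, ascending eigenvalues
    y2 = vecs[:, 1]                 # eigenvector for the second smallest eigenvalue
    return y2 >= 0                  # boolean mask for the set A
```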

Extension to more than 2 clusters

➤ Just like graph partitioning we can:

1. Apply the method recursively [repeat the clustering on the resulting parts], or
2. Compute a few eigenvectors and run K-means clustering on these eigenvectors to get the clustering.

Application: Image segmentation

➤ First task: obtain a graph from the pixels.

➤ Common idea: use “heat kernels”

➤ Let Fj = feature value (e.g., brightness), and let Xj = spatial position. Then define

    wij = exp(−‖Fi − Fj‖² / σ_I²) × exp(−‖Xi − Xj‖² / σ_X²)   if ‖Xi − Xj‖ < r
    wij = 0                                                     otherwise

➤ Sparsity depends on the parameters

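A NumPy sketch of these pixel weights, taking F as an n × q array of per-pixel feature vectors and X as an n × 2 array of pixel coordinates (dense, so only for small images; the names are illustrative):

```python
import numpy as np

def image_weights(F, X, sigma_I, sigma_X, r):
    """w_ij = exp(-||F_i-F_j||^2/sigma_I^2) * exp(-||X_i-X_j||^2/sigma_X^2) for nearby pixels."""
    dF = np.linalg.norm(F[:, None, :] - F[None, :, :], axis=2)   # feature distances
    dX = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)   # spatial distances
    W = np.exp(-dF**2 / sigma_I**2) * np.exp(-dX**2 / sigma_X**2)
    W[dX >= r] = 0.0                                             # keep only nearby pixels
    np.fill_diagonal(W, 0.0)
    return W
```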


Spectral clustering: General approach

1. Given: a collection of data samples {x1, x2, · · · , xn}
2. Build a similarity graph between the items [figure: nodes i and j joined by an edge of weight w(i,j) = ?]
3. Compute the (smallest) eigenvector(s) of the resulting graph Laplacean
4. Use k-means on the eigenvector(s) of the Laplacean

➤ For normalized cuts, solve the generalized eigenproblem.

[figure: a graph with several connected components]

➤ Algebraic multiplicity of the eigenvalue zero = # of connected components.
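A compact sketch of steps 1–4, starting from a precomputed weight matrix W; it uses SciPy's kmeans2 for the final step, and the function name is illustrative:

```python
import numpy as np
from scipy.cluster.vq import kmeans2

def spectral_clustering(W, k):
    """Laplacean -> eigenvectors of the k smallest eigenvalues -> k-means on their rows."""
    L = np.diag(W.sum(axis=1)) - W
    _, vecs = np.linalg.eigh(L)          # eigenvalues in ascending order
    U = vecs[:, :k]                      # one row of k spectral coordinates per data item
    _, labels = kmeans2(U, k, minit='++')
    return labels
```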

Building a nearest neighbor graph

➤ Question: How to build a nearest-neighbor graph from given data?

[figure: a cloud of data points turned into a nearest-neighbor graph]

➤ Will demonstrate the power of a divide-and-conquer approach combined with the Lanczos algorithm.

➤ Note: The Lanczos algorithm will be covered in detail later.

Recall: Two common types of nearest neighbor graphs

ε-graph: edges consist of pairs (xi, xj) such that ρ(xi, xj) ≤ ε

kNN graph: nodes adjacent to xi are those nodes xℓ with the k smallest distances ρ(xi, xℓ).

➤ The ε-graph is undirected and is geometrically motivated. Issues: 1) it may result in disconnected components; 2) what ε?

➤ kNN graphs are directed in general (this can be trivially fixed).

➤ kNN graphs are especially useful in practice.



Divide and conquer KNN: key ingredient

➤ Key ingredient is spectral bisection

➤ Let the data matrix X = [x1, . . . , xn] ∈ R^{d×n}

➤ Each column == a data point.

➤ Center the data: X̂ = [x̂1, . . . , x̂n] = X − c eᵀ, where c == the centroid and e = ones(n, 1) (matlab)

Goal: Split X̂ into halves using a hyperplane.

Method: Principal Direction Divisive Partitioning [D. Boley ’98].

Idea: Use (σ, u, v) = the largest singular triplet of X̂, with:

    uᵀ X̂ = σ vᵀ.

➤ The hyperplane is defined as ⟨u, x⟩ = 0, i.e., it splits the set of data points into two subsets:

    X+ = {xi | uᵀ x̂i ≥ 0}   and   X− = {xi | uᵀ x̂i < 0}.

[figure: data points separated by the hyperplane into a ‘+ side’ and a ‘− side’]

➤ Note that uᵀ x̂i = uᵀ X̂ ei = σ vᵀ ei →


    X+ = {xi | vi ≥ 0}   and   X− = {xi | vi < 0},

where vi is the i-th entry of v.

➤ In practice: replace the above criterion by

    X+ = {xi | vi ≥ med(v)}   and   X− = {xi | vi < med(v)},

where med(v) == the median of the entries of v.

➤ For the largest singular triplet (σ, u, v) of X̂: use the Golub-Kahan-Lanczos algorithm, or Lanczos applied to X̂ X̂ᵀ or X̂ᵀ X̂

➤ Cost (assuming s Lanczos steps): O(n × d × s); usually d is very small

Two divide and conquer algorithms

Overlap method: divide the current set into two overlapping subsets X1, X2

Glue method: divide the current set into two disjoint subsets X1, X2, plus a third set X3 called the gluing set.

[figure: the hyperplane splits the data into overlapping sets X1, X2 (overlap method), or into disjoint sets X1, X2 plus a gluing set X3 straddling the hyperplane (glue method)]

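A sketch of this spectral bisection step with SciPy's Lanczos-based svds, using the median-split variant (the function name is illustrative):

```python
import numpy as np
from scipy.sparse.linalg import svds     # Lanczos-type solver for a few singular triplets

def principal_direction_split(X):
    """Split the columns of X (d x n): center, take the largest singular triplet of X_hat,
    and separate points by the median of the right singular vector v."""
    c = X.mean(axis=1, keepdims=True)    # centroid c
    X_hat = X - c                        # X_hat = X - c e^T
    _, _, vt = svds(X_hat, k=1)          # largest singular triplet (sigma, u, v)
    v = vt.ravel()
    plus = v >= np.median(v)             # X_+ = {x_i | v_i >= med(v)}
    return np.where(plus)[0], np.where(~plus)[0]
```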


The Overlap Method

➤ Divide the current set X into two overlapping subsets:

    X1 = {xi | vi ≥ −hα(Sv)}   and   X2 = {xi | vi < hα(Sv)},

• where Sv = {|vi| | i = 1, 2, . . . , n},
• and hα(·) is a function that returns an element larger than (100α)% of those in Sv.

➤ Rationale: to ensure that the two subsets overlap in (100α)% of the data, i.e.,

    |X1 ∩ X2| = ⌈α |X|⌉.

The Glue Method

Divide the set X into two disjoint subsets X1 and X2 with a gluing subset X3:

    X1 ∪ X2 = X,   X1 ∩ X2 = ∅,   X1 ∩ X3 ≠ ∅,   X2 ∩ X3 ≠ ∅.

Criterion used for splitting:

    X1 = {xi | vi ≥ 0},   X2 = {xi | vi < 0},
    X3 = {xi | −hα(Sv) ≤ vi < hα(Sv)}.

Note: the gluing subset X3 here is just the intersection of the sets X1, X2 of the overlap method.

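Given the right singular vector v, both divisions take only a few lines; in this sketch hα(Sv) is approximated by the α-quantile of the |vi| (the names are illustrative):

```python
import numpy as np

def overlap_split(v, alpha):
    """Overlap method: X1, X2 share roughly (100*alpha)% of the points around the hyperplane."""
    h = np.quantile(np.abs(v), alpha)     # stand-in for h_alpha(S_v)
    return np.where(v >= -h)[0], np.where(v < h)[0]

def glue_split(v, alpha):
    """Glue method: disjoint halves X1, X2 plus the gluing set X3 straddling the hyperplane."""
    h = np.quantile(np.abs(v), alpha)
    X1 = np.where(v >= 0)[0]
    X2 = np.where(v < 0)[0]
    X3 = np.where((v >= -h) & (v < h))[0]
    return X1, X2, X3
```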

Approximate kNN Graph Construction: The Overlap Method

function G = kNN-Overlap[X, k, α]
   if |X| < nk
      G ← Call kNN-BruteForce[X, k]
   else
      (X1, X2) ← Call Divide-Overlap[X, α]
      G1 ← Call kNN-Overlap[X1, k, α]
      G2 ← Call kNN-Overlap[X2, k, α]
      G ← Call Conquer[G1, G2]
      Call Refine[G]
   EndIf
End

Approximate kNN Graph Construction: The Glue Method

function G = kNN-Glue[X, k, α]
   if |X| < nk
      G ← Call kNN-BruteForce[X, k]
   else
      (X1, X2, X3) ← Call Divide-Glue[X, α]
      G1 ← Call kNN-Glue[X1, k, α]
      G2 ← Call kNN-Glue[X2, k, α]
      G3 ← Call kNN-Glue[X3, k, α]
      G ← Call Conquer[G1, G2, G3]
      Call Refine[G]
   EndIf
End

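For concreteness, a hypothetical Python sketch of the overlap method that combines the spectral bisection and overlap division above; it merges the neighbor lists of the two sub-calls and omits the Refine step (the names, the n_min threshold, and the candidate-pruning strategy are illustrative, not the paper's implementation):

```python
import numpy as np
from scipy.sparse.linalg import svds

def brute_force_knn(X, idx, k):
    """Exact k nearest neighbors among the columns X[:, idx]."""
    D = np.linalg.norm(X[:, idx][:, :, None] - X[:, idx][:, None, :], axis=0)
    np.fill_diagonal(D, np.inf)
    return {idx[i]: set(idx[np.argsort(D[i])[:k]]) for i in range(len(idx))}

def knn_overlap(X, k, alpha=0.1, idx=None, n_min=100):
    """Recursive overlap method (sketch): bisect, recurse, merge candidate neighbor lists."""
    if idx is None:
        idx = np.arange(X.shape[1])
    if len(idx) <= n_min:
        return brute_force_knn(X, idx, k)               # kNN-BruteForce base case
    Xc = X[:, idx] - X[:, idx].mean(axis=1, keepdims=True)
    _, _, vt = svds(Xc, k=1)                            # largest singular triplet (Lanczos)
    v = vt.ravel()
    h = np.quantile(np.abs(v), alpha)                   # stand-in for h_alpha(S_v)
    X1, X2 = idx[v >= -h], idx[v < h]                   # Divide-Overlap
    if len(X1) == len(idx) or len(X2) == len(idx):      # degenerate split: fall back
        return brute_force_knn(X, idx, k)
    G = knn_overlap(X, k, alpha, X1, n_min)
    for node, nbrs in knn_overlap(X, k, alpha, X2, n_min).items():
        G.setdefault(node, set()).update(nbrs)          # Conquer: union of candidates
    for node in G:                                      # keep only the k closest candidates
        cand = np.fromiter(G[node], dtype=int)
        d = np.linalg.norm(X[:, cand] - X[:, [node]], axis=0)
        G[node] = set(cand[np.argsort(d)[:k]])
    return G
```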


Theorem: The time complexity of the overlap method is

    To(n) = Θ(d n^{to}),   where   to = log_{2/(1+α)} 2 = 1 / (1 − log2(1 + α)).

Theorem: The time complexity of the glue method is

    Tg(n) = Θ(d n^{tg} / α),   where tg is the solution of the equation   2/2^t + α^t = 1.

Example: When α = 0.1, then to ≈ 1.16 while tg ≈ 1.12.
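A quick numerical check of these exponents (SciPy's brentq just root-finds the glue equation):

```python
import numpy as np
from scipy.optimize import brentq

alpha = 0.1
t_o = 1.0 / (1.0 - np.log2(1.0 + alpha))                           # overlap exponent
t_g = brentq(lambda t: 2.0 / 2.0**t + alpha**t - 1.0, 1.0, 2.0)    # glue exponent
print(round(t_o, 2), round(t_g, 2))                                # about 1.16 and 1.12
```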


Reference:
Jie Chen, Haw-Ren Fang and Yousef Saad, “Fast Approximate kNN Graph Construction for High Dimensional Data via Recursive Lanczos Bisection”, JMLR, vol. 10, pp. 1989-2012 (2009).
