DisjointSet Slide

Union-find data structures maintain a collection of disjoint sets and answer queries such as whether two elements are connected. The key operations are MAKE-SET, FIND-SET, and UNION. The union-by-size and union-by-rank heuristics link the smaller or lower-ranked tree under the larger or higher-ranked one, which bounds each UNION and FIND-SET operation by O(log n) time. Path compression further speeds up FIND-SET by pointing every node on the search path directly at the root. Together, union-by-rank and path compression process m operations in O(m log* n) time.

Uploaded by

RishabhGupta251


DISJOINT SET DATA STRUCTURE

Partha P. Chakrabarti & Aritra Hazra

Department of Computer Science and Engineering
Indian Institute of Technology Kharagpur
Disjoint-Set Data Structures: Applications
Minimum Spanning Tree of Graph (G)
Algorithm MST_Kruskal ( G = (V,E) ) {
    A = { };
    for each v in V do MAKE-SET(v);
    for each edge e = (u, v) in E ordered by increasing weight(u, v) do {
        if FIND-SET(u) ≠ FIND-SET(v) then {
            A = A + {(u, v)};
            UNION(FIND-SET(u), FIND-SET(v));
        }
    }
    return A;
}
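The Kruskal pseudocode above can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function name `kruskal`, the `(weight, u, v)` edge format, and the sample graph are assumptions made for the example, and a root is represented as its own parent instead of by a 0 entry. The UNION here uses no balancing heuristic.

```python
def kruskal(vertices, edges):
    """Return MST edges; `edges` holds (weight, u, v) tuples (assumed format)."""
    parent = {v: v for v in vertices}  # MAKE-SET: each vertex is its own root

    def find_set(x):
        # Follow parent pointers until the root (its own parent) is reached.
        while parent[x] != x:
            x = parent[x]
        return x

    mst = []
    for weight, u, v in sorted(edges, key=lambda e: e[0]):  # increasing weight
        root_u, root_v = find_set(u), find_set(v)
        if root_u != root_v:           # u and v lie in different components
            mst.append((u, v, weight))
            parent[root_v] = root_u    # UNION of the two roots (no heuristic)
    return mst


# Hypothetical sample graph on vertices a, b, c, d.
edges = [(1, "a", "b"), (4, "a", "c"), (2, "b", "c"), (7, "c", "d"), (3, "b", "d")]
tree = kruskal("abcd", edges)
total = sum(w for _, _, w in tree)
```

Edges that would close a cycle, here (a, c) and (c, d), are skipped because both endpoints already share a root.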
Disjoint-Set Data-Type and Operations
•  Primary Operations:
–  MAKE-SET(x): create a new set containing only element x
–  FIND-SET(x): return a canonical element in the set containing x
–  UNION(x, y): replace the sets containing x and y with their union

•  Performance parameters:
–  m = number of calls to FIND-SET and UNION operations
–  n = number of elements = number of calls to MAKE-SET
•  Application: Dynamic connectivity over initially empty graph (disjoint sets = connected components)
–  ADD-NODE(u): add node u (1 MAKE-SET operation)
–  ADD-EDGE(u, v): add an edge between nodes u and v (1 UNION operation)
–  IS-CONNECTED(u, v): is there a path between u and v ? (2 FIND-SET operations)
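The mapping from graph operations to set operations listed above can be sketched as a small Python class. The class name `DynamicConnectivity` and its method names are assumptions chosen to mirror ADD-NODE, ADD-EDGE, and IS-CONNECTED; the underlying union-find is deliberately the plain version without heuristics.

```python
class DynamicConnectivity:
    """Connectivity queries over a graph that only grows (illustrative sketch)."""

    def __init__(self):
        self.parent = {}

    def add_node(self, u):
        # ADD-NODE: one MAKE-SET; a root is stored as its own parent.
        self.parent.setdefault(u, u)

    def _find_set(self, x):
        while self.parent[x] != x:
            x = self.parent[x]
        return x

    def add_edge(self, u, v):
        # ADD-EDGE: one UNION of the two components.
        root_u, root_v = self._find_set(u), self._find_set(v)
        if root_u != root_v:
            self.parent[root_v] = root_u

    def is_connected(self, u, v):
        # IS-CONNECTED: two FIND-SET calls, compared for equality.
        return self._find_set(u) == self._find_set(v)

    def component_count(self):
        # Number of roots = number of disjoint sets = connected components.
        return sum(1 for x, p in self.parent.items() if x == p)


g = DynamicConnectivity()
for node in range(1, 6):
    g.add_node(node)
g.add_edge(1, 2)
g.add_edge(4, 5)
```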
Disjoint-Set Operations: Implementation (1)
Linked List Implementation
•  Example: Set A: {c, h, e, b}; Set B: {f, g, d}; Set (A U B): {c, h, e, b, f, g, d}
•  MAKE-SET(x): O(1)
–  need to create only one node with appropriate pointers
•  FIND-SET(x): O(n)
–  need to traverse entire linked list to find x
•  UNION(x,y): O(n)
–  need to redirect all back-pointers of second list to head of first list
Disjoint-Set Operations: Implementation (2)
•  Array Representation
–  Represent each set as tree of elements
–  Allocate an array parent[] of length n
–  parent[i]=j (parent of element i is j)
–  parent[i]=0 marks element i as a root (self-loops for roots are suppressed for brevity)
[Figure: example trees and their parent[] array, before and after UNION(3,5) or UNION(8,6) or UNION(8,7)]
•  Analysis of Operations:
–  Total zeros in array = number of disjoint sets
–  FIND-SET(x): O(n) worst-case
–  UNION(x,y): O(n) worst-case
•  UNION(FIND-SET(x), FIND-SET(y))
•  O(n) due to FIND-SET operation
Solution: Smart Union-Find Algorithms!
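The O(n) worst case of the plain array representation can be seen in a short Python sketch. It follows the slide's convention that parent[i] = 0 marks a root and elements are numbered from 1; the helper names `make_sets`, `find_set`, and `union`, and the step counter, are assumptions added for illustration.

```python
def make_sets(n):
    # Index 0 is unused so that a 0 entry can mean "this element is a root".
    return [0] * (n + 1)


def find_set(parent, x):
    """Return (root, steps), where steps counts the parent pointers followed."""
    steps = 0
    while parent[x] != 0:
        x = parent[x]
        steps += 1
    return x, steps


def union(parent, x, y):
    r, _ = find_set(parent, x)
    s, _ = find_set(parent, y)
    if r != s:
        parent[r] = s  # always link the root of x under the root of y


# An unlucky order of unions builds the chain 1 -> 2 -> ... -> 8.
parent = make_sets(8)
for i in range(1, 8):
    union(parent, i, i + 1)
root, steps = find_set(parent, 1)
```

With this order every union deepens the tree by one level, so finding the root from element 1 follows n − 1 pointers.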
Smart Disjoint-Set Operations: Union-by-Size
•  Union-by-Size
–  Maintain a tree size (number of nodes) for each root node
–  Link root of smaller tree to root of larger tree (break ties arbitrarily)

MAKE-SET(x) {
    parent[x] ← 0;
    size[x] ← 1;
    return x;
}

FIND-SET(x) {
    while (parent[x] ≠ 0)      // x is not a root
        x ← parent[x];
    return x;
}

UNION(x,y) {
    r ← FIND-SET(x);
    s ← FIND-SET(y);
    if (r == s) return r;
    else if (size[r] > size[s]) {
        parent[s] ← r;
        size[r] = size[r] + size[s];
        return r;
    }
    else {
        parent[r] ← s;
        size[s] = size[r] + size[s];
        return s;
    }
}
[Figure: example of UNION(3,5)]
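A minimal Python sketch of union-by-size follows. It assumes elements numbered 0 to n − 1 and stores a root as its own parent (instead of the slide's 0 marker); the class name `UnionBySize` is hypothetical. Ties are broken in favour of the first root, which the heuristic permits since ties may be broken arbitrarily.

```python
class UnionBySize:
    def __init__(self, n):
        self.parent = list(range(n))  # MAKE-SET for every element
        self.size = [1] * n           # size is meaningful only at roots

    def find_set(self, x):
        while self.parent[x] != x:
            x = self.parent[x]
        return x

    def union(self, x, y):
        r, s = self.find_set(x), self.find_set(y)
        if r == s:
            return r
        if self.size[r] < self.size[s]:
            r, s = s, r               # make r the root of the larger tree
        self.parent[s] = r            # link smaller tree under larger tree
        self.size[r] += self.size[s]
        return r


ds = UnionBySize(8)
ds.union(0, 1)
ds.union(2, 3)
ds.union(1, 3)  # merges {0, 1} with {2, 3}
ds.union(4, 3)  # the singleton {4} is linked under the size-4 root
```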
Analysis of Union-by-Size Heuristic (1)
Property: Using union-by-size, for every root node r, we have $\mathrm{size}[r] \ge 2^{\mathrm{height}(r)}$
Proof: [ by induction on number of links; primes denote values after the link ]
–  Base case: singleton tree has size 1 and height 0
–  Inductive hypothesis: assume true after first i links
–  Tree rooted at r changes only when a smaller (or equal) size tree rooted at s is linked into r
–  Case 1. [ height(r) > height(s) ]
$\mathrm{size}'[r] > \mathrm{size}[r] \ge 2^{\mathrm{height}(r)} = 2^{\mathrm{height}'(r)}$
–  Case 2. [ height(r) ≤ height(s) ]
$\mathrm{size}'[r] = \mathrm{size}[r] + \mathrm{size}[s] \ge 2\,\mathrm{size}[s] \ge 2 \cdot 2^{\mathrm{height}(s)} = 2^{\mathrm{height}(s)+1} = 2^{\mathrm{height}'(r)}$
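The size/height property can also be checked empirically. The sketch below performs random unions with the union-by-size rule and then verifies that every root r satisfies size[r] ≥ 2^height(r). The function name `check_size_height`, the seeded random workload, and the self-parent root convention are assumptions for illustration; the check complements the proof and does not replace it.

```python
import random


def check_size_height(n, num_unions, seed=0):
    """Run random union-by-size operations, then test size[r] >= 2**height(r)."""
    rng = random.Random(seed)
    parent = list(range(n))
    size = [1] * n

    def find_set(x):
        while parent[x] != x:
            x = parent[x]
        return x

    for _ in range(num_unions):
        r, s = find_set(rng.randrange(n)), find_set(rng.randrange(n))
        if r == s:
            continue
        if size[r] < size[s]:
            r, s = s, r
        parent[s] = r
        size[r] += size[s]

    # Height of a root = largest number of links from any node in its tree.
    height = [0] * n
    for x in range(n):
        depth, y = 0, x
        while parent[y] != y:
            y = parent[y]
            depth += 1
        height[y] = max(height[y], depth)

    return all(size[r] >= 2 ** height[r] for r in range(n) if parent[r] == r)


holds = check_size_height(200, 500)
```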
Analysis of Union-by-Size Heuristic (2)
•  Theorem: Using union-by-size, any UNION or FIND-SET operation takes $O(\log_2 n)$ time in the worst case, where n is the number of elements
•  Proof:
–  The running time of each operation is bounded by the tree height
–  By the previous property, a tree with n nodes has height ≤ $\lfloor \log_2 n \rfloor$
•  The UNION operation takes O(1) time except for its two calls to FIND-SET
–  FIND-SET is required to find the set representative (which is the root)
•  A sequence of m UNION and FIND-SET operations takes a total of $O(m \log_2 n)$ time
Smart Disjoint-Set Operations: Union-by-Rank
•  Union-by-Rank (rank = height)
–  Maintain an integer rank for each node, initially 0
–  Link root of smaller rank to root of larger rank; if tie, increase rank of the new root by 1

MAKE-SET(x) {
    parent[x] ← 0;
    rank[x] ← 0;
    return x;
}

FIND-SET(x) {
    while (parent[x] ≠ 0)      // x is not a root
        x ← parent[x];
    return x;
}

UNION(x,y) {
    r ← FIND-SET(x);
    s ← FIND-SET(y);
    if (r == s) return r;
    else if (rank[r] ≥ rank[s]) {
        parent[s] ← r;
        if (rank[r] == rank[s])
            rank[r] = rank[r] + 1;
        return r;
    }
    else {
        parent[r] ← s;
        return s;
    }
}
[Figure: example of UNION(3,5)]
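Union-by-rank can be sketched in Python in the same way. The class name `UnionByRank` is hypothetical, elements are assumed to be numbered 0 to n − 1, and a root is stored as its own parent. As in the pseudocode, the rank of the surviving root grows only when the two ranks are equal.

```python
class UnionByRank:
    def __init__(self, n):
        self.parent = list(range(n))  # MAKE-SET for every element
        self.rank = [0] * n           # every node starts with rank 0

    def find_set(self, x):
        while self.parent[x] != x:
            x = self.parent[x]
        return x

    def union(self, x, y):
        r, s = self.find_set(x), self.find_set(y)
        if r == s:
            return r
        if self.rank[r] < self.rank[s]:
            r, s = s, r               # make r the root of larger (or equal) rank
        self.parent[s] = r
        if self.rank[r] == self.rank[s]:
            self.rank[r] += 1         # tie: the new root gains one rank
        return r


ds = UnionByRank(8)
ds.union(0, 1)  # tie at rank 0, so rank[0] becomes 1
ds.union(2, 3)  # tie at rank 0, so rank[2] becomes 1
ds.union(0, 2)  # tie at rank 1, so rank[0] becomes 2
ds.union(4, 0)  # rank 0 under rank 2, no rank change
```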
Analysis of Union-by-Rank Heuristic (1)
Property-1: If x is not a root node, then rank[x] < rank[parent[x]]
Proof: A node of rank k is created only by linking two roots of rank k – 1.

Property-2: If x is not a root node, then rank[x] will never change again
Proof: Rank changes only for roots; a non-root never becomes a root.

Property-3: If parent[x] changes, then rank[parent[x]] strictly increases.
Proof: The parent can change only for a root, so before linking parent[x] = 0. After x is linked using union-by-rank to new root r we have rank[r] > rank[x].
Analysis of Union-by-Rank Heuristic (2)
Property-4: Any root node of rank k has ≥ $2^k$ nodes in its tree
Proof: [ by induction on k ]
•  Base case: true for k = 0
•  Inductive hypothesis: assume true for k – 1
•  A node of rank k is created only by linking two roots of rank k – 1
•  By inductive hypothesis, each of the two sub-trees has ≥ $2^{k-1}$ nodes => resulting tree has ≥ $2^k$ nodes

Property-5: The highest rank of a node is ≤ $\lfloor \log_2 n \rfloor$

Proof: Immediately concluded from Property-1 and Property-4
Analysis of Union-by-Rank Heuristic (3)
Property-6: For any integer k ≥ 0, there are ≤ $n / 2^k$ nodes with rank k
Proof:
•  Any root node of rank k has ≥ $2^k$ descendants. [by Property-4]
•  Any non-root node of rank k has ≥ $2^k$ descendants because:
§  it had this property just before it became a non-root [by Property-4]
§  its rank does not change once it became a non-root [by Property-2]
§  its set of descendants does not change once it became a non-root
•  Different nodes of rank k cannot have common descendants [by Property-1]
Theorem: Using union-by-rank, any UNION or FIND-SET operation takes $O(\log_2 n)$ time in the worst case, where n is the number of elements.
Proof: The running time of UNION and FIND-SET is bounded by the tree height ≤ $\lfloor \log_2 n \rfloor$ [by Property-5]
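Property-5 and Property-6 can be illustrated on a concrete workload. The sketch below merges n = 16 singletons pairwise, round by round, which forces a rank tie at every link, and then counts how many nodes end up with each rank. The function name `rank_histogram` and the pairing schedule are assumptions chosen to make the bounds tight.

```python
from collections import Counter
from math import floor, log2


def rank_histogram(n):
    """Merge n singletons pairwise with union-by-rank; count nodes per rank."""
    parent = list(range(n))
    rank = [0] * n

    def find_set(x):
        while parent[x] != x:
            x = parent[x]
        return x

    def union(x, y):
        r, s = find_set(x), find_set(y)
        if r == s:
            return
        if rank[r] < rank[s]:
            r, s = s, r
        parent[s] = r
        if rank[r] == rank[s]:
            rank[r] += 1

    step = 1
    while step < n:
        # Link blocks of `step` elements pairwise; both roots have equal rank.
        for i in range(0, n - step, 2 * step):
            union(i, i + step)
        step *= 2
    return Counter(rank)


n = 16
hist = rank_histogram(n)
within_bound = all(count <= n / 2 ** k for k, count in hist.items())
```

For n = 16 the histogram is 8, 4, 2, 1, 1 nodes at ranks 0 through 4, so each count stays within n / 2^k and the highest rank equals log2 16 = 4.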
Smart Disjoint-Set Operations: Path Compression
•  When finding the root r of the tree containing x, change the parent pointer of all nodes along the path to point directly to r
Path Compression: Example
[Figure: tree before and after path compression]

FIND-SET(x) {
    if (parent[x] ≠ 0) {       // x is not a root
        parent[x] ← FIND-SET(parent[x]);
        return parent[x];
    }
    return x;
}
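An iterative, two-pass version of path compression can be sketched in Python. This variant is an assumption made for illustration (the slide uses recursion): the first pass locates the root, the second repoints every node on the path to it. Roots are stored as their own parent, and the function name `find_set` mirrors FIND-SET.

```python
def find_set(parent, x):
    """Return the root of x and point every node on the path directly at it."""
    # Pass 1: walk up to the root.
    root = x
    while parent[root] != root:
        root = parent[root]
    # Pass 2: walk the same path again, rewiring each node to the root.
    while parent[x] != root:
        parent[x], x = root, parent[x]
    return root


# A chain 4 -> 3 -> 2 -> 1 -> 0 with 0 as the root.
parent = [0, 0, 1, 2, 3]
root = find_set(parent, 4)
```

After the call every node on the path points at the root, so a later FIND-SET on any of them follows a single pointer.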
Properties of Union-by-Rank + Path Compression (1)
Property-0: The tree roots, node ranks, and elements within a tree are the same with or without path compression.
Property-1: If x is not a root node, then rank[x] < rank[parent[x]]
Proof: Path compression can make x point to only an ancestor of parent[x]
Property-2: If x is not a root node, then rank[x] will never change again
Property-3: If parent[x] changes, then rank[parent[x]] strictly increases.
Proof: Path compression doesn't change any ranks, but it can change parents
–  If parent[x] doesn't change during a path compression, the inequality continues to hold
–  If parent[x] changes, then rank[parent[x]] strictly increases
Property-4: Any root node of rank k has ≥ $2^k$ nodes in its tree
Property-5: The highest rank of a node is ≤ $\lfloor \log_2 n \rfloor$
Property-6: For any integer k ≥ 0, there are ≤ $n / 2^k$ nodes with rank k
Properties of Union-by-Rank + Path Compression (2)
•  Definitions: Rank Groups
–  Inverse Ackermann Function:
$\log^* n = 0$ when $n \le 1$
$\log^* n = \min\{\, i \ge 0 \mid \underbrace{\log_2 \log_2 \cdots \log_2}_{i \text{ times}} n \le 1 \,\}$ when $n \ge 2$
–  Recursive Definition:
$\log^* n = 0$ when $n \le 1$
$\log^* n = 1 + \log^*(\log_2 n)$ otherwise
–  Ackermann Function:
$F(j) = 1$ when $j = 0$
$F(j) = 2^{F(j-1)}$ when $j \ge 1$
Property-7: The largest group number is ≤ $\log^*(\log_2 n) = \log^* n - 1$
Proof: Since the largest possible rank is $\lfloor \log_2 n \rfloor$, hence the result
Property-8: The number of nodes in a particular group g is given by $n_g < n / F(g)$
Proof: $n_g < \sum_{r=F(g-1)+1}^{F(g)} \frac{n}{2^r} < \frac{2n}{2^{F(g-1)+1}} = \frac{n}{2^{F(g-1)}} = \frac{n}{F(g)}$
[ since $\frac{n}{2^r} + \frac{n}{2^{r+1}} + \frac{n}{2^{r+2}} + \cdots + \frac{n}{2^{r+k}} < \frac{n}{2^r} \sum_{k=0}^{\infty} \frac{1}{2^k} = \frac{2n}{2^r}$ ]
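The two definitions above are easy to evaluate directly. The sketch below implements log* by repeated application of log2 and F by its recurrence; the function names `log_star` and `F` are chosen to match the slide's symbols, and the use of floating-point `math.log2` is an implementation assumption that is exact for the powers of two used here.

```python
from math import log2


def log_star(n):
    """Number of times log2 must be applied to n before the result is <= 1."""
    count = 0
    while n > 1:
        n = log2(n)
        count += 1
    return count


def F(j):
    """F(0) = 1 and F(j) = 2 ** F(j - 1): a tower of j twos."""
    return 1 if j == 0 else 2 ** F(j - 1)


towers = [F(j) for j in range(5)]
stars = [log_star(F(j)) for j in range(5)]
```

The two functions are inverse to each other on these values: F grows as 1, 2, 4, 16, 65536, while log* of those values is 0, 1, 2, 3, 4.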
Analysis of Union-by-Rank with Path Compression (1)
Consider FIND-SET(u), where x is the root of the tree and v is a node on the path from u to x.
[Figure: path from u through v to the root x, before and after path compression w.r.t. FIND-SET(u)]
•  Case-1: If v is the root (= x), a child of the root, or if parent[v] is in a different rank group, then we charge ONE unit of time to the FIND-SET operation
•  Case-2: If v ≠ x, and both v and parent[v] are in the same group, then we charge ONE unit of time to node v
•  Observation-1: Ranks of nodes on the path from u to x increase monotonically
–  After x is found to be the root, we do path compression
–  If later on x becomes a child of another node, and v and x are in different groups, then no more node charges fall on v in later FIND-SET operations
Analysis of Union-by-Rank with Path Compression (2)
•  Observation-2: If a node v is in group g (g > 0), v can be moved and charged at most [F(g) – F(g-1)] times before it acquires a parent in a higher group.

•  Complexity Analysis:
–  Time Complexity = (Number of nodes in group g) x (Movement charges across groups) x (Movement charges within groups) = (n/F(g)) x (log* n) x [F(g) – F(g-1)] ≤ n log* n [ since (n/F(g)) x [F(g) – F(g-1)] ≤ n ]

•  Theorem: The time complexity required to process m UNION and FIND-SET operations using union-by-rank with the path-compression heuristic is O(m log* n) in the worst case
–  which may also be stated as O(m) in practice, as log* n ≤ 5 (otherwise n would exceed the number of atoms in the universe!)
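Putting the two heuristics together gives the structure the theorem refers to. The following Python class is a minimal sketch under stated assumptions: the name `DisjointSet` is hypothetical, elements are numbered 0 to n − 1, and a root is stored as its own parent. FIND-SET is recursive, as on the path-compression slide; because union-by-rank keeps tree height at most ⌊log2 n⌋, the recursion stays shallow.

```python
class DisjointSet:
    """Union-by-rank with path compression (illustrative sketch)."""

    def __init__(self, n):
        self.parent = list(range(n))
        self.rank = [0] * n

    def find_set(self, x):
        # Path compression: every node visited ends up pointing at the root.
        if self.parent[x] != x:
            self.parent[x] = self.find_set(self.parent[x])
        return self.parent[x]

    def union(self, x, y):
        r, s = self.find_set(x), self.find_set(y)
        if r == s:
            return r
        if self.rank[r] < self.rank[s]:
            r, s = s, r
        self.parent[s] = r            # lower-rank root goes under higher-rank root
        if self.rank[r] == self.rank[s]:
            self.rank[r] += 1
        return r


ds = DisjointSet(6)
ds.union(0, 1)
ds.union(1, 2)
ds.union(3, 4)
components = len({ds.find_set(x) for x in range(6)})
```

The six elements end up in three sets, {0, 1, 2}, {3, 4}, and {5}, which is what the component count reports.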
Thank you
