Lecture06_RangeTree

Cairo University

Faculty of Computers and Artificial Intelligence

Computer Science Department

Advanced Data Structures Range and Kd Trees Dr. Amin Allam

[For more details, refer to “Introduction to Algorithms” by Thomas Cormen, et al.]

[For more details, refer to “Advanced Data Structures” by Peter Brass]

1 Range tree
A range query asks for the set of stored points (values) belonging to the query interval (range)
[lo, hi]. A simple red-black tree can answer range queries by first searching for the minimum
stored value ≥ lo and reporting it (if it is ≤ hi), then repeatedly executing the following
GetSuccessor(node) procedure from the last visited node until it returns null or a node containing
a value > hi. After each call except the last one, the returned node value is reported.

GetSuccessor(node)
    if (node.right ≠ null)
    then
        node ← node.right
        while (node.left ≠ null) node ← node.left
        return node
    else
        while (node ≠ root)
            if (node = node.parent.left) then return node.parent
            node ← node.parent
        return null

Note that the parent field need not be explicitly stored in each node. Alternatively, whenever
we go from a node to its left child, the node is pushed onto a stack of parents to be used whenever
needed. When the successor of a node without a right child is required, it is the node on top of
the stack, which is then popped. The size of such a stack is O(log n), where n is the number of
values stored in the balanced tree.
The visited nodes belong to one of the following sets of nodes:
• The O(log n) nodes on the path from the root to the node having the minimum value ≥ lo.
• The k nodes containing values belonging to the query range.
• The O(log n) nodes on the path from the root to the node at which the scan stops.
Each of the above nodes is visited at most 3 times: when it is reached from its parent, when it is
reached from its left child, and when it is reached from its right child. Therefore, the complexity
of executing GetSuccessor() k successive times to answer a range query is O(log n + k).
Unfortunately, the above method cannot be easily generalized to higher dimensions, such as
retrieving all two-dimensional points inside a query rectangle.
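As a minimal sketch of the successor-based range query described at the start of this section, the Python code below uses a plain balanced binary search tree with explicit parent pointers in place of a red-black tree (an assumption; only the balance matters for the bound). The names Node, build_balanced, first_at_least and range_query are hypothetical helpers, not part of the lecture.

```python
class Node:
    """BST node with an explicit parent pointer (hypothetical helper type)."""
    def __init__(self, value):
        self.value = value
        self.left = self.right = self.parent = None


def build_balanced(values):
    """Build a balanced BST from sorted, distinct values (stand-in for a red-black tree)."""
    if not values:
        return None
    mid = len(values) // 2
    node = Node(values[mid])
    node.left = build_balanced(values[:mid])
    node.right = build_balanced(values[mid + 1:])
    for child in (node.left, node.right):
        if child is not None:
            child.parent = node
    return node


def get_successor(node):
    """Return the node holding the next larger value, or None if there is none."""
    if node.right is not None:
        node = node.right
        while node.left is not None:
            node = node.left
        return node
    while node.parent is not None:          # same as "node != root"
        if node is node.parent.left:
            return node.parent
        node = node.parent
    return None


def first_at_least(root, lo):
    """Return the node with the minimum stored value >= lo, or None."""
    best, node = None, root
    while node is not None:
        if node.value >= lo:
            best, node = node, node.left
        else:
            node = node.right
    return best


def range_query(root, lo, hi):
    """Report all stored values in [lo, hi] in increasing order."""
    out = []
    node = first_at_least(root, lo)
    while node is not None and node.value <= hi:
        out.append(node.value)
        node = get_successor(node)
    return out


root = build_balanced([1, 3, 4, 5, 6, 7, 8, 9])
print(range_query(root, 3, 7))  # [3, 4, 5, 6, 7]
```

The loop in range_query stops at the first value greater than hi, so a stored value equal to hi is still reported.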


The orthogonal 2d range tree is a static data structure which answers 2d range queries. It retrieves
all two-dimensional points (x, y) inside a query rectangle {[qx.lo, qx.hi], [qy.lo, qy.hi]}.
The orthogonal 2d range tree is built in the following way:
• All input points are sorted based on their first (x) coordinate. A static binary search tree keyed by x
is built such that input points are stored in leaves, as shown in the basic tree of the figure below. The
remaining levels are constructed such that the tree is balanced and the key of each internal node
is the smallest x in its right subtree. Each internal node stores an interval spanning the smallest
and largest x existing in its subtree. Contrary to ordinary binary search trees, internal nodes only
guide the search queries and do not store actual input point data.
• For each node nodex in the basic tree keyed by x constructed above: construct an associated tree
exactly as described above, except that it stores only the input points existing in the subtree of nodex
and is keyed by the second (y) coordinate. The figure shows only some of these trees.

[Figure: orthogonal 2d range tree for the eight input points (1,4), (3,8), (4,10), (5,2), (6,7), (7,3),
(8,1), (9,5). The basic tree keyed by x has root interval [1,9] with key 6; its children are [1,5] (key 4)
and [6,9] (key 8); the next level holds [1,3] (key 3), [4,5] (key 5), [6,7] (key 7) and [8,9] (key 9); the
leaves store the points. Some associated trees keyed by y are also shown: that of the root [1,9] (root
interval [1,10]), that of [1,5] (root interval [2,10], with children [2,4] and [8,10]), that of [6,9] (root
interval [1,7]) and that of [6,7] (root interval [3,7], with leaves [3,3] and [7,7]).]

The following $O(\log^2 n + k)$ procedure reports all k input points belonging to a query rectangle.
Starting from the root of the basic tree, at each visited node:
• If the node interval is disjoint from the x query interval, stop following the path down.
• If the node interval partially overlaps the x query interval, follow both paths down.
• If the node interval is entirely contained in the x query interval, stop following the path
down, and do the following starting from the root of the associated tree of the current node:
    • If the node interval is disjoint from the y query interval, stop following the path down.
    • If the node interval partially overlaps the y query interval, follow both paths down.
    • If the node interval is entirely contained in the y query interval, stop following the path
    down, and report all input points stored in the leaves of this subtree.


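The above complexity follows because the query interval of each coordinate is decomposed into
O(log n) node intervals, similarly to the canonical representation decomposition described in the
segment tree lecture. Hence O(log n) associated trees are searched, each in O(log n) time, in addition
to the O(k) time needed to report the output.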
To retrieve all two-dimensional points (x, y) inside the query rectangle {[x=1, x=8], [y=2, y=5]}:
First, we search for the interval [x=1, x=8] in the basic tree to reach the intervals {[1,5], [6,7], [8,8]}.
For each node associated with these intervals, we search for [y=2, y=5] in its associated tree.
Searching for [y=2, y=5] in the associated tree of [1, 5], shown in the bottom left corner of the
above figure, starting from the root [2, 10] we reach the node [2, 4], which is entirely contained in
[2, 5], so we report all input points in the leaves of the subtree of [2, 4], which are (5, 2) and (1, 4).
Searching for [y=2, y=5] in the associated tree of [6, 7], starting from the root [3, 7] we reach the
node [3, 3], which is entirely contained in [2, 5], so we report the one input point in the subtree of
[3, 3], which is (7, 3). Searching for [y=2, y=5] in the associated tree of [8, 8] (which is not shown
in the figure) does not lead to any results.
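The query procedure and the worked example above can be sketched in Python as follows. This is only an illustration under a stated simplification: each node keeps its points in a list sorted by y, searched with binary search, instead of a full associated tree (the per-node cost stays O(log n)). The names RangeNode and query are hypothetical.

```python
from bisect import bisect_left, bisect_right


class RangeNode:
    """Node of the basic tree keyed by x (hypothetical helper type)."""
    def __init__(self, pts):
        # pts is sorted by x; the interval spans the smallest and largest x.
        self.lo, self.hi = pts[0][0], pts[-1][0]
        # Simplification: a y-sorted list stands in for the associated tree.
        self.by_y = sorted(pts, key=lambda p: (p[1], p[0]))
        self.ys = [p[1] for p in self.by_y]
        self.left = self.right = None
        if len(pts) > 1:
            mid = len(pts) // 2
            self.left = RangeNode(pts[:mid])
            self.right = RangeNode(pts[mid:])


def query(node, x_lo, x_hi, y_lo, y_hi, out):
    """Append to out every point with x in [x_lo, x_hi] and y in [y_lo, y_hi]."""
    if node is None or node.hi < x_lo or node.lo > x_hi:
        return                                  # disjoint: stop
    if x_lo <= node.lo and node.hi <= x_hi:     # entirely contained
        i = bisect_left(node.ys, y_lo)
        j = bisect_right(node.ys, y_hi)
        out.extend(node.by_y[i:j])              # report points with y in range
        return
    query(node.left, x_lo, x_hi, y_lo, y_hi, out)   # partial overlap:
    query(node.right, x_lo, x_hi, y_lo, y_hi, out)  # follow both paths


points = sorted([(1, 4), (3, 8), (4, 10), (5, 2), (6, 7), (7, 3), (8, 1), (9, 5)])
root = RangeNode(points)
found = []
query(root, 1, 8, 2, 5, found)
print(found)  # [(5, 2), (1, 4), (7, 3)]
```

The constructor here re-sorts by y at every node, which costs more than the construction described in the lecture; it is kept only for brevity.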
An orthogonal 2d range tree can be built by sorting all input points based on their x coordinate (if
two input points have equal x coordinates, they are compared based on their y coordinates), and then
recursively calling the following procedure root ← Build2dRangeTree(p[0 . . . n]):
Function Build2dRangeTree(p[ist . . . iend]): ist = start index, iend = 1 + last index
• imed ← ⌊(ist+iend)/2⌋ (index of the median point of p[ist . . . iend-1])
• Construct root node • root.key ← p[imed].x • root.interval ← [p[ist].x, p[iend-1].x]
• If iend − ist = 1, root is a leaf storing the point p[ist]; skip the next two steps
• root.left ← Build2dRangeTree(p[ist . . . imed])
• root.right ← Build2dRangeTree(p[imed . . . iend])
• root.assoc_tree ← Construct static binary search tree for all p[ist . . . iend-1] points keyed by y
• return root
Let T(n) be the time complexity of the above algorithm. The non-recursive part consists mainly
of constructing root.assoc_tree, which can be done by sorting points by y in O(n log n) then
constructing higher levels in O(n). The sorting part can be replaced by just an O(n) merge of
two sorted arrays if the recursive function also returns the points sorted by y. Therefore, the
non-recursive part takes only O(n) time and space. Thus, the time and space complexity of the
whole algorithm is T(n) = 2T(n/2) + O(n). Solving the recurrence leads to T(n) = O(n log n).
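The merging idea can be sketched in Python as below. This is an illustration under assumptions, not the lecture's implementation: each node keeps its y-sorted point list in a field named assoc (hypothetical), from which the upper levels of a static tree could be built in linear time, and the two children's lists are merged with heapq.merge instead of being re-sorted. Indices follow the half-open convention (iend = 1 + last index).

```python
from heapq import merge


class RangeNode:
    """Node of the basic tree (hypothetical helper type)."""
    def __init__(self, key, interval):
        self.key = key
        self.interval = interval
        self.left = self.right = None
        self.assoc = []  # points of this subtree sorted by (y, x)


def by_y(p):
    """Sort key: y first, ties broken by x."""
    return (p[1], p[0])


def build_2d_range_tree(p, ist, iend):
    """Build the subtree for p[ist:iend], where p is sorted by x."""
    i_med = (ist + iend) // 2
    root = RangeNode(p[i_med][0], (p[ist][0], p[iend - 1][0]))
    if iend - ist == 1:                  # leaf: a single point
        root.assoc = [p[ist]]
        return root
    root.left = build_2d_range_tree(p, ist, i_med)
    root.right = build_2d_range_tree(p, i_med, iend)
    # Linear-time merge of the two y-sorted child lists replaces sorting.
    root.assoc = list(merge(root.left.assoc, root.right.assoc, key=by_y))
    return root


points = sorted([(1, 4), (3, 8), (4, 10), (5, 2), (6, 7), (7, 3), (8, 1), (9, 5)])
root = build_2d_range_tree(points, 0, len(points))
print(root.key, root.interval)   # 6 (1, 9)
print(root.left.assoc)           # [(5, 2), (1, 4), (3, 8), (4, 10)]
```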
When sorting input points based on their y coordinate, if two input points have equal y coordinate
values, they are compared based on their x coordinate values. The reason is that binary search
trees do not behave properly if equal keys occur. We need to differentiate between keys using any
method, such that the same chosen method is used every time two keys need to be differentiated.
If several points coincide, only one of them should be stored in the tree.
Similarly, the above technique can be generalized to any number of dimensions d. An orthogonal
range tree with d dimensions also consists mainly of a basic tree based on the first coordinate (x)
values of all points, but the associated tree of each node nodex should be an orthogonal range
tree with d − 1 dimensions based on all coordinate values (except the first one (x)) of all points
belonging to the subtree of nodex. The construction time and space complexity of an orthogonal
range tree with d dimensions having n d-dimensional points is $T(n) = O(n \log^{d-1} n)$. The query
time complexity is $T(n) = O(\log^d n + k)$, where k is the number of points satisfying the query.
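One way to see where these bounds come from is the following sketch, which is not taken from the lecture. Here $T_d(n)$ and $Q_d(n)$ are assumed names for the construction cost and the query cost (excluding the k reported points) of a d-dimensional range tree on n points. Building a node of the basic tree costs one (d − 1)-dimensional range tree on its points, and a query visits O(log n) canonical nodes, each answered by a (d − 1)-dimensional query.

```latex
\begin{align*}
T_2(n) &= 2\,T_2(n/2) + O(n) = O(n \log n), \\
T_d(n) &= 2\,T_d(n/2) + T_{d-1}(n)
        \;\Longrightarrow\; T_d(n) = O(n \log^{d-1} n) \quad (d > 2), \\
Q_1(n) &= O(\log n), \\
Q_d(n) &= O(\log n) \cdot Q_{d-1}(n) = O(\log^{d} n).
\end{align*}
```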


2 Kd tree
Kd tree is a static data structure that supports d-dimensional orthogonal range queries in a set
of n d-dimensional points, exactly as the orthogonal range tree described before, but with different
time and space requirements. Kd tree requires only O(n) space and O(n log n) construction time,
regardless of the number of dimensions. However, the query time complexity is $O(n^{1-1/d} + k)$ if the
output consists of k points. In particular, the query time complexity for 2 dimensions is $O(\sqrt{n} + k)$,
which is worse than the $O(\log^2 n + k)$ time complexity of the orthogonal 2d range tree.
Consider eight 2d input points (x,y) shown in the left figure below to be stored in a Kd tree (2d
tree). First, we divide the input points into two equal (or almost-equal) subsets based on their first
(x) coordinate values. Then, we find a vertical line (whose equation is X=constant) that divides
the two subsets. The line L1 (X=4.5) is chosen for that purpose, as shown in the left figure. The
equation of the separating line L1 is then assigned to the root of the Kd tree (in the right figure).
The left subtree should contain all points lying to the left of L1 (having smaller x coordinate values
than the constant in the L1 equation). The right subtree should contain all points lying to the right of L1.
Internal nodes only act as separators, and input points are stored only in leaves.
Then, for each of the two subsets created above, we divide its points based on
their second (y) coordinate values. The horizontal line L2 (Y=6.5) divides the left subset of four
points, and the horizontal line L3 (Y=2.5) divides the right subset of four points. The Kd tree
nodes of the second level are created and assigned such line equations as shown in the right figure.
Note that if nodes of any level contain vertical separators, nodes in the following level should
contain horizontal separators, and vice versa. Similarly, nodes of the third level contain vertical
lines where each line separates the two input points existing in the leaves of its subtree.
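[Figure: left, the eight input points (4,8), (2,7), (3,6), (8,5), (1,4), (6,3), (5,2), (7,1) in the plane
together with the separating lines L1 to L7; right, the corresponding Kd tree. Root: L1 (X=4.5). Second
level: L2 (Y=6.5) and L3 (Y=2.5). Third level: L4 (X=2), L5 (X=3), L6 (X=6) and L7 (X=7). Leaves, from left
to right: (1,4), (3,6), (2,7), (4,8), (5,2), (7,1), (6,3), (8,5).]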

To retrieve all two-dimensional points (x, y) inside the query rectangle {[x=0, x=4], [y=5, y=7.5]},
we start from the root having the separator line L1 (X=4.5). Obviously, the query rectangle lies
entirely to the left of that separator, because query.x.hi < L1.x (4 < 4.5). Thus, we exclude the
right subtree from our search and follow the left path only.
The L2 (Y=6.5) separator is not helpful since it lies inside query . y interval, so the search follows
both left and right paths down. The L4 (X=2) and L5 (X=3) separators are not helpful as well,
so we follow both directions from both nodes to obtain four points. Comparing them against the
original query rectangle, only (3,6) and (2,7) are reported.


A 2d tree can be built by recursively calling the following procedure root ← Build2dTree(p[], X),
where NextCoord(X)=Y and NextCoord(Y)=X:
Function Build2dTree(p[], Coord):
• If p[] contains a single point, return a leaf node storing that point
• medcord ← The median Coord value of all points in p[] (for an even number of points, the larger
of the two middle values, so that pleft[] is not empty)
• pleft[] ← Points of p[] having Coord values < medcord
• pright[] ← Points of p[] having Coord values ≥ medcord
• Construct root node • root.sep_line ← A Coord value separating those of pleft[] and pright[]
• root.left ← Build2dTree(pleft, NextCoord(Coord))
• root.right ← Build2dTree(pright, NextCoord(Coord))
• return root
Since the median of n values can be calculated in expected O(n) time by a randomized selection
algorithm, the time complexity of the above procedure is T(n) = 2T(n/2) + O(n). Solving the
recurrence: T(n) = O(n log n).
The space complexity is S(n) = 2S(n/2) + O(1). Solving the recurrence: S(n) = O(n).
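The construction can be sketched in Python as below, under three assumptions that are not part of the lecture: the median is found by sorting (simpler than a linear-time selection, at the cost of a slower build), all coordinate values along each split are distinct, and the separator is placed midway between the two subsets. KdNode and build_2d_tree are hypothetical names; coord 0 stands for X and 1 for Y.

```python
class KdNode:
    """Internal nodes hold sep_line; leaves hold point (hypothetical helper type)."""
    def __init__(self, sep_line=None, point=None):
        self.sep_line = sep_line
        self.point = point
        self.left = self.right = None


def build_2d_tree(points, coord=0):
    """Build a 2d tree; coord is 0 for X and 1 for Y."""
    if len(points) == 1:
        return KdNode(point=points[0])
    # Assumption: median by sorting, not by linear-time selection.
    ordered = sorted(points, key=lambda p: p[coord])
    mid = len(ordered) // 2
    p_left, p_right = ordered[:mid], ordered[mid:]
    # Assumption: the separator lies midway between the two subsets.
    sep = (p_left[-1][coord] + p_right[0][coord]) / 2
    root = KdNode(sep_line=sep)
    root.left = build_2d_tree(p_left, 1 - coord)    # NextCoord
    root.right = build_2d_tree(p_right, 1 - coord)
    return root


points = [(4, 8), (2, 7), (3, 6), (8, 5), (1, 4), (6, 3), (5, 2), (7, 1)]
root = build_2d_tree(points)
print(root.sep_line, root.left.sep_line, root.right.sep_line)  # 4.5 6.5 2.5
```

With the midpoint choice, the eight example points yield the same separators as the figure (X=4.5, Y=6.5, Y=2.5, then X=2, X=3, X=6, X=7).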
A 2d tree can be queried by recursively calling the following procedure Query(root, query, X):
Query(node, query, Coord)
    if (node is leaf) report the stored point if it is contained in query
    else if (Coord = X)
        if (query.x.hi < node.sep_line) then Query(node.left, query, Y)
        else if (query.x.lo ≥ node.sep_line) then Query(node.right, query, Y)
        else Query(node.left, query, Y), Query(node.right, query, Y)
    else if (Coord = Y)
        if (query.y.hi < node.sep_line) then Query(node.left, query, X)
        else if (query.y.lo ≥ node.sep_line) then Query(node.right, query, X)
        else Query(node.left, query, X), Query(node.right, query, X)
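A minimal Python sketch of this query follows, run on the example tree from this section. The tree is written by hand as nested tuples, which is an assumed representation chosen for brevity: an internal node is (sep_line, left, right) and a leaf is the point (x, y). The query is given as ((x_lo, x_hi), (y_lo, y_hi)), and coord 0 stands for X and 1 for Y.

```python
def query(node, q, coord=0, out=None):
    """Collect the points of the 2d tree that lie inside the rectangle q."""
    if out is None:
        out = []
    if len(node) == 2:                       # leaf: a point (x, y)
        x, y = node
        if q[0][0] <= x <= q[0][1] and q[1][0] <= y <= q[1][1]:
            out.append(node)
        return out
    sep_line, left, right = node
    lo, hi = q[coord]
    if hi < sep_line:                        # query lies entirely on the left side
        query(left, q, 1 - coord, out)
    elif lo >= sep_line:                     # query lies entirely on the right side
        query(right, q, 1 - coord, out)
    else:                                    # separator crosses the query
        query(left, q, 1 - coord, out)
        query(right, q, 1 - coord, out)
    return out


# The example tree: L1 (X=4.5); L2 (Y=6.5), L3 (Y=2.5); L4..L7 (X=2, 3, 6, 7).
tree = (4.5,
        (6.5, (2, (1, 4), (3, 6)), (3, (2, 7), (4, 8))),
        (2.5, (6, (5, 2), (7, 1)), (7, (6, 3), (8, 5))))

print(query(tree, ((0, 4), (5, 7.5))))  # [(3, 6), (2, 7)]
```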
To understand the $O(\sqrt{n})$ part of the query time complexity, consider a very thin horizontal query
rectangle. At the first level (Coord=X), the search goes to both left and right directions (so now
two nodes of the second level are visited in addition to the root in the first level). At the second level
(Coord=Y), the search goes to only one direction (so each of the two visited nodes in the second
level will lead to one node in the third level, so only two more nodes are visited in the third level).
Thus, the number of visited nodes is 1 (root) + 2 (second level) + 2 (third level) + 4 (fourth level)
+ 4 + 8 + 8 + . . . + $2^{\frac{1}{2}\log_2 n}$ + $2^{\frac{1}{2}\log_2 n}$ (because we know that the number of added terms (levels)
equals the tree height = $\log_2 n$). Therefore the sum is at most
$1 + 2(2^0 + 2^1 + 2^2 + \cdots + 2^{\frac{1}{2}\log_2 n}) = 1 + 2(2^{(\frac{1}{2}\log_2 n)+1} - 1) = 4(2^{\frac{1}{2}\log_2 n}) - 1$.
Since $2^{\frac{1}{2}\log_2 n} = 2^{\log_2(n^{1/2})} = n^{1/2} = \sqrt{n}$, the sum is $O(\sqrt{n})$.
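As a quick arithmetic check of the sum above, the following Python sketch evaluates 1 + 2 + 2 + 4 + 4 + ... for a few values of n and compares it with the closed-form bound 4√n − 1. The function name visited_nodes is hypothetical, and n is assumed to be a power of 4 so that (1/2) log2 n is an integer.

```python
import math


def visited_nodes(n):
    """Evaluate 1 + 2 + 2 + 4 + 4 + ... up to two terms of 2^((1/2) log2 n)."""
    half_height = round(math.log2(n)) // 2          # (1/2) log2 n
    return 1 + 2 * sum(2 ** j for j in range(1, half_height + 1))


for n in (16, 256, 4096):
    bound = 4 * math.isqrt(n) - 1                   # 4 * sqrt(n) - 1
    print(n, visited_nodes(n), bound)
# 16 13 15
# 256 61 63
# 4096 253 255
```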
A Kd tree can be generalized to higher dimensions by cycling through different dimensions. For
example, to handle 3-dimensional input points (x,y,z), the first tree level should separate points
based on their x coordinate values. The second tree level should separate points based on their y
coordinate values. The third tree level should separate points based on their z coordinate values.
The fourth tree level should separate points based on their x coordinate values, and so on.
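Cycling through the dimensions can be sketched in Python with the split axis chosen as depth mod d. This is an illustration under assumptions, not the lecture's code: nodes are nested tuples (an assumed representation), the median is found by sorting, and the query descends to a side whenever the separator does not rule it out, which stays correct even if some coordinate values repeat.

```python
def build_kd_tree(points, depth=0):
    """Leaf: ('leaf', point). Internal node: ('node', axis, sep, left, right)."""
    if len(points) == 1:
        return ("leaf", points[0])
    axis = depth % len(points[0])            # x, y, z, x, y, z, ...
    ordered = sorted(points, key=lambda p: p[axis])
    mid = len(ordered) // 2
    sep = (ordered[mid - 1][axis] + ordered[mid][axis]) / 2
    return ("node", axis, sep,
            build_kd_tree(ordered[:mid], depth + 1),
            build_kd_tree(ordered[mid:], depth + 1))


def query(node, lo, hi, out):
    """Collect points p with lo[i] <= p[i] <= hi[i] for every coordinate i."""
    if node[0] == "leaf":
        p = node[1]
        if all(l <= c <= h for l, c, h in zip(lo, p, hi)):
            out.append(p)
        return
    _, axis, sep, left, right = node
    if lo[axis] <= sep:                      # left side holds values <= sep
        query(left, lo, hi, out)
    if hi[axis] >= sep:                      # right side holds values >= sep
        query(right, lo, hi, out)


points = [(1, 4, 2), (3, 6, 5), (2, 7, 1), (4, 8, 9),
          (5, 2, 3), (7, 1, 8), (6, 3, 6), (8, 5, 4)]
tree = build_kd_tree(points)
found = []
query(tree, (2, 2, 1), (6, 7, 6), found)
print(sorted(found))
```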
There exist dynamic insert and delete operations for Kd trees. However, the suggested
implementations are not guaranteed to effectively maintain the balance and defined characteristics
of the Kd tree. Thus, the Kd tree cannot be considered a dynamic data structure.
