K-D Trees

kd-trees, invented by Jon Bentley in the 1970s, are a data structure for organizing points in a k-dimensional space that supports efficient range and nearest-neighbor searches. Each node recursively partitions the space into two halves along a cutting dimension, and insertion, deletion, and finding minimum values each have their own algorithms. Nearest-neighbor search uses pruning to cut down the search space, often giving a runtime closer to O(2^d + log n) in practice.
kd-Trees

Dr. K. Kartheeban
kd-Trees

• Invented in the 1970s by Jon Bentley

• The name originally meant "3d-trees, 4d-trees, etc.", where k was the number of dimensions

• Now, people say "kd-tree of dimension d"

• Idea: each level of the tree compares against one dimension

• Lets us have only two children at each node (instead of 2^d)
kd-trees

• Each level has a "cutting dimension"

• Cycle through the dimensions as you walk down the tree (in 2-d: x, y, x, y, ...)

• Each node contains a point P = (x,y)

• To find (x',y') you only compare the coordinate from the cutting dimension
  - e.g. if the cutting dimension is x, then you ask: is x' < x?

[Figure: a kd-tree whose levels alternate cutting dimensions x, y, x, y]
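The layout above translates directly into a node type. Here is a minimal Python sketch; the names KDNode and DIM are assumptions chosen to match the pseudocode on the later slides, and the cutting dimension is not stored in the node because it is implied by the depth:

    DIM = 2  # k, the number of dimensions (2 for the (x,y) examples here)

    class KDNode:
        def __init__(self, data):
            self.data = data    # the point stored at this node, e.g. (30, 40)
            self.left = None    # subtree with coordinate <  data[cd] in the cutting dimension
            self.right = None   # subtree with coordinate >= data[cd] in the cutting dimension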
kd-tree example
insert: (30,40), (5,25), (10,12), (70,70), (50,30), (35,45)

[Figure: the plane partitioned by these points, and the resulting tree:

                (30,40)        cut on x
               /       \
          (5,25)      (70,70)  cut on y
          /           /
     (10,12)     (50,30)       cut on x
                 /
            (35,45)            cut on y
]
Insert Code

insert(Point x, KDNode t, int cd) {
    if t == null
        t = new KDNode(x)
    else if (x == t.data)
        // error! duplicate
    else if (x[cd] < t.data[cd])
        t.left = insert(x, t.left, (cd+1) % DIM)
    else
        t.right = insert(x, t.right, (cd+1) % DIM)
    return t
}
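A direct, runnable Python transcription of this routine (a sketch using the KDNode class above; raising on duplicates is one choice, the slide only marks that case as an error):

    def insert(x, t, cd=0):
        # insert point x into the subtree rooted at t; cd is the cutting dimension
        if t is None:
            t = KDNode(x)
        elif x == t.data:
            raise ValueError("duplicate point")  # error! duplicate
        elif x[cd] < t.data[cd]:
            t.left = insert(x, t.left, (cd + 1) % DIM)
        else:
            t.right = insert(x, t.right, (cd + 1) % DIM)
        return t

    # build the example tree from the slide above
    root = None
    for p in [(30, 40), (5, 25), (10, 12), (70, 70), (50, 30), (35, 45)]:
        root = insert(p, root)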
FindMin in kd-trees
• FindMin(d): find the point with the smallest value in the dth dimension.

• Recursively traverse the tree.

• If cutdim(current_node) = d, then the minimum can't be in the right subtree, so recurse on just the left subtree.
  - If there is no left subtree, then the current node is the min for the tree rooted at this node.

• If cutdim(current_node) ≠ d, then the minimum could be in either subtree, so recurse on both subtrees.
  - (Unlike in 1-d structures, we often have to explore several paths down the tree.)
FindMin

FindMin(x-dimension): returns (1,10)

[Figure: the example point set and its tree:

                    (51,75)              cut on x
                   /       \
            (25,40)         (70,70)      cut on y
            /     \         /     \
      (10,30)  (35,90)  (55,1)  (60,80)  cut on x
        /           \
    (1,10)       (50,50)                 cut on y
]
FindMin

FindMin(y-dimension): returns (55,1)

[Figure: the same example tree as above; the point with minimum y-coordinate is (55,1)]
FindMin

FindMin(y-dimension): space searched

[Figure: the same tree, highlighting the paths explored: at nodes that cut on y only the left subtree is searched, while at nodes that cut on x both subtrees must be searched]
FindMin Code

Point findmin(Node T, int dim, int cd):

    // empty tree
    if T == NULL: return NULL

    // T splits on the dimension we're searching
    // => only visit left subtree
    if cd == dim:
        if T.left == NULL: return T.data
        else: return findmin(T.left, dim, (cd+1)%DIM)

    // T splits on a different dimension
    // => have to search both subtrees
    else:
        return minimum(findmin(T.left, dim, (cd+1)%DIM),
                       findmin(T.right, dim, (cd+1)%DIM),
                       T.data)
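The same routine in runnable Python (a sketch using the KDNode class above; the three-way minimum is written out explicitly, since minimum() in the pseudocode has to ignore NULL results):

    def findmin(t, dim, cd=0):
        # return the point with the smallest coordinate in dimension dim, or None
        if t is None:
            return None
        next_cd = (cd + 1) % DIM
        if cd == dim:
            # this node splits on dim: the minimum cannot be in the right subtree
            if t.left is None:
                return t.data
            return findmin(t.left, dim, next_cd)
        # this node splits on another dimension: must search both subtrees
        candidates = [t.data, findmin(t.left, dim, next_cd), findmin(t.right, dim, next_cd)]
        return min((p for p in candidates if p is not None), key=lambda p: p[dim])

    # on the example tree built earlier: findmin(root, 0) == (5, 25)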
Delete in kd-trees

Want to delete node A. Assume the cutting dimension of A is cd.

In a BST, we'd replace A with findmin(A.right). Here, we have to use B = findmin(A.right, cd).

Everything in the left subtree Q has cd-coordinate less than B's, and everything in the right subtree P has cd-coordinate ≥ B's.

[Figure: node A cutting on dimension cd, with subtrees Q and P; B = findmin(A.right, cd) replaces A]
Delete in kd-trees --- No Right Subtree

• What if the right subtree is empty?

• Possible idea: find the max in the left subtree?
  - Why might this not work?

• Suppose I findmax(T.left) and get point (a,b): it's possible that T.left contains another point with x = a. Now, our equal-coordinate invariant (points equal in the cutting dimension go right) is violated!

[Figure: node (x,y) cutting on x, whose left subtree Q contains both (a,b) and (a,c)]
No right subtree --- Solution

• Swap the subtrees of the node to be deleted

• B = findmin(T.left), i.e. the min of the new right subtree

• Replace the deleted node by B

Now, if there is another point with x = a, it appears in the right subtree, where it should be.

[Figure: the node now stores (a,b); the old left subtree Q, including (a,c), hangs off the right]
Delete Code

Point delete(Point x, Node T, int cd):
    if T == NULL: error point not found!
    next_cd = (cd+1)%DIM

    // This is the point to delete:
    if x == T.data:
        // use min(cd) from right subtree:
        if T.right != NULL:
            T.data = findmin(T.right, cd, next_cd)
            T.right = delete(T.data, T.right, next_cd)
        // swap subtrees and use min(cd) from new right:
        else if T.left != NULL:
            T.data = findmin(T.left, cd, next_cd)
            T.right = delete(T.data, T.left, next_cd)
            T.left = NULL // the old left subtree has moved to the right
        else:
            T = NULL // we're a leaf: just remove

    // this is not the point, so search for it:
    else if x[cd] < T.data[cd]:
        T.left = delete(x, T.left, next_cd)
    else:
        T.right = delete(x, T.right, next_cd)

    return T
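And a runnable Python version (a sketch reusing the findmin and KDNode sketches above):

    def delete(x, t, cd=0):
        # delete point x from the subtree rooted at t; returns the new subtree root
        if t is None:
            raise ValueError("point not found")
        next_cd = (cd + 1) % DIM
        if x == t.data:
            if t.right is not None:
                # replace this point with the cd-minimum of the right subtree
                t.data = findmin(t.right, cd, next_cd)
                t.right = delete(t.data, t.right, next_cd)
            elif t.left is not None:
                # swap subtrees: take the cd-minimum of the old left subtree
                t.data = findmin(t.left, cd, next_cd)
                t.right = delete(t.data, t.left, next_cd)
                t.left = None
            else:
                t = None  # leaf: just remove it
        elif x[cd] < t.data[cd]:
            t.left = delete(x, t.left, next_cd)
        else:
            t.right = delete(x, t.right, next_cd)
        return t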
Nearest Neighbor Searching in kd-trees

• Nearest neighbor queries are very common: given a point Q, find the point P in the data set that is closest to Q.

• Doesn't work: find the cell that would contain Q and return the point it contains.
  - Reason: the nearest point to Q in space may be far from Q in the tree.
  - E.g. NN(52,52):

[Figure: the example tree from the FindMin slides; descending with (52,52) ends in the cell of (55,1), but the true nearest neighbor is (50,50), in a different part of the tree]
kd-Trees Nearest Neighbor

• Idea: traverse the whole tree, BUT make two modifications to prune the search space:

1. Keep a variable with the closest point C found so far. Prune subtrees once their bounding boxes say that they can't contain any point closer than C.

2. Search the subtrees in the order that maximizes the chance for pruning.
Nearest Neighbor: Ideas, continued

If d > dist(C, Q), where d is the distance from the query point Q to the bounding box BB(T) of the subtree rooted at T, then no point in BB(T) can be closer to Q than C. Hence, there is no reason to search the subtree rooted at T.

[Figure: query point Q at distance d from the bounding box of the subtree rooted at T]

Update the best point so far, if T is better:
    if dist(C, Q) > dist(T.data, Q), C := T.data

Recurse, but start with the subtree "closer" to Q: first search the subtree that would contain Q if we were inserting Q below T.
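The code on the next slide assumes a bounding-box type Rect with distance, trimLeft, and trimRight operations. The slides don't define them, so here is one plausible sketch for axis-aligned boxes:

    import math

    class Rect:
        # axis-aligned bounding box given by its lower and upper corners
        def __init__(self, lo, hi):
            self.lo, self.hi = lo, hi  # e.g. lo = (0, 0), hi = (100, 100)

        def trim_left(self, cd, p):
            # the part of this box with coordinate < p[cd] in dimension cd
            hi = list(self.hi); hi[cd] = p[cd]
            return Rect(self.lo, tuple(hi))

        def trim_right(self, cd, p):
            # the part of this box with coordinate >= p[cd] in dimension cd
            lo = list(self.lo); lo[cd] = p[cd]
            return Rect(tuple(lo), self.hi)

        def distance(self, q):
            # Euclidean distance from point q to the nearest point of this box
            return math.sqrt(sum(max(l - c, 0, c - h) ** 2
                                 for c, l, h in zip(q, self.lo, self.hi)))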
Nearest Neighbor, Code

best and best_dist are global variables (they can also be passed into the function calls):

def NN(Point Q, kdTree T, int cd, Rect BB):

    // if this bounding box is too far, do nothing
    if T == NULL or distance(Q, BB) > best_dist: return

    // if this point is better than the best:
    dist = distance(Q, T.data)
    if dist < best_dist:
        best = T.data
        best_dist = dist

    // visit subtrees in the most promising order:
    next_cd = (cd+1) % DIM
    if Q[cd] < T.data[cd]:
        NN(Q, T.left, next_cd, BB.trimLeft(cd, T.data))
        NN(Q, T.right, next_cd, BB.trimRight(cd, T.data))
    else:
        NN(Q, T.right, next_cd, BB.trimRight(cd, T.data))
        NN(Q, T.left, next_cd, BB.trimLeft(cd, T.data))

Following Dave Mount's notes (page 77)
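Putting it together in runnable Python, passing best/best_dist through the recursion instead of using globals (a sketch built on the KDNode and Rect sketches above):

    import math

    def nn(q, t, bb, cd=0, best=None, best_dist=float("inf")):
        # returns (best point found so far, its distance to q)
        # prune: empty subtree, or bounding box farther away than the best point
        if t is None or bb.distance(q) > best_dist:
            return best, best_dist

        # update the best point so far if this node is closer
        d = math.dist(q, t.data)
        if d < best_dist:
            best, best_dist = t.data, d

        next_cd = (cd + 1) % DIM
        # visit the subtree that would contain q first, then the other one
        near = (t.left, bb.trim_left(cd, t.data))
        far = (t.right, bb.trim_right(cd, t.data))
        if q[cd] >= t.data[cd]:
            near, far = far, near
        best, best_dist = nn(q, near[0], near[1], next_cd, best, best_dist)
        best, best_dist = nn(q, far[0], far[1], next_cd, best, best_dist)
        return best, best_dist

    # e.g. nn((52, 52), root, Rect((0, 0), (100, 100))) on the example tree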


Nearest Neighbor Facts

• Might have to search close to the whole tree in the worst case. [O(n)]

• In practice, runtime is closer to O(2^d + log n):
  - log n to find cells "near" the query point
  - 2^d to search around cells in that neighborhood

• Three important concepts that recur in range / nearest neighbor searching:
  - storing partial results: keep the best so far, and update
  - pruning: reduce the search space by eliminating irrelevant subtrees
  - traversal order: visit the most promising subtree first
