0% found this document useful (0 votes)

71 views22 pages

Optimal Binary Search Tree

The document discusses the optimal binary search tree (OBST) problem. It defines the problem as finding a binary search tree with n keys that minimizes the expected cost of searches. It presents a 3 step approach: 1) Define the optimal substructure - optimal subtrees must contain contiguous key ranges with associated dummy keys as leaves 2) Define a recursive solution to compute the expected cost e[i,j] of searching an OBST containing keys ki to kj 3) Use dynamic programming to fill a table of e[i,j] values and track the optimal roots in a root table, requiring O(n3) time complexity.

Uploaded by

Nandini Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

71 views22 pages

Optimal Binary Search Tree

Uploaded by

Nandini Gupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 22

Optimal Binary

Search Tree
1.Preface

• OBST is one special kind of advanced tree.

• It focus on how to reduce the cost of the search of

the BST.

• It may not have the lowest height !

• It needs 3 tables to record probabilities, cost, and

root.
2.Premise
• It has n keys (representation k1,k2,…,kn) in sorted order (so that
k1<k2<…<kn), and we wish to build a binary search tree from
these keys. For each ki ,we have a probability pi that a search
will be for ki.
• In contrast of, some searches may be for values not in ki, and so
we also have n+1 “dummy keys” d0,d1,…,dn representating not
in ki.
• In particular, d0 represents all values less than k1, and dn
represents all values greater than kn, and for i=1,2,…,n-1, the
dummy key di represents all values between ki and ki+1.
＊ The dummy keys are leaves (external nodes), and the data
keys mean internal nodes.
3.Formula & Prove

• The case of search are two situations, one is success, and the
other, without saying, is failure.
• We can get the first statement :

Success Failure
• Because we have probabilities of searches for each key and
each dummy key, we can determine the expected cost of a
search in a given binary search tree T.
• Let us assume that the actual cost of a search is the number of
nodes examined, i.e., the depth of the node found by the search
in T,plus1(Assuming root at depth 0).
• Then the expected cost of a search in T is : (The second
statement)

Where depthT denotes a node’s depth in the tree T.

k2 k2

k1 k4 k1 k5

d0 d1
d0 d1 d5
k3 k5 k4

d2 d3 d4 d5 d4
k3
Figure (a)

i 0 1 2 3 4 5
d2 d3

pi 0.15 0.10 0.05 0.10 0.20

Figure (b)

qi 0.05 0.10 0.05 0.05 0.05 0.10

• By Figure (a), we can calculate the expected search cost node by node:

Cost=
Node# Depth probability cost
Probability *
k1 1 0.15 0.30 (Depth+1)

k2 0 0.10 0.10
k3 2 0.05 0.15
k4 1 0.10 0.20
K5 2 0.20 0.60
d0 2 0.05 0.15
d1 2 0.10 0.30
d2 3 0.05 0.20
d3 3 0.05 0.20
d4 3 0.05 0.20
d5 3 0.10 0.40
• And the total cost = (0.30 + 0.10 + 0.15 + 0.20 + 0.60 + 0.15 +
0.30 + 0.20 + 0.20 + 0.20 + 0.40 ) = 2.80
• So Figure (a) costs 2.80 ,on another, the Figure (b) costs 2.75,
and that tree is really optimal.
• We can see the height of (b) is more than (a) ,
• The key k5 has the greatest search probability of any key, yet
the root of the OBST shown is k2.
Step1:The structure of an OBST

• To characterize the optimal substructure of OBST,

we start with an observation about subtrees.
Consider any subtree of a BST.
• It must contain keys in a contiguous range ki,…,kj,
for some 1≦i ≦j ≦n.
• In addition, a subtree that contains keys ki,…,kj must
also have as its leaves the dummy keys di-1 ,…,dj.
• We need to use the optimal substructure to show that we
can construct an optimal solution to the problem from
optimal solutions to subproblems.
• Given keys ki ,…, kj, one of these keys, say kr (I ≦r ≦j),
will be the root of an optimal subtree containing these
keys.
• The left subtree of the root kr will contain the keys (ki ,…,
kr-1) and the dummy keys( di-1 ,…, dr-1), and the right
subtree will contain the keys (kr+1 ,…, kj) and the dummy
keys( dr ,…, dj).
• As long as we examine all candidate roots kr, where I ≦r
≦j, and we determine all optimal binary search trees
containing ki ,…, kr-1 and those containing kr+1 ,…, kj , we
are guaranteed that we will find an OBST.
• There is one detail worth nothing about “empty”
subtrees.
• Suppose that in a subtree with keys ki,...,kj, we select ki as
the root.
• By the above argument, ki ‘s left subtree contains the
keys ki,…, ki-1.
• It is natural to interpret this sequence as containing no
keys. It is easy to know that subtrees also contain dummy
keys.
• The sequence has no actual keys but does contain the
single dummy key di-1.
• Symmetrically, if we select kj as the root, then kj‘s right
subtree contains the keys, kj+1 …,kj; this right subtree
contains no actual keys, but it does contain the dummy
key dj.
Step2: A recursive solution
• We are ready to define the value of an optimal solution
recursively.
• We pick our subproblem domain as finding an OBST
containing the keys ki,…,kj, where i≧1, j ≦n, and j ≧
i-1. (It is when j=i-1 that ther are no actual keys; we
have just the dummy key di-1.)
• Let us define e[i,j] as the expected cost of searching an
OBST containing the keys ki,…, kj.
• Ultimately, we wish to compute e[1,n].
• The easy case occurs when j=i-1. Then we have just the dummy
key di-1. The expected search cost is e[i,i-1]= qi-1.
• When j≧1, we need to select a root kr from among ki,…,kj and
then make an OBST with keys ki,…,kr-1 its left subtree and an
OBST with keys kr+1,…,kj its right subtree.
• By the time, what happens to the expected search cost of a
subtree when it becomes a subtree of a node?
• The answer is that the depth of each node in the subtree
increases by 1.
• By the second statement, the excepted search cost of
this subtree increases by the sum of all the probabilities
in the subtree. For a subtree with keys ki,…,kj let us
denote this sum of probabilities as

Thus, if kr is the root of an optimal subtree containing

keys ki,…,kj, we have
• We rewrite e[i,j] as
e[i,j]= e[i,r-1] + e[r+1,j]+w(i,j)

The recursive equation as above assumes that we know

which node kr to use as the root. We choose the root
that gives the lowest expected search cost, giving us
our final recursive formulation:
• The e[i,j] values give the expected search costs in OBST. To
help us keep track of the structure of OBST, we define root[i,j],
for 1≦i≦j≦n, to be the index r for which kr is the root of an
OBST containing keys ki,…,kj.
Step3: Computing the expected
search cost of an OBST
• We store the e[i.j] values in a table e[1..n+1, 0..n]
• The first index needs to run to n+1 rather than n
because in order to have a subtree containing only
the dummy key dn, we will need to compute and
store e[n+1,n].
• The second index needs to start from 0 because in
order to have a subtree containing only the dummy
key d0, we will need to compute and store e[1,0].
• We will use only the entries e[i,j] for which j≧i-1.
we also use a table root[i,j], for recording the root of
the subtree containing keys ki,…, kj. This table uses
only the entries for which 1≦i≦j≦n.
• We will need one other table for efficiency. Rather
than compute the value of w(i,j) from scratch every
time we are computing e[i,j] ----- we tore these values
in a table w[1..n+1,0..n].
• For the base case, we compute w[i,i-1] = qi-1 for 1≦i
≦n.
• For j≧i, we compute :
The algorithm takes as inputs the probabilities p1…..pn and q0….qn
and the size n, and it returns the tables e and root
The tables e[i,j], w[i,j], and root [i,j]computed by Optimal-
BST(for figure b given at slide 6)
Time Complexity

• The OPTIMAL-BST procedure takes Ɵ(n3) time, just like

MATRIX-CHAINORDER.
• We can easily see that its running time is O(n3) since its for
loops are nested three deep and each loop index takes on at
most n values.

TCP2101 Algorithm Design & Analysis: - Binary Search Trees (BST)
No ratings yet
TCP2101 Algorithm Design & Analysis: - Binary Search Trees (BST)
105 pages
Optimal Binary Search Trees: Problem
No ratings yet
Optimal Binary Search Trees: Problem
16 pages
Group D - 8
No ratings yet
Group D - 8
5 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
8 pages
2001 Roura
No ratings yet
2001 Roura
12 pages
Daa Unit-3
No ratings yet
Daa Unit-3
72 pages
Unit IV
No ratings yet
Unit IV
141 pages
Binary Search Tree
No ratings yet
Binary Search Tree
45 pages
Unit-4 Search Trees
No ratings yet
Unit-4 Search Trees
163 pages
Unit 3 Long
No ratings yet
Unit 3 Long
112 pages
Binary Search Tree: Prof. Prateek Vishnoi
No ratings yet
Binary Search Tree: Prof. Prateek Vishnoi
52 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
25 pages
Dsa 4
No ratings yet
Dsa 4
50 pages
Unit-III OBST and All Pair Shortest Path
No ratings yet
Unit-III OBST and All Pair Shortest Path
43 pages
Ds Unit2 Obst After Review - 26
No ratings yet
Ds Unit2 Obst After Review - 26
26 pages
Lec05-AverageCaseBST After
No ratings yet
Lec05-AverageCaseBST After
25 pages
Lecture - 20 - Dynamic Programming - OBST
No ratings yet
Lecture - 20 - Dynamic Programming - OBST
24 pages
Optimal Binary Search Tree (OBST)
No ratings yet
Optimal Binary Search Tree (OBST)
104 pages
Minor Project Ms-64 Algorithm Design and Analysis: Optimal Binary Search Tree
No ratings yet
Minor Project Ms-64 Algorithm Design and Analysis: Optimal Binary Search Tree
14 pages
AAA Week05
No ratings yet
AAA Week05
46 pages
Dsa Manual - 1
No ratings yet
Dsa Manual - 1
19 pages
Search
No ratings yet
Search
33 pages
Daa Mid 2
No ratings yet
Daa Mid 2
17 pages
Optimal Cost Binary Search Trees
No ratings yet
Optimal Cost Binary Search Trees
4 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
8 pages
5.2 Optimal Binary Tree
No ratings yet
5.2 Optimal Binary Tree
9 pages
Gerald Samson
No ratings yet
Gerald Samson
18 pages
Lecture 8
No ratings yet
Lecture 8
30 pages
DS 9
No ratings yet
DS 9
41 pages
BST - 2
No ratings yet
BST - 2
7 pages
DSA Assignment 7 TH
No ratings yet
DSA Assignment 7 TH
4 pages
BST Easy and Medium
No ratings yet
BST Easy and Medium
6 pages
DSAL8 Word
No ratings yet
DSAL8 Word
5 pages
Chap 4 Part I Final
No ratings yet
Chap 4 Part I Final
14 pages
Binary Search Trees-1
No ratings yet
Binary Search Trees-1
44 pages
Binary Search Trees
No ratings yet
Binary Search Trees
34 pages
BST - 1
No ratings yet
BST - 1
12 pages
Dynamic Programming 3
No ratings yet
Dynamic Programming 3
20 pages
Linear Verification For Spanning Trees 1
No ratings yet
Linear Verification For Spanning Trees 1
6 pages
BST I
No ratings yet
BST I
9 pages
Optimal Binary Search TREES (Contd..) : P (K) +COST (L) +COST (R) +W (0, k-1) +W (K, N) .....
No ratings yet
Optimal Binary Search TREES (Contd..) : P (K) +COST (L) +COST (R) +W (0, k-1) +W (K, N) .....
55 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
3 pages
Solutions For HW5-CS 6033 Fall 2024
No ratings yet
Solutions For HW5-CS 6033 Fall 2024
13 pages
Data Structure Report On BST
No ratings yet
Data Structure Report On BST
9 pages
Binary Search Trees: Tim Doolan
No ratings yet
Binary Search Trees: Tim Doolan
14 pages
Lecture7 PDF
No ratings yet
Lecture7 PDF
8 pages
My Obst
No ratings yet
My Obst
14 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
7 pages
Ma/Csse 473 Day 28: Optimal Bsts
No ratings yet
Ma/Csse 473 Day 28: Optimal Bsts
29 pages
Optimal Binary Search Tree
No ratings yet
Optimal Binary Search Tree
25 pages
Problem Statement:: Optimal-Binary-Search-Tree (P, Q, N)
No ratings yet
Problem Statement:: Optimal-Binary-Search-Tree (P, Q, N)
3 pages
Final Report
No ratings yet
Final Report
7 pages
HW 3
No ratings yet
HW 3
4 pages
Practical No 8
No ratings yet
Practical No 8
2 pages
14 3 DP Optimal Binary Search Trees 4up
No ratings yet
14 3 DP Optimal Binary Search Trees 4up
4 pages
Obst
No ratings yet
Obst
11 pages
Practical Session No. 4 Trees: Trees As Basic Data Structures Tree
No ratings yet
Practical Session No. 4 Trees: Trees As Basic Data Structures Tree
11 pages
Unit I Flat LM Cse
No ratings yet
Unit I Flat LM Cse
31 pages
A1 Final Presentation - Quantum Pathfinding
No ratings yet
A1 Final Presentation - Quantum Pathfinding
12 pages
Fixed and Floating Point Representation
No ratings yet
Fixed and Floating Point Representation
5 pages
03-AI-ProblemSolving p2
No ratings yet
03-AI-ProblemSolving p2
191 pages
Handling Preferences in Student-Project Allocation
No ratings yet
Handling Preferences in Student-Project Allocation
40 pages
Comparative Study Between Density Based Clustering - Dbscan and Optics
No ratings yet
Comparative Study Between Density Based Clustering - Dbscan and Optics
4 pages
Frequent Pattern Based Clustering Methods
No ratings yet
Frequent Pattern Based Clustering Methods
23 pages
Unit 1
No ratings yet
Unit 1
19 pages
DATA STRUCURE & Algm II YEAR R2021 2023
No ratings yet
DATA STRUCURE & Algm II YEAR R2021 2023
2 pages
Newton School Course Final
No ratings yet
Newton School Course Final
5 pages
02 Lexical Analysis
No ratings yet
02 Lexical Analysis
86 pages
K-Nearest Neighbor
No ratings yet
K-Nearest Neighbor
1 page
Write A Program To Print Your Name.: Source Code
No ratings yet
Write A Program To Print Your Name.: Source Code
4 pages
Greedy Algorithms: CSE373: Design and Analysis of Algorithms
No ratings yet
Greedy Algorithms: CSE373: Design and Analysis of Algorithms
52 pages
Tsp-Shortest Path-Route-Finding-With-An-Elaborate-Text-File-Csv-Format-Reader
No ratings yet
Tsp-Shortest Path-Route-Finding-With-An-Elaborate-Text-File-Csv-Format-Reader
36 pages
Dimma:: A Design and Implementation Methodology For Metaheuristic Algorithms - A Perspective From Software Development
No ratings yet
Dimma:: A Design and Implementation Methodology For Metaheuristic Algorithms - A Perspective From Software Development
18 pages
Scheduling Algorithms
No ratings yet
Scheduling Algorithms
20 pages
6 Transportation Problem PDF
No ratings yet
6 Transportation Problem PDF
30 pages
Lecture #15: Regression Trees & Random Forests
No ratings yet
Lecture #15: Regression Trees & Random Forests
34 pages
Group
No ratings yet
Group
12 pages
Task 1: Search Algorithms: Depth First Search (DFS) - Is An Algorithm For Traversing or Searching Tree or Graph Data
No ratings yet
Task 1: Search Algorithms: Depth First Search (DFS) - Is An Algorithm For Traversing or Searching Tree or Graph Data
8 pages
Midterm Review 1
No ratings yet
Midterm Review 1
22 pages
Assignment No. 01: CS101-Introduction To Computing
No ratings yet
Assignment No. 01: CS101-Introduction To Computing
6 pages
IT T33-Data Structures: SMVEC - Department of Information Technology 1
No ratings yet
IT T33-Data Structures: SMVEC - Department of Information Technology 1
14 pages
Object-Oriented Programming (OOP)
No ratings yet
Object-Oriented Programming (OOP)
8 pages
Vector Network Analysis - GRASS-Wiki
No ratings yet
Vector Network Analysis - GRASS-Wiki
6 pages
4th Sem DAA Syllabus (R-23)
No ratings yet
4th Sem DAA Syllabus (R-23)
3 pages
Functions: Practical No.68
No ratings yet
Functions: Practical No.68
7 pages
187HW2 2008
No ratings yet
187HW2 2008
2 pages
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
From Everand
IGNOU BCA Computer Oriented Numerical Technique Previous Year Unsolved Papers BCS 054
Manish Soni
No ratings yet
Algebraic Equations
From Everand
Algebraic Equations
Demetrios P. Kanoussis
No ratings yet
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
From Everand
De Moiver's Theorem (Trigonometry) Mathematics Question Bank
Mohmmad Khaja Shareef
No ratings yet

Optimal Binary Search Tree

Uploaded by

Optimal Binary Search Tree

Uploaded by

Optimal Binary

• OBST is one special kind of advanced tree.

• It focus on how to reduce the cost of the search of

• It may not have the lowest height !

• It needs 3 tables to record probabilities, cost, and

Where depthT denotes a node’s depth in the tree T.

pi 0.15 0.10 0.05 0.10 0.20

qi 0.05 0.10 0.05 0.05 0.05 0.10

• To characterize the optimal substructure of OBST,

Thus, if kr is the root of an optimal subtree containing

The recursive equation as above assumes that we know

• The OPTIMAL-BST procedure takes Ɵ(n3) time, just like

You might also like