FS Mod 3 - Multilevel Indexing and B-Trees

The document discusses multilevel indexing and B-trees. It begins by describing the problem of slow access times when keeping indexes on secondary storage. B-trees were developed as a solution, providing rapid data access and retrieval with minimal overhead. The document then covers B-tree properties such as balancing, paging to improve disk utilization, searching, insertion which can cause splitting and promotion, and deletion which can cause merging or redistribution to maintain the B-tree structure.

Uploaded by

Mahesh R J

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

58 views37 pages

FS Mod 3 - Multilevel Indexing and B-Trees

Uploaded by

Mahesh R J

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 37

Multilevel Indexing and B-Trees

Introduction-Invention of B-trees
• The goal was the discovery of a general method
for storing and retrieving data in large file
systems that would provide rapid access to the
data with minimal overhead cost.
• Douglas Comer in 1979 wrote an article “The
ubiquitous B-Tree”.
• R Bayer and E.McRight in 1972 published
“organization and Maintainance of Large ordered
Indexes” which announced B-trees to the world.
Statement of the Problem
• Fundamental problem with keeping an index
on Secondary storage is slow. This can be
broken down into two specific problems.
– Searching the index must be faster than binary
searching
– Insertion and deletion must be as fast as search
Indexing the Binary Search Trees
• Looking at the cost of keeping a list in sorted order
we can perform binary searches.
After adding NP MB TM LA UF ND TS NK
AVL Trees
• In honor of the Russian mathematicians, G.M.Adel’son-
Vel’skkii and E.M.Landis who first defined them.
• An AVL tree is hight-balanced tree. There is a limit placed on
the amount of difference allowed between the heights of
any two subtrees sharing common root.
• In AVL tree maximum allowable difference is one.
• An AVL tree hence is called height-balanced 1-tree or HB(1)
tree.
• It is a member of a more general class of height-balanced
trees known as HB(k), which are permitted to be k levels out
of balance.
• Following tree has AVL or HB(1) property.
• BCGEFDA
Paged Binary Trees
• Disk utilization of binary search tree is extremely inefficient.
i.e. when we read a node there are only three useful pieces of
information- key value and address of the left and right
subtree.
• This wastes most of the data read from the disk, which is
critical factor in the cost of searching which we can not afford.
• Paged binary tree attempts to address the problem by locating
multiple binary nodes on the same disk page.
• Here we do not incur the cost of a disk seek just to get few
bytes.
• Once we take time to seek an area of the disk we read entire
page from the file.
• Paging is potential solution to the inefficient
disk utilization of binary search trees.
• By dividing a binary tree into pages and then
storing each page in a block of contiguous
locations on disk, we should be able to reduce
the number of seeks associated with any
search.
• Paging has the potential to result faster
searching on secondary storage.
• In this tree we are able to locate any of the 63 nodes in the
tree with no more two disk accesses.
• Every page holds 7 nodes and can branch to eight new
pages.
• If we extend to one more level we add 64 new pages, we can
find any one of 511 nodes in only three seeks.
Problems with paged trees
• Inefficient disk usage : In previous tree there
are seven nodes per page. Of the 14 reference
fields in a single page 6 of them are reference
nodes within the page. i.e. we are using 14
reference fields to distinguish between 8
subtrees. Still wastage of memory.
• How to build paged tree? : We need sorted
list to build a paged tree.
B-Trees:
• Create a B-Tree for the following elements
An object oriented representation of B-Trees
Class BTree: Supporting Files of B-Tree Nodes

• Class Btree uses in-memory BTreeNode

objects, adds the file access portion and
enforces the consistent size of the nodes.
• The following code defines class Btree .
Searching in B-Tree
• Characteristics of most B-Tree algorithms
1. They are iterative
2. They work in two stages, operating alternatively on
entire pages(Class Btree) and then within pages(class
BTreeNode)

• Searching procedure is iterative, loading a page into

memory and then searching through the page,
looking for the key successively lower levels of the
tree until reaches the leaf level.
Insertion
• There are two important observations we can make
about the insertion, splitting and promotion process:
• The first operation in method Insert is to search to the root for
key using FindLeaf:
thisNode = FindLeaf(key);
• The next step is to insert key into the leaf node
result = thisNode->Insert(key,recAddr)
• When overflow is detected, the node must be split into two
nodes using following code
newNode=NewNode();
thisNode->Split(newNode);
Store(thisNode);
Store(newNode);
• The next step is to update the parent node. Since the largest key
in thisNode has changed,method UpdateKey is used to record
the change
parentNode->UpdateKey(largestKey, thisNode->LargestKey());
Testing the B-Tree
Worst Case Search Depth
• It is important to understand the relationship between the
page size of B-tree , the number of keys to be stored in the
tree, and the number of levels that the tree can extend.
• Example: Suppose we want to store 1000000 keys and
that, given nature of storage hardware and the size of
keys, it is reasonable to consider using a B-tree of order
512.
• In the worst case what will be the max number of disk
accesses required to locate a key in the tree? Or how deep
the tree will be?
• We can answer this by noting every key appears
in the leaf level. Hence , we need to calculate the
maximum height of a tree with 1000000 in the
leaves.
• By observing formal definition of B-tree
properties to calculate minimum number of
descendants that can extend from any level of B-
tree of some given order.
• The worst case occurs when every page of the
tree has only maximum number of descendants.
• In such case the keys are spread over a maximal
height for the tree and a minimal breadth.
• For a B-tree of order m, the minimum number of
descendants from the root page is 2, so the second
level of the tree contains only 2 pages.
• Each of these pages, in turn, has at least m/2
descendants.
• The third level then contains 2Xm/2 pages.
• The general pattern of the relation between depth and
the minimum number of descendants takes following
form:
Deletion, Merging and Redistribution
1. Deletion of C from above tree does not affect the tree.
2. Deletion of P changes P to O in the second level and
the root.
3. Deleting H, Causes an underflow and two leaf nodes
were merged.

Leetcode Pareto Problem Set
No ratings yet
Leetcode Pareto Problem Set
1 page
1972 Bayer Mccreight
No ratings yet
1972 Bayer Mccreight
17 pages
Btree Data Structure
No ratings yet
Btree Data Structure
25 pages
Solution Manual To Chapter 05
100% (1)
Solution Manual To Chapter 05
13 pages
Btree Notes
No ratings yet
Btree Notes
9 pages
AVL Tree
No ratings yet
AVL Tree
29 pages
Data Structure Lecture 7 Tree
No ratings yet
Data Structure Lecture 7 Tree
49 pages
B Tree Application
100% (2)
B Tree Application
6 pages
B Trees
No ratings yet
B Trees
4 pages
Unit 5
No ratings yet
Unit 5
99 pages
DSA Unit 3 Notes
No ratings yet
DSA Unit 3 Notes
105 pages
B Tree
No ratings yet
B Tree
53 pages
Ques. On Heap Sort & Spanning Tree
100% (1)
Ques. On Heap Sort & Spanning Tree
6 pages
Unit-4 Tree Notes
No ratings yet
Unit-4 Tree Notes
81 pages
Data Structures. Harsha
No ratings yet
Data Structures. Harsha
35 pages
Software Design Using C++: An Online Book
No ratings yet
Software Design Using C++: An Online Book
15 pages
Binary Tree Handwritten Notes For Students
No ratings yet
Binary Tree Handwritten Notes For Students
6 pages
FS Mod3
No ratings yet
FS Mod3
46 pages
Data Structures & Algorithm Design: Trees
No ratings yet
Data Structures & Algorithm Design: Trees
38 pages
Cop3502 Final Study Guide 3 - 1
No ratings yet
Cop3502 Final Study Guide 3 - 1
28 pages
B Trees
No ratings yet
B Trees
62 pages
DSA-II UNIT-II B Tree
No ratings yet
DSA-II UNIT-II B Tree
46 pages
Chp2 - Advanced Data Structure
No ratings yet
Chp2 - Advanced Data Structure
88 pages
Mca Tree
No ratings yet
Mca Tree
31 pages
Unit V
No ratings yet
Unit V
55 pages
Ads 2 Part 3
No ratings yet
Ads 2 Part 3
60 pages
9.CCS224 - PART 2 - Lecture 4 (August 3, 2021)
No ratings yet
9.CCS224 - PART 2 - Lecture 4 (August 3, 2021)
30 pages
2 Question Bank
No ratings yet
2 Question Bank
3 pages
B Tree
No ratings yet
B Tree
46 pages
07 Priority Queues Heaps
No ratings yet
07 Priority Queues Heaps
37 pages
B-Trees DS
No ratings yet
B-Trees DS
28 pages
Multi Last
No ratings yet
Multi Last
10 pages
Trees MCQS
No ratings yet
Trees MCQS
12 pages
B-Trees Slides
No ratings yet
B-Trees Slides
24 pages
20200720215503D5797 - 20180725180123D5542 - COMP6049 Pert 6
No ratings yet
20200720215503D5797 - 20180725180123D5542 - COMP6049 Pert 6
75 pages
BCS401 ADA m3 Notes
No ratings yet
BCS401 ADA m3 Notes
24 pages
20mca14c U5
No ratings yet
20mca14c U5
26 pages
Rohini 94994211969
No ratings yet
Rohini 94994211969
6 pages
File Structures: An Object-Oriented Approach With C++ Chapters 9-12
No ratings yet
File Structures: An Object-Oriented Approach With C++ Chapters 9-12
37 pages
M-Way Trees: Multiway Trees: M-Way Search Trees, B-Trees, Operations On B-Trees, B+-Trees
No ratings yet
M-Way Trees: Multiway Trees: M-Way Search Trees, B-Trees, Operations On B-Trees, B+-Trees
20 pages
B-Trees: Based On Materials by D. Frey and T. Anastasio
No ratings yet
B-Trees: Based On Materials by D. Frey and T. Anastasio
33 pages
B Tree
No ratings yet
B Tree
63 pages
Minimum Spanning Tree
No ratings yet
Minimum Spanning Tree
24 pages
Trees
No ratings yet
Trees
30 pages
Splay Tree
No ratings yet
Splay Tree
17 pages
B+ Tree - Wikipedia
No ratings yet
B+ Tree - Wikipedia
35 pages
B Tree
No ratings yet
B Tree
17 pages
2 BPlus Trees
No ratings yet
2 BPlus Trees
26 pages
L04-X-B-Trees của cô nguyễn bích vân- đại học công nghệ thông tin
No ratings yet
L04-X-B-Trees của cô nguyễn bích vân- đại học công nghệ thông tin
24 pages
Trees (BST)
No ratings yet
Trees (BST)
17 pages
Design and Analysis of Algorithms: CSE 5311 Lecture 20 Minimum Spanning Tree
No ratings yet
Design and Analysis of Algorithms: CSE 5311 Lecture 20 Minimum Spanning Tree
44 pages
B Trees and Its Variants
No ratings yet
B Trees and Its Variants
55 pages
Imp Reddy B Trees
No ratings yet
Imp Reddy B Trees
25 pages
B Trees
No ratings yet
B Trees
25 pages
Data Structures Digital Notes-141-153
No ratings yet
Data Structures Digital Notes-141-153
13 pages
5 - Binary Tree
No ratings yet
5 - Binary Tree
7 pages
Prim's and Kruskal's
No ratings yet
Prim's and Kruskal's
20 pages
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
No ratings yet
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
27 pages
AI104 StudentsList and Assignments
No ratings yet
AI104 StudentsList and Assignments
2 pages
r20 Unit 4 Ads Notes
No ratings yet
r20 Unit 4 Ads Notes
25 pages
Various Data Structure
No ratings yet
Various Data Structure
56 pages
Data Structures Lab Project: Implementation and Optimization of Some Lesser-Known BST
No ratings yet
Data Structures Lab Project: Implementation and Optimization of Some Lesser-Known BST
14 pages
DSA - B - Tree
No ratings yet
DSA - B - Tree
19 pages
Augmenting Data Structures, Dynamic Order Statistics, Interval Trees
No ratings yet
Augmenting Data Structures, Dynamic Order Statistics, Interval Trees
25 pages
UNIT 3 Some Questions Ans
No ratings yet
UNIT 3 Some Questions Ans
10 pages
Algorithms: Modern Systems
No ratings yet
Algorithms: Modern Systems
21 pages
Recursion Problems
No ratings yet
Recursion Problems
7 pages
B-Tree Documentation
No ratings yet
B-Tree Documentation
12 pages
3rd Sem Mid Sem Paper
No ratings yet
3rd Sem Mid Sem Paper
13 pages
B Trees
No ratings yet
B Trees
24 pages
Algorithms Solutions: 1. Which of The Following Is/are True?
No ratings yet
Algorithms Solutions: 1. Which of The Following Is/are True?
6 pages
Data Structures Module 3
No ratings yet
Data Structures Module 3
10 pages
DS Trees Short Notes
No ratings yet
DS Trees Short Notes
12 pages
Kruskals Algorithm
No ratings yet
Kruskals Algorithm
15 pages
B Tree
No ratings yet
B Tree
6 pages
Multiway Search Tree
No ratings yet
Multiway Search Tree
16 pages
B-Tree Resume
No ratings yet
B-Tree Resume
4 pages
Data Structures Using C, 2e Jhalak Dutta
No ratings yet
Data Structures Using C, 2e Jhalak Dutta
16 pages
BST Deletion and Traversals
No ratings yet
BST Deletion and Traversals
14 pages
Applications of Trees in Real Life: Niño Dominic M. Matienzo Bsit - It1A
No ratings yet
Applications of Trees in Real Life: Niño Dominic M. Matienzo Bsit - It1A
18 pages
Data Structure Unit 3
No ratings yet
Data Structure Unit 3
7 pages
Balanced Trees
No ratings yet
Balanced Trees
3 pages
Unit-2 (Btree InsertionDelection)
No ratings yet
Unit-2 (Btree InsertionDelection)
4 pages
B Tree: Max Keys m-1 Min Keys (m/2) - 1 Max Child M Min Children m/2
No ratings yet
B Tree: Max Keys m-1 Min Keys (m/2) - 1 Max Child M Min Children m/2
8 pages
Farre BCA4
No ratings yet
Farre BCA4
1 page
B-Trees: Balanced Tree Data Structures
No ratings yet
B-Trees: Balanced Tree Data Structures
10 pages
Software Design Using C++: An Online Book
No ratings yet
Software Design Using C++: An Online Book
11 pages
Btree
No ratings yet
Btree
3 pages
B-Trees: Balanced Tree Data Structures
No ratings yet
B-Trees: Balanced Tree Data Structures
0 pages
Search Tree: Fundamentals and Applications
From Everand
Search Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet

FS Mod 3 - Multilevel Indexing and B-Trees

Uploaded by

FS Mod 3 - Multilevel Indexing and B-Trees

Uploaded by

Multilevel Indexing and B-Trees

• Class Btree uses in-memory BTreeNode

• Searching procedure is iterative, loading a page into

You might also like