0% found this document useful (0 votes)

2 views34 pages

CNG351 Lecture 12 B

Uploaded by

berayseray382

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views34 pages

CNG351 Lecture 12 B

Uploaded by

berayseray382

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 34

Indexing Structures for Files

CNG351 - Data Management and File Structures

Lecture - 12
Instructor: Dr. Yeliz Yesilada
Multi-Level Indexes
• Because a single-level index is an ordered file, we can create a
primary index to the index itself;
– In this case, the original index file is called the first-level
index and the index to the index is called the second-level
index.
• We can repeat the process, creating a third, fourth, ..., top level
until all entries of the top level fit in one disk block
• A multi-level index can be created for any type of first-level
index (primary, secondary, clustering) as long as the first-level
index consists of more than one disk block

CNG 351 - lecture 11 2/34

A Two-level Primary Index

CNG 351 - lecture 11 3/34

Multi-Level Indexes
• Multilevel index reduces the number of blocks
accessed when searching for a record given its
indexing field value.
• Such a multi-level index is a form of search tree
– However, insertion and deletion of new index entries is
a severe problem because every level of the index is
an ordered file.
• To retain the benefits of using multilevel indexing
while reducing index insertion and deletion problems,
designers adopted a multilevel index called dynamic
multilevel index.

CNG 351 - lecture 11 4/34

Tree ADT 101

CNG 351 - lecture 11 5/34

A Node in a Search Tree with Pointers to
Subtrees below It

CNG 351 - lecture 11 6/34

A search tree of order p = 3.

CNG 351 - lecture 11 7/34

Using a Search Tree
• We can use a search tree as a mechanism to search for records
stored in a disk file.
• The values in the tree can be values of one of the fields of the
file, called the search field.
• Each key value in the tree is associated with a pointer to the
record in the data file having that value.
• Two issues:
– Balanced tree, is important because it guarantees that no
nodes will be at very high levels and hence require many
block accesses during a search tree.
– Record deletion, some nodes can be empty wasting storage.
– B-trees address these issues.
CNG 351 - lecture 11 8/34
Dynamic Multilevel Indexes
Using B-Trees and B+-Trees
• Most multi-level indexes use B-tree or B+-tree data structures
because of the insertion and deletion problem
– This leaves space in each tree node (disk block) to allow for
new index entries
• These data structures are variations of search trees that allow
efficient insertion and deletion of new search values.
• In B-Tree and B+-Tree data structures, each node corresponds
to a disk block
• Each node is kept between half-full and completely full

CNG 351 - lecture 11 9/34

B-Tree
• The tree is always balanced and the space wasted by deletion ,
if any never becomes excessive.
• Formal definition of a B-tree of order p:
– Each internal node in the B-tree is of the form:
• <P1, <K1, Pr1>, P2, <K2, Pr2>,…., Pq>
– Within each node K1<K2…<Kq-1.
– Each node has at most p tree pointers.
– Each node, except the root and leaf nodes has at least
ceiling (p/2) tree pointers.
– A node with q tree pointers, q<=p, has q-1 search key field
values (and hence q-1 data pointers).
– All leaf nodes are at the same level.

CNG 351 - lecture 11 10/34

Example B-tree with p=3

CNG 351 - lecture 11 11/34

B-Trees

• Insertion:
– An insertion into a node that is not full is quite efficient
– If a node is full the insertion causes a split into two
nodes
– Splitting may propagate to other tree levels
• Deletion:
– A deletion is quite efficient if a node does not become
less than half full
– If a deletion causes a node to become less than half
full, it must be merged with neighboring nodes

CNG 351 - lecture 11 12/34

Inserting 16*, 8* into Example B tree
Root
13 17 24 30

2* 3* 5* 7* 8* 14* 15* 16*

You overflow

13 17 24 30

2* 3* 5* 7* 8*

One new child (leaf node)

generated; must add one more
pointer to its parent, thus one more
key value as well.
Inserting 8* (cont.)
• Copy up the
13 17 24 30
middle value (leaf
Entry to be inserted in parent node.
split) (Note that 5 is
s copied up and
5
continues to appear in the leaf.)

2* 3* 7* 8*

5 13 17 24 30 You overflow!
Insertion into B tree (cont.)
• Understand
difference between
copy-up and push- 5 13 17 24 30
up

• Observe how
minimum We split this node, redistribute entries evenly,
occupancy is and push up middle key.
guaranteed in both
leaf and index pg 
splits.
Entry to be inserted in parent node.
17 (Note that 17 is pushed up and only
appears once in the index. Contrast
this with a leaf split.)

5 13 24 30
Example B Tree After Inserting 8*
Root
17

5 13 24 30

2* 3* 7* 8* 14* 15* 19* 20* 22* 27* 29* 33* 34* 38* 39*

Notice that root was split, leading to increase in height.

CNG 351 - lecture 11 16/34

Delete 19* and 20*
Root
17

5 13 24 30

2* 3* 7* 8* 14* 16* 19* 20* 22* 27* 29* 33* 34* 38* 39*

ow
nde r fl
u 22*
You

22* 24*

Have we still forgot something?

Deleting 19* and 20* (cont.)
Root

5 13 27 30

2* 3* 7* 8* 14* 16* 22* 24* 29* 33* 34* 38* 39*

• Notice how 27 is copied up.

• But can we move it up?
• Now we want to delete 24
• Underflow again! But can we redistribute this time?

CNG 351 - lecture 11 18/34

w n g!
i
flo sibl
Deleting 24* un
r
de ith
• Observe the two leaf nodes ou ew
Y erg
are merged, and 27 is M
discarded from their parent,
but … 30

• Observe `pull down’ of index

entry (below).
22* 27* 29* 33* 34* 38* 39*

New root 5 13 17 30

2* 3* 7* 8* 14* 16* 22* 27* 29* 33* 34* 38* 39*

B+-Trees
• Is a variation of a B-tree.
• In a B-tree, pointers to data records exist at all levels
of the tree.
• In a B+-tree, all pointers to data records exists at the
leaf-level nodes (data pointers are only at the leaf
nodes).
• A B+-tree can have less levels (or higher capacity of
search values) than the corresponding B-tree.

CNG 351 - lecture 11 20/34

The Nodes of a B+-tree

CNG 351 - lecture 11 21/34

B+ Tree

25 50 75

5 10 15 20 25 30 50 55 60 65 75 80 85 90

B+ tree is a B tree that have its Leaf nodes form linked lists

22
Inserting a Data Entry into a B+ Tree: Summary
• Find correct leaf L.
• Put data entry onto L.
– If L has enough space, done!
– Else, must split L (into L and a new node L2)
• Redistribute entries evenly, put middle key in L2
• copy up middle key.
• Insert index entry pointing to L2 into parent of L.
• This can happen recursively
– To split index node, redistribute entries evenly, but push up
middle key. (Contrast with leaf splits.)
• Splits “grow” tree; root split increases height.
– Tree growth: gets wider or one level taller at top.

CNG 351 - lecture 11 23/34

Insertion Example
• Add Record with Key 28:

• Add Record with Key 70: 50 55 60 65 70

24
Insertion Example (cont.)

• Add Record with Key

95: 75 80 85 90 95
– Split Leaf Node:
25 50 60 75 85
– Split Parent Node:

• New B+ Tree:

25
Rotation

• B+ trees can incorporate rotation to reduce the

number of node splits. A rotation occurs when a leaf
node is full, but one of its sibling nodes is not full.
• Example: Insert 70
• Before

• After

26
Deleting a Data Entry from a B+ Tree: Summary
• Start at root, find leaf L where entry belongs.
• Remove the entry.
– If L is at least half-full, done!
– If L has only d-1 entries,
• Try to re-distribute, borrowing from sibling (adjacent node with
same parent as L).
• If re-distribution fails, merge L and sibling.
• If merge occurred, must delete entry (pointing to L or sibling) from
parent of L.
• Merge could propagate to root, decreasing height.

CNG 351 - lecture 11 27/34

B * Tree
• Same as B tree but non-root nodes to be at least 2/3
full instead of 1/2. To maintain this, instead of
immediately splitting up a node when it gets full, its
keys are shared with the node next to it. i.e. rotate
before you split

28
Leaf Below Index Below Delete Actions
Fill Factor Fill Factor
NO NO Delete the record from the leaf. If the key
appears in the index, use the next key to
replace it.

YES NO Combine the leaf and its sibling. Change

the index to reflect the change.
YES YES 1.Combine the leaf and its sibling.
2.Adjust the index to reflect the change.
3.Combine the index with its sibling.
Continue combining index nodes until you
reach a node with the correct fill factor or
you reach the root.

29
Deletion Example
• Original B+ tree (After inserting Key 95):

30
Deletion Example (cont.)
• Delete Record with Key 70:

31
Deletion Example (cont.)
• Delete
Record
with
Key
25:

32
Deletion Example (cont.)
• Delete
Record
with Key
60:

• The leaf containing 60 (60 65) will be below the fill factor
after the deletion. Thus, we must combine leaf nodes.
• With recombined leaves, the index will be reduced by one key.
Hence, it will also fall below the fill factor. Thus, we must
combine index nodes.
• Sixty appears as the only key in the root index node.
Obviously, it will be removed with the deletion.
33
Summary
• Multilevel Indexes
• Dynamic Multilevel Indexes
– Using B-Trees, and
– Using B+-Trees
• Indexes on Multiple Keys

CNG 351 - lecture 11 34/34

Btree Data Structure
No ratings yet
Btree Data Structure
25 pages
AbInitio String Functions
100% (3)
AbInitio String Functions
13 pages
B - Tree
No ratings yet
B - Tree
46 pages
Data Structure Lecture 7 Tree
No ratings yet
Data Structure Lecture 7 Tree
49 pages
B and B+ Tree
No ratings yet
B and B+ Tree
33 pages
Master Plan Porto Romano Bay Albania
100% (1)
Master Plan Porto Romano Bay Albania
138 pages
Unit 5
No ratings yet
Unit 5
99 pages
B Tree
No ratings yet
B Tree
53 pages
IndexedFiles Fall2023-Part2
No ratings yet
IndexedFiles Fall2023-Part2
68 pages
Indexing
No ratings yet
Indexing
77 pages
Indexing
No ratings yet
Indexing
56 pages
n3 BTrees
No ratings yet
n3 BTrees
14 pages
Index 4
No ratings yet
Index 4
40 pages
Unit-5 B+Trees & Hashing
No ratings yet
Unit-5 B+Trees & Hashing
37 pages
Unit V
No ratings yet
Unit V
55 pages
Multiway Search Trees
No ratings yet
Multiway Search Trees
40 pages
Module - 3 Advanced Data Structures: C Manasa Asst. Professor Dept. of CSE, DSCE
No ratings yet
Module - 3 Advanced Data Structures: C Manasa Asst. Professor Dept. of CSE, DSCE
54 pages
Lesson 04
No ratings yet
Lesson 04
58 pages
CNG351 Lecture 12 B
No ratings yet
CNG351 Lecture 12 B
34 pages
Ads 2 Part 3
No ratings yet
Ads 2 Part 3
60 pages
2 - Indexing Structures - Ch14
No ratings yet
2 - Indexing Structures - Ch14
50 pages
Tutorial 10 Indexing
No ratings yet
Tutorial 10 Indexing
36 pages
B Trees and B Trees
No ratings yet
B Trees and B Trees
24 pages
24-Multi-Level Indexing, Dynamic Multilevel Indexing, B-Tree-11-09-2024
No ratings yet
24-Multi-Level Indexing, Dynamic Multilevel Indexing, B-Tree-11-09-2024
40 pages
Indexing: Data Structure and Algorithm Analysis
No ratings yet
Indexing: Data Structure and Algorithm Analysis
22 pages
Multilevel Indexing and B+ Trees: CENG 351 File Structures 1
No ratings yet
Multilevel Indexing and B+ Trees: CENG 351 File Structures 1
38 pages
n04-B Trees
No ratings yet
n04-B Trees
19 pages
K 12 Grade 11 Practical Research 1 Simplified
91% (479)
K 12 Grade 11 Practical Research 1 Simplified
41 pages
LM6 - B+ Tree Index Files - B Tree Index Files
No ratings yet
LM6 - B+ Tree Index Files - B Tree Index Files
27 pages
Class 15
No ratings yet
Class 15
18 pages
Multilevel Indexing and B+ Trees
No ratings yet
Multilevel Indexing and B+ Trees
33 pages
CSE 301 Lecture-8-Indexing WT
No ratings yet
CSE 301 Lecture-8-Indexing WT
31 pages
B+ Tree
No ratings yet
B+ Tree
34 pages
Data Structures Digital Notes-141-153
No ratings yet
Data Structures Digital Notes-141-153
13 pages
Multi Last
No ratings yet
Multi Last
10 pages
Tree-Structured Indexes: Computer Science Department Columbia University
No ratings yet
Tree-Structured Indexes: Computer Science Department Columbia University
13 pages
B - Trees
No ratings yet
B - Trees
19 pages
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
No ratings yet
B Tree: Muhammad Haris Department of Computer Science M.haris@nu - Edu.pk
27 pages
B+ Tree: What Is A B+ Tree Searching Insertion Deletion
No ratings yet
B+ Tree: What Is A B+ Tree Searching Insertion Deletion
24 pages
Tree-Structured Indexes: R & G Chapter 9
No ratings yet
Tree-Structured Indexes: R & G Chapter 9
34 pages
B and B+ Tree
No ratings yet
B and B+ Tree
33 pages
Data Structures Using C, 2e Jhalak Dutta
No ratings yet
Data Structures Using C, 2e Jhalak Dutta
16 pages
B+ Trees: What Are B+ Trees Used For Whatisabtree What Is A B+ Tree Searching Insertion Deletion
No ratings yet
B+ Trees: What Are B+ Trees Used For Whatisabtree What Is A B+ Tree Searching Insertion Deletion
30 pages
B Trees and Its Variants
No ratings yet
B Trees and Its Variants
55 pages
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
No ratings yet
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
42 pages
CS143: Index: Basic Problem Random-Order File
No ratings yet
CS143: Index: Basic Problem Random-Order File
12 pages
Indexing and B+ Tress
No ratings yet
Indexing and B+ Tress
6 pages
Heartofcoaching Sample
100% (1)
Heartofcoaching Sample
19 pages
B-Trees and B+-Trees: Jay Yim CS 157B Dr. Lee
No ratings yet
B-Trees and B+-Trees: Jay Yim CS 157B Dr. Lee
34 pages
Definition of B-Trees Properties Specialization Examples 2-3 Trees Insertion of B-Tree Remove Items From B-Tree
No ratings yet
Definition of B-Trees Properties Specialization Examples 2-3 Trees Insertion of B-Tree Remove Items From B-Tree
21 pages
B+ Trees: Brian Lee CS157B Section 1 Spring 2006
No ratings yet
B+ Trees: Brian Lee CS157B Section 1 Spring 2006
28 pages
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
No ratings yet
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
26 pages
B Trees and B Trees
No ratings yet
B Trees and B Trees
34 pages
B Tree
No ratings yet
B Tree
5 pages
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
No ratings yet
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
26 pages
Indexing and Hashing: (Emphasis On B+ Trees)
No ratings yet
Indexing and Hashing: (Emphasis On B+ Trees)
23 pages
B+ Tree: What Is A B+ Tree Searching Insertion Deletion
No ratings yet
B+ Tree: What Is A B+ Tree Searching Insertion Deletion
26 pages
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
No ratings yet
B+ Tree: by Li Wen CS157B Professor: Sin-Min Lee
26 pages
DLL - FP Wk8 Day 1
No ratings yet
DLL - FP Wk8 Day 1
5 pages
ESO 207A / 211 Data Structures and Algorithms
No ratings yet
ESO 207A / 211 Data Structures and Algorithms
13 pages
COFIMCO Installation and Operation Manual
100% (2)
COFIMCO Installation and Operation Manual
11 pages
Global Market Forecast 2015-2034 PDF
No ratings yet
Global Market Forecast 2015-2034 PDF
27 pages
Testing & Commissioning of Irrigation System
No ratings yet
Testing & Commissioning of Irrigation System
13 pages
Evaluation of Gas Hydrate in Gas Pipeline Transportation
No ratings yet
Evaluation of Gas Hydrate in Gas Pipeline Transportation
107 pages
Company Law Sujith
No ratings yet
Company Law Sujith
8 pages
Parts Catalog: TJ053E-AS50
No ratings yet
Parts Catalog: TJ053E-AS50
14 pages
Aero Seal
No ratings yet
Aero Seal
14 pages
New Methos For Granulating Diamond and Powder
No ratings yet
New Methos For Granulating Diamond and Powder
2 pages
G9 DLL Q1 Week4
No ratings yet
G9 DLL Q1 Week4
3 pages
CAF 2 Tax Study Plan
No ratings yet
CAF 2 Tax Study Plan
5 pages
07820100024353
No ratings yet
07820100024353
20 pages
Chapter 2 Architectural Models
No ratings yet
Chapter 2 Architectural Models
44 pages
Pseudo Holday - Handle COVID 19 - Facebook Prophet
No ratings yet
Pseudo Holday - Handle COVID 19 - Facebook Prophet
27 pages
INSPI - Yaoure-ESIA-Appendix-34-Cultural-Heritage-Management-Plan
100% (1)
INSPI - Yaoure-ESIA-Appendix-34-Cultural-Heritage-Management-Plan
7 pages
Control of Static Electricity Work Instruction
No ratings yet
Control of Static Electricity Work Instruction
7 pages
CNG351 Lecture 10 DML Part 2
No ratings yet
CNG351 Lecture 10 DML Part 2
26 pages
Flowchart and Guidelines For Non-Degree Applications 2025 Via Google Form
No ratings yet
Flowchart and Guidelines For Non-Degree Applications 2025 Via Google Form
2 pages
CNG351 Lecture 11 Part 2
No ratings yet
CNG351 Lecture 11 Part 2
32 pages
CNG351 Lecture 12 A
No ratings yet
CNG351 Lecture 12 A
21 pages
KPCSW Report.2022
No ratings yet
KPCSW Report.2022
43 pages
Air Conditioning
No ratings yet
Air Conditioning
4 pages
1.1 Mechanical Tender Drawing For Sanwa Project (R2)
No ratings yet
1.1 Mechanical Tender Drawing For Sanwa Project (R2)
9 pages
CNG213Assignment3 Fall 2024 2025
No ratings yet
CNG213Assignment3 Fall 2024 2025
4 pages
Notice of Recurrence: U.S. Department of Labor
No ratings yet
Notice of Recurrence: U.S. Department of Labor
4 pages
Pricing Strategy
No ratings yet
Pricing Strategy
1 page
Mobile Data
No ratings yet
Mobile Data
5 pages
EMS-Motor Starter Wiring and Enclosuers
No ratings yet
EMS-Motor Starter Wiring and Enclosuers
3 pages
Data Sheet
No ratings yet
Data Sheet
3 pages
Pol Party Raz
No ratings yet
Pol Party Raz
1 page
NorthWestNet NUSIRG Internet Guide
From Everand
NorthWestNet NUSIRG Internet Guide
NorthWestNet
No ratings yet
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
From Everand
ADVANCED DATA STRUCTURES FOR ALGORITHMS: Mastering Complex Data Structures for Algorithmic Problem-Solving (2024)
VIOLET CASTRO
No ratings yet
Minecrafter Architect: Amazing Starter Homes
From Everand
Minecrafter Architect: Amazing Starter Homes
Megan Miller
4.5/5 (10)
Instruction for Using a Slide Rule
From Everand
Instruction for Using a Slide Rule
W. Stanley
No ratings yet
How to make a Kentucky stick chair
From Everand
How to make a Kentucky stick chair
Les Kenny
5/5 (1)

CNG351 Lecture 12 B

Uploaded by

CNG351 Lecture 12 B

Uploaded by

Indexing Structures for Files

CNG351 - Data Management and File Structures

CNG 351 - lecture 11 2/34

CNG 351 - lecture 11 3/34

CNG 351 - lecture 11 4/34

CNG 351 - lecture 11 5/34

CNG 351 - lecture 11 6/34

CNG 351 - lecture 11 7/34

CNG 351 - lecture 11 9/34

CNG 351 - lecture 11 10/34

CNG 351 - lecture 11 11/34

CNG 351 - lecture 11 12/34

2* 3* 5* 7* 8* 14* 15* 16*

One new child (leaf node)

Notice that root was split, leading to increase in height.

CNG 351 - lecture 11 16/34

Have we still forgot something?

2* 3* 7* 8* 14* 16* 22* 24* 29* 33* 34* 38* 39*

• Notice how 27 is copied up.

CNG 351 - lecture 11 18/34

• Observe `pull down’ of index

2* 3* 7* 8* 14* 16* 22* 27* 29* 33* 34* 38* 39*

CNG 351 - lecture 11 20/34

CNG 351 - lecture 11 21/34

CNG 351 - lecture 11 23/34

• Add Record with Key 70: 50 55 60 65 70

• Add Record with Key

• B+ trees can incorporate rotation to reduce the

CNG 351 - lecture 11 27/34

YES NO Combine the leaf and its sibling. Change

CNG 351 - lecture 11 34/34

You might also like