FS Lecture

Indexed sequential files provide both indexed and sequential access simultaneously. They use blocks to localize changes to a sequence set of records. A simple index and sequence set form a simple prefix B+ tree, where the index contains shortest record prefixes rather than full keys. Loading processes the data in a single pass to sequentially write the sorted sequence set and index blocks. B+ trees and simple prefix B+ trees provide height-balanced indexed access but differ in whether the index contains full keys or prefixes.

Uploaded by

Tanvir1987


File Structures

Indexed Sequential File Access

and Prefix B+ Trees

Indexed Sequential Access
• Up to this point, we have had to choose between
viewing a file from an indexed point of view or from a
sequential point of view.
• Here, we are looking for a single organizational method
that provides both of these views simultaneously.
• Why care about obtaining both views simultaneously? If
an application requires both interactive random access
and cosequential batch processing, both sets of actions
have to be carried out efficiently (e.g., in a student record
system at a university).

March 16 & 21, 2000 2

Maintaining a Sequence Set: The Use of Blocks I
• A sequence set is a set of records in physical key order that
stays ordered as records are added and deleted.
• Since sorting and resorting the entire sequence set as records
are added and deleted is expensive, we look at other strategies.
In particular, we look at a way to localize the changes.
• The idea is to use blocks that can be read into memory and
rearranged there quickly. As in B-Trees, blocks can be split or
merged, or their records redistributed, as necessary.
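The idea of localizing changes to one block can be sketched as follows. This is a minimal illustration rather than the lecture's implementation: `BLOCK_CAPACITY`, `insert_record`, and the in-memory list of lists standing in for disk blocks are all assumptions made for the example.

```python
import bisect

BLOCK_CAPACITY = 4  # hypothetical limit on records per block


def insert_record(blocks, key):
    """Insert key into a list of sorted blocks, splitting a block that overflows.

    blocks is a list of lists kept in logical key order (an assumption:
    a real file would link blocks on disk instead).
    """
    # Pick the first block whose last key is >= key; default to the last block.
    idx = len(blocks) - 1
    for i, block in enumerate(blocks):
        if block and key <= block[-1]:
            idx = i
            break

    # Rearranging happens inside this one block only.
    bisect.insort(blocks[idx], key)

    # On overflow, split the block in two; no other block is touched.
    if len(blocks[idx]) > BLOCK_CAPACITY:
        mid = len(blocks[idx]) // 2
        left, right = blocks[idx][:mid], blocks[idx][mid:]
        blocks[idx:idx + 1] = [left, right]
    return blocks
```

Only the block that receives the record is rearranged or split. The rest of the sequence set is never sorted again.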

Maintaining a Sequence Set: The Use of Blocks II
• Using blocks, we can thus keep a sequence set in order
by key without ever having to sort the entire set of
records.
• However, there are certain costs associated with this
approach:
– A blocked file takes up more space than an
unblocked file because of internal fragmentation.
– The order of the records is not necessarily physically
sequential throughout the file. The maximum
guaranteed extent of physical sequentiality is within a
block.
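Because physical order is only guaranteed inside a block, a sequential scan has to follow links from block to block. The sketch below assumes each block stores a `next` field holding the number of its logical successor. The dictionary layout and the name `scan_sequence_set` are hypothetical, chosen only for illustration.

```python
def scan_sequence_set(blocks, head):
    """Yield records in key order by following successor links between blocks.

    blocks maps a block number to {"records": [...], "next": number or None}.
    head is the number of the logically first block.
    """
    rbn = head
    while rbn is not None:
        block = blocks[rbn]
        # Records inside one block are physically in order.
        yield from block["records"]
        # The next logical block may sit anywhere in the file.
        rbn = block["next"]


# Block 2 was created by a split, so it sits after block 1 physically
# but between blocks 0 and 1 logically.
example_blocks = {
    0: {"records": ["ADAMS", "BAIRD"], "next": 2},
    1: {"records": ["COLE", "DAVIS"], "next": None},
    2: {"records": ["BIXBY", "BOONE"], "next": 1},
}
```

Scanning `example_blocks` from block 0 visits blocks 0, 2, 1, so the keys come out in order even though the blocks are not stored in order.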

Maintaining a Sequence Set: The Use of Blocks III
• An important aspect of using blocks is the choice
of a block size. There are 2 considerations to keep
in mind when choosing a block size:
– The block size should be such that we can hold
several blocks in memory at once
– The block size should be such that we can
access a block without having to bear the cost
of a disk seek within the block read or block
write operation.

Adding a Simple Index to the Sequence Set
• Each of the blocks we created for our Sequence Set
contains a range of records that might contain the
record we are seeking.
• We can construct a simple single-level index for these
blocks.
• The combination of this kind of index with the
sequence set of blocks provides complete indexed
sequential access. This method works well as long as
the entire index can be held in memory.
• If the entire index cannot be held in memory, then we
can use a B+ Tree, which is a B-Tree index plus a
sequence set that holds the records.
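A single-level index held in memory can be searched with a binary search. The sketch below assumes the index stores the highest key of each block, which is one common layout and not necessarily the one used in the lecture. The names `find_block` and `lookup` are hypothetical.

```python
import bisect


def find_block(index_keys, key):
    """Return the position of the block that may hold key.

    index_keys[i] is assumed to be the highest key stored in block i,
    and the list is in ascending order.
    """
    i = bisect.bisect_left(index_keys, key)
    # Keys beyond the last index entry can only belong in the last block.
    return min(i, len(index_keys) - 1)


def lookup(blocks, index_keys, key):
    """Indexed access: one index search, then one block read."""
    return key in blocks[find_block(index_keys, key)]
```

Indexed access uses `find_block`. Sequential access ignores the index and reads the blocks in order.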

The Content of the Index: Separators Instead of Keys
• The index serves as a kind of road map for the
sequence set ==> We do not need to have keys in
the index set.
• What we really need are separators capable of
distinguishing between two blocks.
• We can save space by using variable-length
separators and placing the shortest separator in the
index structure.
• Rules are:
– Key < separator ==> Go left.
– Key = separator ==> Go right.
– Key > separator ==> Go right.
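The separator rules above can be sketched in a few lines. The function names `shortest_separator` and `choose_child` are hypothetical, and keys are assumed to be distinct strings compared in plain lexicographic order.

```python
import bisect


def shortest_separator(last_key_left, first_key_right):
    """Return the shortest prefix s of first_key_right such that
    last_key_left < s <= first_key_right."""
    for n in range(1, len(first_key_right) + 1):
        prefix = first_key_right[:n]
        if prefix > last_key_left:
            return prefix
    raise ValueError("keys must be distinct and in ascending order")


def choose_child(separators, key):
    """Return the index of the branch to follow in a sorted separator list.

    bisect_right counts the separators that are <= key, so a key equal to
    a separator goes right and a key smaller than it goes left.
    """
    return bisect.bisect_right(separators, key)
```

For example, the shortest separator between a block ending in BOONE and a block starting with CAMP is the single letter C, while CARSON and CARTER need the four-letter prefix CART.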

The Simple Prefix B+ Tree
• The separators we just identified can be formed
into a B-Tree index of the sequence set blocks. This
B-Tree index is called the index set.
• Taken together with the sequence set, the index set
forms a file structure called a simple prefix B+
Tree.
• “Simple prefix” indicates that the index set
contains shortest separators, or prefixes of the
keys, rather than copies of the actual keys.

Simple Prefix B+ Tree Maintenance
• Changes localized to single blocks in the sequence set:
Make the changes within the affected sequence set block. The index set
is unaffected, because the existing separators still distinguish the blocks.
• Changes involving multiple blocks in the sequence set:
– If blocks are split in the sequence set, a new separator
must be inserted into the index set
– If blocks are merged in the sequence set, a separator
must be removed from the index set.
– If records are re-distributed between blocks in the
sequence set, the value of a separator in the index set
must be changed.
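The three multi-block cases can be sketched with a flat list of separators standing in for the index set, where `separators[i]` sits between `blocks[i]` and `blocks[i + 1]`. This flat layout and the function names are assumptions for illustration. A real index set would itself be a B-Tree.

```python
def shortest_separator(last_key_left, first_key_right):
    """Shortest prefix of first_key_right that is greater than last_key_left."""
    for n in range(1, len(first_key_right) + 1):
        if first_key_right[:n] > last_key_left:
            return first_key_right[:n]
    raise ValueError("keys must be distinct and in ascending order")


def split_block(blocks, separators, i):
    """Split block i in two and insert a new separator into the index."""
    mid = len(blocks[i]) // 2
    left, right = blocks[i][:mid], blocks[i][mid:]
    blocks[i:i + 1] = [left, right]
    separators.insert(i, shortest_separator(left[-1], right[0]))


def merge_blocks(blocks, separators, i):
    """Merge block i + 1 into block i and remove the separator between them."""
    blocks[i:i + 2] = [blocks[i] + blocks[i + 1]]
    del separators[i]


def redistribute(blocks, separators, i):
    """Even out blocks i and i + 1 and change the separator between them."""
    combined = blocks[i] + blocks[i + 1]
    mid = len(combined) // 2
    blocks[i], blocks[i + 1] = combined[:mid], combined[mid:]
    separators[i] = shortest_separator(blocks[i][-1], blocks[i + 1][0])
```

Each function touches exactly one separator, mirroring the insert, remove, and change cases listed above.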

Index Set Block Size
• The physical size of a node for the index set is usually the same
as the physical size of a block in the sequence set. We, then,
speak of index set blocks, rather than nodes.
• There are a number of reasons for using a common block size for
the index and sequence sets:
– The block size for the sequence set is usually chosen because
there is a good fit among this block size, the characteristics of
the disk drive, and the amount of memory available.
– A common block size makes it easier to implement a
buffering scheme to create a virtual simple prefix B+ Tree.
– The index set blocks and sequence set blocks are often
mingled within the same file to avoid seeking between two
separate files while accessing the simple prefix B+ Tree.

Internal Structure of Index Set Blocks: A Variable-Order B-Tree
• Given a large, fixed-size block for the index set, how
do we store the separators within it?
• There are many ways to combine the list of
separators, the index to separators, and the list of
Relative Block Numbers (RBNs) into a single index
set block.
• One possible approach includes a separator count
and keeps a count of the total length of separators.
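One way to picture such a block is shown below. The byte layout (two 16-bit counts, the concatenated separators, a table of 16-bit offsets, then 32-bit RBNs) is an assumption invented for this sketch, not the layout from the lecture. The function names are likewise hypothetical.

```python
import struct


def pack_index_block(separators, rbns):
    """Pack separators and RBNs into bytes.

    Assumed layout: separator count, total separator length,
    separator text, offset of each separator, then the RBNs.
    """
    if len(rbns) != len(separators) + 1:
        raise ValueError("need one more RBN than separators")
    encoded = [s.encode("ascii") for s in separators]
    text = b"".join(encoded)
    offsets, pos = [], 0
    for e in encoded:
        offsets.append(pos)
        pos += len(e)
    header = struct.pack("<HH", len(separators), len(text))
    index = struct.pack(f"<{len(offsets)}H", *offsets)
    children = struct.pack(f"<{len(rbns)}I", *rbns)
    return header + text + index + children


def unpack_index_block(data):
    """Recover (separators, rbns) from bytes written by pack_index_block."""
    count, total = struct.unpack_from("<HH", data, 0)
    text = data[4:4 + total]
    offsets = struct.unpack_from(f"<{count}H", data, 4 + total)
    rbns = struct.unpack_from(f"<{count + 1}I", data, 4 + total + 2 * count)
    # Each separator runs from its offset to the next one (or to the end).
    ends = list(offsets[1:]) + [total]
    separators = [text[a:b].decode("ascii") for a, b in zip(offsets, ends)]
    return separators, list(rbns)
```

Because the separators vary in length, the number that fit in a block also varies, which is why the index set behaves like a B-Tree of variable order. The offset table still allows a binary search over the separators.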


Loading a Simple Prefix B+ Tree I
• Loading by successive insertions is not a good method because splitting
and redistribution are relatively expensive and are best reserved
for tree maintenance.
• Starting from a sorted file, however, we can place the records
into sequence set blocks one by one, starting a new block when
the one we are working with fills up. As we make the transition
between two sequence set blocks, we can determine the
shortest separator for the blocks. We can collect these
separators into an index set block that we build and hold in
memory until it is full.
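The loading pass described above can be sketched as a single loop over the sorted keys. For brevity this sketch collects all separators in one list instead of writing out index set blocks as they fill. The names `load_sequence_set` and `block_capacity` are hypothetical.

```python
def shortest_separator(last_key_left, first_key_right):
    """Shortest prefix of first_key_right that is greater than last_key_left."""
    for n in range(1, len(first_key_right) + 1):
        if first_key_right[:n] > last_key_left:
            return first_key_right[:n]
    raise ValueError("keys must be distinct and in ascending order")


def load_sequence_set(sorted_keys, block_capacity):
    """Build sequence set blocks and their separators in one pass."""
    blocks, separators = [], []
    current = []
    for key in sorted_keys:
        if len(current) == block_capacity:
            # Transition between two blocks: emit the full block and
            # record the shortest separator between it and the next one.
            blocks.append(current)
            separators.append(shortest_separator(current[-1], key))
            current = []
        current.append(key)
    if current:
        blocks.append(current)  # the last block may be only partly full
    return blocks, separators
```

Every block except possibly the last is written once, completely full, and never reorganized.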


Loading a Simple Prefix B+ Tree II: Advantages
• The advantages of loading a simple prefix B+ Tree sequentially almost always
outweigh the disadvantages associated with the possibility of
creating blocks that contain too few records or too few separators.
• A particular advantage is that the loading process goes more
quickly because:
– The output can be written sequentially;
– We make only one pass over the data;
– No blocks need to be reorganized as we proceed.
• Advantages after the tree is loaded
– The blocks are 100% full.
– Sequential loading creates a degree of spatial locality within our
file ==> Seeking can be minimized.


B+ Trees
• The difference between a simple prefix B+ Tree and a plain B+ Tree
is that the plain B+ Tree does not involve the use of prefixes as
separators. Instead, the separators in the index set are simply copies
of the actual keys.
• Simple prefix B+ Trees are often more desirable than plain B+ Trees
because the prefix separators take up less space than the full keys.
• B+ Trees, however, are sometimes more desirable since 1) they do
not need variable-length separator fields and 2) some key sets
do not compress well.
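The space trade-off can be made concrete by counting separator characters under both schemes. In this sketch the plain B+ Tree is assumed to copy the first key of the right-hand block as its separator. That choice, the sample keys, and the name `separator_chars` are illustrative assumptions.

```python
def shortest_separator(last_key_left, first_key_right):
    """Shortest prefix of first_key_right that is greater than last_key_left."""
    for n in range(1, len(first_key_right) + 1):
        if first_key_right[:n] > last_key_left:
            return first_key_right[:n]
    raise ValueError("keys must be distinct and in ascending order")


def separator_chars(blocks, use_prefixes):
    """Total characters needed for the separators between adjacent blocks."""
    total = 0
    for left, right in zip(blocks, blocks[1:]):
        if use_prefixes:
            sep = shortest_separator(left[-1], right[0])
        else:
            sep = right[0]  # plain B+ Tree: a copy of an actual key
        total += len(sep)
    return total


names = [["ADAMS", "BOONE"], ["BYNUM", "COLE"], ["DAVIS", "EMBRY"]]
codes = [["ITEM0001", "ITEM0002"], ["ITEM0003", "ITEM0004"]]
```

With `names`, prefixes need 3 characters against 10 for full keys. With `codes`, where adjacent keys differ only in the last character, both schemes need 8, so the variable-length machinery buys nothing.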


B-Trees, B+ Trees and Simple Prefix B+ Trees in Perspective I
• B and B+ Trees are not the only tools useful for file structure design.
Simple indexes are useful when they can be held entirely in memory, and
hashing can provide much faster access than B and B+ Trees.
• Common Characteristics of B and B+ and Prefix B+ Trees:
– Paged Index Structures ==> Broad and shallow trees
– Height-Balanced Trees
– The trees are grown Bottom Up and the operations used are: block
splitting, merging and re-distribution
– Two-to-Three Splitting and redistribution can be used to obtain
greater storage efficiency.
– Can be implemented as Virtual Tree Structures.
– Can be adapted for use with variable-length records.


B-Trees, B+ Trees and Simple Prefix B+ Trees in Perspective II
Differences between the various structures:
• B-Trees: multi-level indexes to data files that are entry-sequenced.
Strengths: simplicity of implementation. Weaknesses: excessive
seeking necessary for sequential access.
• B-Trees with Associated Information: These are B-Trees that
contain record contents at every level of the B-Tree. Strengths: can
save space. Weaknesses: works only when the record
information is located within the B-Tree; otherwise, too much
seeking is involved in retrieving the record information.


B-Trees, B+ Trees and Simple Prefix B+ Trees in Perspective III
Differences between the various structures (Cont’d):
• B+ Trees: In a B+ Tree all the key and record info is contained in a
linked set of blocks known as the sequence set. Indexed access is
provided through the Index Set. Advantages over B-Trees: 1) The
sequence set can be processed in a truly linear, sequential way; 2) The
index is built with a single key or separator per block of data records
rather than with one key per data record. ==> index is smaller and hence
shallower.
• Simple Prefix B+ Trees: The separators in the index set are smaller than
the keys in the sequence set ==> Tree is even smaller.
