0% found this document useful (0 votes)

10 views83 pages

Chapter - 3 - Indexing Structures For Files

Chapter 3 discusses various indexing structures for files, including single-level ordered indexes, multilevel indexes, and dynamic multilevel indexes using B-Trees and B+-Trees. It explains the characteristics of primary, clustering, and secondary indexes, as well as their efficiency in searching records. The chapter also highlights the advantages of using multilevel indexes to improve search performance and the challenges associated with insertion and deletion in these structures.

Uploaded by

Mạnh Cường Nguyễn Văn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views83 pages

Chapter - 3 - Indexing Structures For Files

Uploaded by

Mạnh Cường Nguyễn Văn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 83

Chapter 3

Indexing Structures for Files

Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

2
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

3
Single-level index introduction
◼ A single-level index is an auxiliary file that
makes it more efficient to search for a record in
the data file.
◼ The index is usually specified on one field of the
file (although it could be specified on several
fields).
◼ One form of an index is a file of entries <field
value, pointer to record>, which is ordered by
field value.
◼ The index is called an access path on the field.

4
Single-level index introduction (cont.)
◼ The index file usually occupies considerably less
disk blocks than the data file because its entries
are much smaller.
◼ A binary search on the index yields a pointer to
the file record.
◼ Indexes can also be characterized as dense or
sparse:
❑ A dense index has an index entry for every search key
value (and hence every record) in the data file.
❑ A sparse (or nondense) index, on the other hand, has
index entries for only some of the search values

5
Example 1
Given the following data file:
EMPLOYEE(NAME, SSN, ADDRESS, JOB, SAL, ... )
Suppose that:
◼ Record size R = 150 bytes, block size B = 512 bytes, r = 30.000 records
◼ SSN Field size VSSN = 9 bytes, record pointer size PR = 7 bytes
Then, we get:
◼ Blocking factor: bfr = B/R = 512/150 = 3 records/block
◼ Number of blocks needed for the file: b= r/bfr = 30.000/3 = 10.000 blocks

For an dense index on the SSN field:

◼ Index entry size: Ri = (VSSN+ PR) = (9+7) = 16 bytes
◼ Index blocking factor bfri = B/RI = 512/16 = 32 entries/block
◼ Number of blocks for index file: b i = r/bfri = (30000/32)= 938 blocks
◼ Search for and retrieve a record needs: log2bi  + 1 = log2938  + 1 = 11
block accesses

◼ This is compared to an average linear search cost of:

(b/2)= 10000/2 = 5000 block accesses
◼ If the file records are ordered, the binary search cost would be:
 log2b  =  log210000  = 14 block accesses
6
Types of Single-level Ordered Indexes

◼ Primary Indexes

◼ Clustering Indexes

◼ Secondary Indexes

7
Primary Index

◼ Defined on an ordered data file.

❑ The data file is ordered on a key field.

◼ One index entry for each block in the data file

❑ First record in the block, which is called the block anchor

◼ A similar scheme can use the last record in a block.

8
Primary key field Data file

ID Name DoB Salary Sex

1
2
Index file 3
(<K(i), P(i)> entries)
4
Primary Block
key value pointer 6

1 7

4 8
8 9
12 10

12
13
15

9
Primary Index

◼ Number of index entries?

❑ Number of blocks in data file.

◼ Dense or Nondense?
❑ Nondense

◼ Search/ Insert/ Update/ Delete?

10
Clustering Index

◼ Defined on an ordered data file.

❑ The data file is ordered on a non-key field.

◼ One index entry each distinct value of the field.

❑ The index entry points to the first data block that
contains records with that field value

11
Clustering field Data file

Dept_No Name DoB Salary Sex

1
1
Index file 2
(<K(i), P(i)> entries)
2
Clustering Block
field value pointer 2
1 2
2
2
3
3
4
3
5
4
4
5

12
Dept_No Name DoB Salary Sex
Clustering field
1
1

2
2
Index file
2
(<K(i), P(i)> entries)
2
Clustering Block 2
field value pointer
1
3
2
3
3
4
4
5
4

Data file 13
Clustering Index

◼ Number of index entries?

❑ Number of distinct indexing field values in data file .

◼ Dense or Nondense?
❑ Nondense

◼ Search/ Insert/ Update/ Delete?

◼ At most one primary index or one clustering
index but not both.

14
Secondary index
◼ A secondary index provides a secondary means of
accessing a file.
❑ The data file is unordered on indexing field.
◼ Indexing field:
❑ secondary key (unique value)
❑ nonkey (duplicate values)

◼ The index is an ordered file with two fields:

❑ The first field: indexing field.
❑ The second field: block pointer or record pointer.

◼ There can be many secondary indexes for the same file.

15
Index file Secondary
(<K(i), P(i)> entries) key field Data file

Index field Block 5

value pointer
13
3
8
4
5 6
6 15
8 3
9
9
11
21
13 … 11
15
18 4
21 23
23 18

Secondary index on key field

16
Secondary index on key field

◼ Number of index entries?

❑ Number of record in data file

◼ Dense or Nondense?
❑ Dense

◼ Search/ Insert/ Update/ Delete?

17
Secondary index on non-key field
◼ Discussion: Structure of Secondary index on non-
key field?
◼ Option 1: include duplicate index entries with the
same K(i) value - one for each record.
◼ Option 2: keep a list of pointers <P(i, 1), ..., P(i, k)>
in the index entry for K(i).
◼ Option 3:
❑ more commonly used.
❑ one entry for each distinct index field value + an extra
level of indirection to handle the multiple pointers.

18
Blocks of record pointers Indexing field Data file

Dept Name DoB Job Sex

_No

…
3
Index file 5
(<K(i), P(i)> entries) 1

…
Field Block
2
value pointer
3
4
…
1
2 3
3 3
…

4 1
…

5 5
1
…

Secondary Index on non-key field: option 3

Secondary index on nonkey field

◼ Number of index entries?

❑ Number of records in data file
❑ Number of distinct index field values

◼ Dense or Nondense?
❑ Dense/ nondense

◼ Search/ Insert/ Update/ Delete?

20
Summary of Single-level indexes

◼ Ordered file on indexing field?

❑ Primary index
❑ Clustering index
◼ Indexing field is Key?
❑ Primary index
❑ Secondary index
◼ Indexing field is not Key?
❑ Clustering index
❑ Secondary index
21
Summary of Single-level indexes

◼ Dense index?
❑ Secondary index

◼ Nondense index?
❑ Primary index
❑ Clustering index
❑ Secondary index

22
Summary of Single-level indexes

23
Example 2
Given the following data file:
EMPLOYEE(NAME, SSN, ADDRESS, JOB, SAL, ... )
Suppose that:
◼ Record size R = 150 bytes, block size B = 512 bytes, r = 30.000 records
◼ SSN Field size VSSN = 9 bytes, block pointer size P = 6 bytes
Then, we get:
◼ Blocking factor: bfr = B/R = 512/150 = 3 records/block
◼ Number of blocks needed for the file: b = r/bfr = 30.000/3 = 10.000 blocks

For a primary index on the ordering key field SSN:

◼ Index entry size: Ri = (VSSN+ P) = (9+6) = 15 bytes
◼ Index blocking factor bfri= B/Ri = 512/15 = 34 entries/block
◼ Number of blocks for index file: b i= b/bfri = 10000/34 = 295 blocks
◼ Search for and retrieve a record needs: log2bi  + 1 = log2 295  + 1 = 10
block accesses

◼ This is compared to a dense index cost of: 11 block accesses

24
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

25
Multi-Level Indexes
◼ Because a single-level index is an ordered file, we
can create a primary index to the index itself.
❑ The original index file is called the first-level index and the
index to the index is called the second-level index.
◼ We can repeat the process, creating a third, fourth,
..., top level until all entries of the top level fit in
one disk block.
◼ A multi-level index can be created for any type of
first-level index (primary, secondary, clustering) as
long as the first-level index consists of more than
one disk block.

26
A two-level primary
index resembling
ISAM (Indexed
Sequential Access
Method)
organization.

27
Example 3
Given the following data file:
EMPLOYEE(NAME, SSN, ADDRESS, JOB, SAL, ... )
Suppose that:
◼ Record size R=150 bytes, block size B=512 bytes, r=30000 records
◼ SSN Field size VSSN=9 bytes, block pointer size P=6 bytes
Then, we get:
◼ Blocking factor: bfr= B/R = 512/150 = 3 records/block
◼ Number of blocks needed for the file: b= r/bfr= 30000/3  = 10000 blocks
For a primary index on the ordering key field SSN (Example 2):
◼ Index entry size: Ri=(VSSN+ P)=(9+6)=15 bytes
◼ Index blocking factor bfri= B/Ri = 512/15 = 34 entries/block
◼ Number of blocks for index file: b i= b/bfri = 10000/34  = 295 blocks
◼ Search for and retrieve a record needs: log2bi  + 1 = log2295  + 1 = 10 block
accesses
For a multilevel index on the ordering key field SSN:
◼ Index blocking factor bfri= B/Ri = 512/15 = 34 entries/block
o This is the fan-out fo of the multilevel index.
◼ Number of 1st level index blocks: b1 = 295 blocks
◼ Number of 2nd level index blocks: b2 =  b1 / fo =  295 / 34 = 9 blocks
◼ Number of 3th level index blocks: b3 =  b2 / fo =  9 / 34 = 1 block → top level
◼ Number of level of this multilevel index: x = 3 levels
◼ Search for and retrieve a record needs: x + 1 = 4 blocks
28
31
32
33
34
35
36
37
Multi-Level Indexes

◼ Such a multi-level index is a form of search

tree.
◼ However, insertion and deletion of new index
entries is a severe problem because every
level of the index is an ordered file.

38
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees
3
and B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

39
Dynamic Multilevel Indexes Using B-
Trees and B+-Trees
◼ Most multi-level indexes use B-tree or B+-tree data
structures because of the insertion and deletion
problem.
❑ This leaves space in each tree node (disk block) to allow
for new index entries
◼ These data structures are variations of search trees
that allow efficient insertion and deletion of new
search values.
◼ In B-Tree and B+-Tree data structures, each node
corresponds to a disk block.
◼ Each node is kept between half-full and completely
full.
40
Dynamic Multilevel Indexes Using B-
Trees and B+-Trees (cont.)
◼ An insertion into a node that is not full is quite
efficient.
❑ If a node is full, the insertion causes a split into
two nodes.
◼ Splitting may propagate to other tree levels.
◼ A deletion is quite efficient if a node does not
become less than half full.
◼ If a deletion causes a node to become less than
half full, it must be merged with neighboring
nodes.
41
Difference between B-tree and B+-tree

◼ In a B-Tree, pointers to data records exist at

all levels of the tree.
◼ In a B+-Tree, all pointers to data records exist
at the leaf-level nodes.
◼ A B+-Tree can have less levels (or higher
capacity of search values) than the
corresponding B-tree.

42
B-tree Structures

43
The Nodes of a B+-Tree

44
The Nodes of a B+-Tree (cont.)

45
Example 4: Calculate the order of a B-tree
◼ Suppose that:
❑ Search field V = 9 bytes, disk block size B = 512 bytes
❑ Record (data) pointer Pt = 7 bytes, block pointer is P = 6 bytes.
◼ Each B-tree node can have at most p tree pointers, p – 1
data pointers, and p – 1 search key field values.
◼ These must fit into a single disk block if each B-tree node is to
correspond to a disk block:
(p*P) + ((p-1)*(Pt+V))  B
 (p*6) + ((p-1)*(7+9))  512
 (22*p)  528
◼ We can choose to be a large value that satisfies the above
inequality, which gives p = 23 (p = 24 is not chosen because
of additional information).

46
Example 5: Calculate approximate number
of entries of a B-tree
◼ Suppose that:
❑ Search field of Example 3 is a non-ordering key field, and we construct a B-Tree on
this field.
❑ Each node of the B-tree is 69 percent full.
◼ Each node, on the average, will have: p * 0.69 = 23 * 0.69 = 15.87 ≈ 16
pointers → 15 search key field values.
◼ The average fan-out fo = 16. We can start at the root and see how many
values and pointers can exist, on the average, at each subsequent level:
Level Nodes Index entries Pointers
Root: 1 node 15 entries 16 pointers
Level 1: 16 nodes 240 entries 256 pointers
Level 2: 256 nodes 3840 entries 4096 pointers
Level 3: 4096 nodes 61,440 entries
◼ At each level, number of entries = the total number of pointers at the
previous level * the average number of entries in each node.
◼ A two-level B-tree holds 3840+240+15 = 4095 entries on the average; a
three-level B-tree holds 65,535 entries on the average. 47
Example 6: Calculate the order of a B+-tree
◼ Suppose that:
❑ Search key field V=9 bytes, block size B=512bytes
❑ Record pointer is Pr = 7bytes, block pointer is P = 6bytes.
◼ An internal node of the B+-tree can have up to p tree pointers and p-
1 search field values; these must fit into a single block. Hence, we
have:
(p*P) + ((p-1)*V)  B
 (p*6) + ((p-1)*9)  512

 15*p  512

◼ We can choose p to be the largest value satisfying the above

inequality, which give p = 34.
◼ This is larger than the value of 23 for the B-Tree, resulting in a larger
fan-out and more entries in each internal node of a B+-Tree than in
the corresponding B-Tree.

48
Example 6: Calculate the order of a B+-tree
(cont.)
◼ The leaf nodes of B+-tree will have the same number of
values and pointers, except that the pointers are data
pointers and a next pointer. Hence, the order pleaf for the
leaf nodes can be calculated as follows:
(pleaf * (Pt+V))+P  B
 (pleaf * (7+9))+6  512
 (16 * pleaf)  506
◼ If follows that each leaf node hold up to pleaf = 31 key
value/data pointer combinations, assuming that the data
pointers are record pointers.

49
Example 7: Calculate approximate number
of entries of a B+-tree
◼ Suppose that we construct a B+-Tree on the field of Example 6:
❑ Search key field V = 9 bytes, block size B = 512bytes
❑ Record pointer is Pr = 7bytes, block pointer is P = 6bytes.
❑ Each node is 69 percent full.
◼ On the average, each internal node will be have 34*0.69 ≈ 23.46 or
approximately 23 pointers, and hence 22 values.
◼ Each leaf node, on the average, will hold 0.69*pleaf = 0.69*31 ≈ 21.39 or
approximately 21 data record pointers.
◼ A B+-tree will have the following average number of entries at each level:
Level Nodes Index entries Pointers
Root 1 nodes 22 entries 23 pointers
Level 1 23 23*22 = 506 232=529 pointers
Level 2 529 529*22 = 11,638 233=12,167 pointers
Leaf level 12,167 12,167 *21 = 255,507
◼ A 3-level B+-tree holds up to 255,507 record pointers, on the average.
◼ Compare this to the 65,535 entries for corresponding B-tree in Example 4.
50
B+-Tree: Insert entry

51
B+-Tree: Insert entry (cont.)

52
Example of insertion in B+-tree

p = 3 and pleaf = 2

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

53
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

54
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

55
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

56
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

57
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

58
Example of insertion in B+-tree (cont.)

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

59
B+-Tree: Delete entry
◼ Remove the entry from the leaf node.
◼ If it happens to occur in an internal node:
❑ Remove.
❑ The value to its left in the leaf node must replace it in the internal
node.
◼ Deletion may cause underflow in leaf node:
❑ Try to find a sibling leaf node – a leaf node directly to the left or to
the right of the node with underflow.
❑ Redistribute the entries among the node and its siblings.
(Common method: The left sibling first and the right sibling later)
❑ If redistribution fails, the node is merged with its sibling.
❑ If merge occurred, must delete entry (pointing to node and
sibling) from parent node.

60
B+-Tree: Delete entry (cont.)

◼ If an internal node is underflow:

❑ Redistribute the entries among the node, its siblings and
entry pointing to node and sibling of parent node .
❑ If redistribution fails, the node is merged with its sibling and
the entry pointing to node and sibling of parent node .
❑ If merge occurred, must delete entry pointing to node and
sibling from parent node.
❑ If the root node is empty → the merged node becomes the
new root node.
◼ Merge could propagate to root, reduce the tree
levels.

61
Example of deletion from B+-tree

p = 3 and pleaf = 2.

Deletion sequence: 5, 12, 9

Delete 5

62
Example of deletion from B+-tree (cont.)
P = 3 and pleaf = 2.

Deletion sequence: 5, 12, 9

Delete 12: underflow

(redistribute)

63
Example of deletion from B+-tree (cont.)
p = 3 and pleaf = 2.

Deletion sequence: 5, 12, 9

Delete 9:
Underflow (merge with left, redistribute)

64
Example of deletion from B+-tree (cont.)
p = 3 and pleaf = 2.

Deletion sequence: 5, 12, 9

65
Search using B-trees and B+-trees
K=8
5<8

7< 8 <= 8

found

66
Search using B-trees and B+-trees
◼ Search conditions on indexing attributes
❑ =, <, >, ≤, ≥, between, MINIMUM value, MAXIMUM
value
◼ Search results
❑ Zero, one, or many data records
◼ Search cost
❑ B-trees
◼ From 1 to (1 + the number of tree levels) + data accesses
❑ B+-trees
◼ 1 (root level) + the number of tree levels + data accesses

◼ Logically ordering for a data file

67
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

69
Indexes on Multiple Keys
◼ In many retrieval and update requests, multiple
attributes are involved.
◼ If a certain combination of attributes is used
frequently, it is advantageous to set up an access
structure to provide efficient access by a key value
that is a combination of those attributes.
◼ If an index is created on attributes <A1, A2, … , An>,
the search key values are tuples with n values: <v1,
v2, … , vn>.
◼ A lexicographic ordering of these tuple values
establishes an order on this composite search key.
◼ An index on a composite key of n attributes works
similarly to any index discussed so far.

70
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

71
Other File Indexes
◼ Hash indexes
❑ The hash index is a secondary structure to access the
file by using hashing on a search key other than the one
used for the primary data file organization.
◼ Bitmap indexes
❑ A bitmap index is built on one particular value of a
field (the column in a table) with respect to all the rows
(records) and is an array of bits.
◼ Function-based indexes
❑ In Oracle, an index such that the value that results from
applying a function (expression) on a field or some fields
becomes the key to the index

72
Other File Indexes

◼ Hash indexes
❑ The hash index is a secondary structure to
access the file by using hashing on a search
key other than the one used for the primary
data file organization.
◼ access structures similar to indexes, based on
hashing
❑ Support for equality searches on the hash
field

73
Hash indexes

◼ The hash index is a secondary

structure to access the file by using
hashing on a search key other than the
one used for the primary data file
organization.
❑ access structures similar to indexes, based
on hashing
◼ Support for equality searches on the
hash field

74
hashing
function:
the sum of
the digits
of Emp_id
modulo 10

75
Bitmap indexes
◼ A bitmap index is built on one particular value
of a field (the column in a table) with respect to
all the rows (records) and is an array of bits.
❑ Each bit in the bitmap corresponds to a row. If the bit is
set, then the row contains the key value.
◼ In a bitmap index, each indexing field value is
associated with pointers to multiple rows.
◼ Bitmap indexes are primarily designed for data
warehousing or environments in which queries
reference many columns in an ad hoc fashion.
❑ The number of distinct values of the indexed field is
small compared to the number of rows.
❑ The indexed table is either read-only or not subject to
significant modification by DML statements.
76
Bitmap indexes

77
Bitmap indexes

78
Function-based indexes
◼ The use of any function on a column prevents the
index defined on that column from being used.
❑ Indexes are only used with some specific search
conditions on indexed columns.

◼ In Oracle, a function-based index is an index

such that the value that results from applying
some function (expression) on a field or a
collection of fields becomes the key to the index.
❑ A function-based index can be either a B-tree or a
bitmap index.

79
Function-based indexes

80
Contents

1 Single-level Ordered Indexes

2 Multilevel Indexes
Dynamic Multilevel Indexes Using B-Trees and
3
B+-Trees
4 Indexes on Multiple Keys
5 Other File Indexes
6 Indexes in Today‘s DBMSs

81
Index Creation
CREATE [ UNIQUE ] INDEX <index name>
ON <table name> ( <column name> [ <order> ] { , <column name> [ <order> ] } )
[ CLUSTER ] ;

◼ UNIQUE is used to guarantee that no two rows of a table

have duplicate values in the key column or column.
◼ CLUSTER is used when the index to be created should also
sort the data file records on the indexing attribute.

CREATE INDEX DnoIndex ON EMPLOYEE (Dno)

CLUSTER ;

82
B-tree index in Oracle 19c

83
B-tree for a clustered index in MS
SQL Server

84
Review questions
1) Define the following terms: indexing field, primary key field, clustering
field, secondary key field, block anchor, dense index, and nondense
(sparse) index.
2) What are the differences among primary, secondary, and clustering
indexes? How do these differences affect the ways in which these
indexes are implemented? Which of the indexes are dense, and which
are not?
3) Why can we have at most one primary or clustering index on a file, but
several secondary indexes?
4) How does multilevel indexing improve the efficiency of searching an
index file?
5) What is the order p of a B-tree? Describe the structure of B-tree nodes.
6) What is the order p of a B+-tree? Describe the structure of both internal
and leaf nodes of a B+-tree.
7) How does a B-tree differ from a B+-tree? Why is a B+-tree usually
preferred as an access structure to a data file?

85
86

DAMA-DMBOK: Data Management Body of Knowledge: 2nd Edition. ISBN 1634622340, 978-1634622349
88% (34)
DAMA-DMBOK: Data Management Body of Knowledge: 2nd Edition. ISBN 1634622340, 978-1634622349
23 pages
CS6010 - SOCIAL NETWORK ANALYSIS - Unit 1 Notes
67% (6)
CS6010 - SOCIAL NETWORK ANALYSIS - Unit 1 Notes
25 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
Lec06-Indexing in Dbms
No ratings yet
Lec06-Indexing in Dbms
21 pages
Ch17Notes Indexing Structures For Files
No ratings yet
Ch17Notes Indexing Structures For Files
39 pages
Data Science Tutorial Library - 370+ Free Tutorials
100% (1)
Data Science Tutorial Library - 370+ Free Tutorials
14 pages
File Organization
No ratings yet
File Organization
41 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
Chapter - 2 - Revision
No ratings yet
Chapter - 2 - Revision
26 pages
Hashing & Indexing
No ratings yet
Hashing & Indexing
69 pages
Screenshot 2025-03-12 at 9.41.04 AM
No ratings yet
Screenshot 2025-03-12 at 9.41.04 AM
41 pages
Index 2
No ratings yet
Index 2
24 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
Week 15 Physical Database Design Index - CH 17 Updated
No ratings yet
Week 15 Physical Database Design Index - CH 17 Updated
35 pages
Indexing
No ratings yet
Indexing
89 pages
20-M4-File Organization - Single Level Indexing-09-09-2024
No ratings yet
20-M4-File Organization - Single Level Indexing-09-09-2024
28 pages
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
No ratings yet
Lecture-13 Indexing and Its Types: Subject: DBMS Subject Code: BCA-S301T Faculty: Saurabh Jha
16 pages
Unit5 File Organization
No ratings yet
Unit5 File Organization
112 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
Index 1
No ratings yet
Index 1
25 pages
Indexing
No ratings yet
Indexing
27 pages
Indexing
No ratings yet
Indexing
53 pages
08 File Handling
No ratings yet
08 File Handling
18 pages
CO3-Session-09 & 10
No ratings yet
CO3-Session-09 & 10
41 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
SingleLevelIndexing Examples
No ratings yet
SingleLevelIndexing Examples
24 pages
Lec 09
No ratings yet
Lec 09
52 pages
Lec 20-24
No ratings yet
Lec 20-24
91 pages
File Org & Indexing - DPP 02
No ratings yet
File Org & Indexing - DPP 02
5 pages
Lec20Indexing v1
No ratings yet
Lec20Indexing v1
57 pages
Indexing
No ratings yet
Indexing
41 pages
Indexing Structures: Professor Navneet Goyal Department of Computer Science & Information Systems BITS, Pilani
No ratings yet
Indexing Structures: Professor Navneet Goyal Department of Computer Science & Information Systems BITS, Pilani
87 pages
Chapter 3 File Organization Indexed Methods
No ratings yet
Chapter 3 File Organization Indexed Methods
31 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
25 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
File Organization and Indexing
No ratings yet
File Organization and Indexing
38 pages
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE302L TH VL2024250101553 2024-09-02 Reference-Material-I
48 pages
DBMS1 Week 4
No ratings yet
DBMS1 Week 4
14 pages
Unit 4 - Indexing
No ratings yet
Unit 4 - Indexing
8 pages
Week 7 - Indexing Structures
No ratings yet
Week 7 - Indexing Structures
25 pages
7-Indexing and Block
No ratings yet
7-Indexing and Block
20 pages
CO3 Notes Indexing
No ratings yet
CO3 Notes Indexing
11 pages
Index Method1
No ratings yet
Index Method1
24 pages
I3306-chap2-TD2-EN - Fa23-24-Solution
No ratings yet
I3306-chap2-TD2-EN - Fa23-24-Solution
6 pages
Indexing Dbms
No ratings yet
Indexing Dbms
22 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
Unit 5
No ratings yet
Unit 5
54 pages
Indexing
No ratings yet
Indexing
62 pages
Index Architecture: Febriliyan Samopa
No ratings yet
Index Architecture: Febriliyan Samopa
110 pages
Chapter 3
No ratings yet
Chapter 3
50 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
Indexing
No ratings yet
Indexing
6 pages
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
No ratings yet
FALLSEM2019-20 ITE1003 ETH VL2019201002592 Reference Material I 06-Nov-2019 Indexing
32 pages
Exercise 18.18 in The Text Book ("Fundamentals of Database Systems-6th Edition", Elmasri Et Al.)
No ratings yet
Exercise 18.18 in The Text Book ("Fundamentals of Database Systems-6th Edition", Elmasri Et Al.)
2 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
23 pages
Indexing
No ratings yet
Indexing
8 pages
Sap Hana: Sudha Paluru
No ratings yet
Sap Hana: Sudha Paluru
56 pages
Co3 Session 21
No ratings yet
Co3 Session 21
53 pages
Indexing Structures For Files: Database Design Database Design
No ratings yet
Indexing Structures For Files: Database Design Database Design
9 pages
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
No ratings yet
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
20 pages
Indexing Structures For Files
No ratings yet
Indexing Structures For Files
30 pages
Tutorial 2 & 3 Update
No ratings yet
Tutorial 2 & 3 Update
29 pages
Excel Pivot Tables
No ratings yet
Excel Pivot Tables
10 pages
Oracle Database Upgrade To 11Gr2: From Etwiki
100% (1)
Oracle Database Upgrade To 11Gr2: From Etwiki
8 pages
DBMS Question Bank
No ratings yet
DBMS Question Bank
2 pages
DBMS MCQ
No ratings yet
DBMS MCQ
59 pages
Gajanan
No ratings yet
Gajanan
23 pages
What Is A DBMS?
No ratings yet
What Is A DBMS?
47 pages
Dbms Lab Manual Bcs403-New-1
No ratings yet
Dbms Lab Manual Bcs403-New-1
76 pages
PowerCenter Level1 Unit01
No ratings yet
PowerCenter Level1 Unit01
18 pages
Unit 2 DS
No ratings yet
Unit 2 DS
116 pages
DBS201 SQL Practice Problems: Sample Questions and SQL Answers
No ratings yet
DBS201 SQL Practice Problems: Sample Questions and SQL Answers
5 pages
PDF&Rendition 1 1
No ratings yet
PDF&Rendition 1 1
8 pages
Dbms Qutn Bank Updated
No ratings yet
Dbms Qutn Bank Updated
6 pages
Hospital Example From Atre, S. Data Base:: Structured Techniques For Design, Performance, and Management
No ratings yet
Hospital Example From Atre, S. Data Base:: Structured Techniques For Design, Performance, and Management
22 pages
FINAL EXAM PERFECT UGRD-ITE6202B Data Structure and Algorithms
No ratings yet
FINAL EXAM PERFECT UGRD-ITE6202B Data Structure and Algorithms
31 pages
Understanding The Data Warehouse Lifecycle
No ratings yet
Understanding The Data Warehouse Lifecycle
9 pages
Current Log
No ratings yet
Current Log
50 pages
Practical File
No ratings yet
Practical File
47 pages
A Data Mart Is A Subset of A Data Warehouse Focused On A Particular Line of Business, Department, or Subject Area
No ratings yet
A Data Mart Is A Subset of A Data Warehouse Focused On A Particular Line of Business, Department, or Subject Area
4 pages
COMP1638: Database Management and Administration Lab 8 Flashback Technologies
No ratings yet
COMP1638: Database Management and Administration Lab 8 Flashback Technologies
5 pages
Database Management System
No ratings yet
Database Management System
4 pages
Q1 Consolidation
No ratings yet
Q1 Consolidation
19 pages
Data Lakes Powering The Future of Big Data
No ratings yet
Data Lakes Powering The Future of Big Data
8 pages
Event Management System: Conclusion Comparision
No ratings yet
Event Management System: Conclusion Comparision
1 page
Lab 4 - Installation of Hadoop and MapReduce WordCount Example
No ratings yet
Lab 4 - Installation of Hadoop and MapReduce WordCount Example
14 pages
PHP 2
No ratings yet
PHP 2
4 pages
The Tech Interview Playbook: From DSA to System Design
From Everand
The Tech Interview Playbook: From DSA to System Design
Chinmoy Mukherjee
No ratings yet
Q Tips: Fast, Scalable, and Maintainable Kdb+
From Everand
Q Tips: Fast, Scalable, and Maintainable Kdb+
Nick Psaris
No ratings yet
Learn MongoDB in 24 Hours
From Everand
Learn MongoDB in 24 Hours
Alex Nordeen
5/5 (2)

Chapter - 3 - Indexing Structures For Files

Uploaded by

Chapter - 3 - Indexing Structures For Files

Uploaded by

Chapter 3

Indexing Structures for Files

1 Single-level Ordered Indexes

1 Single-level Ordered Indexes

For an dense index on the SSN field:

◼ This is compared to an average linear search cost of:

◼ Defined on an ordered data file.

◼ One index entry for each block in the data file

◼ A similar scheme can use the last record in a block.

ID Name DoB Salary Sex

◼ Number of index entries?

◼ Search/ Insert/ Update/ Delete?

◼ Defined on an ordered data file.

◼ One index entry each distinct value of the field.

Dept_No Name DoB Salary Sex

◼ Number of index entries?

◼ Search/ Insert/ Update/ Delete?

◼ The index is an ordered file with two fields:

◼ There can be many secondary indexes for the same file.

Index field Block 5

Secondary index on key field

◼ Number of index entries?

◼ Search/ Insert/ Update/ Delete?

Dept Name DoB Job Sex

Secondary Index on non-key field: option 3

◼ Number of index entries?

◼ Search/ Insert/ Update/ Delete?

◼ Ordered file on indexing field?

For a primary index on the ordering key field SSN:

◼ This is compared to a dense index cost of: 11 block accesses

1 Single-level Ordered Indexes

◼ Such a multi-level index is a form of search

1 Single-level Ordered Indexes

◼ In a B-Tree, pointers to data records exist at

◼ We can choose p to be the largest value satisfying the above

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

p = 3 and pleaf = 2 Insertion Sequence: 8, 5, 1, 7, 3, 12, 9, 6

◼ If an internal node is underflow:

Deletion sequence: 5, 12, 9

Deletion sequence: 5, 12, 9

Delete 12: underflow

Deletion sequence: 5, 12, 9

Deletion sequence: 5, 12, 9

◼ Logically ordering for a data file

1 Single-level Ordered Indexes

1 Single-level Ordered Indexes

◼ The hash index is a secondary

◼ In Oracle, a function-based index is an index

1 Single-level Ordered Indexes

◼ UNIQUE is used to guarantee that no two rows of a table

CREATE INDEX DnoIndex ON EMPLOYEE (Dno)

You might also like