0% found this document useful (0 votes)

65 views10 pages

Indexing

Indexing allows for quick retrieval of records from a database. There are two main types of indexing: primary and secondary. Primary indexing uses a primary key to index records, while secondary indexing uses a non-key field. Indexes take up less space than storing the entire table but increase overhead for insert/delete operations. B-tree indexing is commonly used and balances search time and storage space. It structures data into internal nodes and leaf nodes to allow for both sequential and random access.

Uploaded by

prasad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

65 views10 pages

Indexing

Uploaded by

prasad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 10

Indexing

Indexing is defined as a data structure technique which allows you to quickly retrieve records
from a database file. It is based on the same attributes on which the Indices has been done.

An index

 Takes a search key as input

 Efficiently returns a collection of matching records.

An Index is a small table having only two columns. The first column comprises a copy of the
primary or candidate key of a table. Its second column contains a set of pointers for holding the
address of the disk block where that specific key value stored.

Types of Indexing

Type of Indexes

Database Indexing is defined based on its indexing attributes. Two main types of indexing
methods are:

a. Primary Indexing
b. Secondary Indexing

a. Primary Indexing
Primary Index is an ordered file which is fixed length size with two fields. The first field is the
same a primary key and second, filed is pointed to that specific data block. In the primary Index,
there is always one to one relationship between the entries in the index table.

The primary Indexing is also further divided into two types.

 Dense Index
 Sparse Index

i. Dense Index
In a dense index, a record is created for every search key valued in the database. This helps you
to search faster but needs more space to store index records. In this Indexing, method records
contain search key value and points to the real record on the disk.

ii. Sparse Index

It is an index record that appears for only some of the values in the file. Sparse Index helps you
to resolve the issues of dense Indexing. In this method of indexing technique, a range of index
columns stores the same data block address, and when data needs to be retrieved, the block
address will be fetched.

However, sparse Index stores index records for only some search-key values. It needs less space,
less maintenance overhead for insertion, and deletions but It is slower compared to the dense
Index for locating records.
Secondary Index

The secondary Index can be generated by a field which has a unique value for each record, and it
should be a candidate key. It is also known as a non-clustering index.

This two-level database indexing technique is used to reduce the mapping size of the first level.
For the first level, a large range of numbers is selected because of this; the mapping size always
remains small.

Example of secondary Indexing

In a bank account database, data is stored sequentially by acc_no; you may want to find all
accounts in of a specific branch of ABC bank.

Here, you can have a secondary index for every search-key. Index record is a record point to a
bucket that contains pointers to all the records with their specific search-key value.
Clustering Index

In a clustered index, records themselves are stored in the Index and not pointers. Sometimes the
Index is created on non-primary key columns which might not be unique for each record. In such
a situation, you can group two or more columns to get the unique values and create an index
which is called clustered Index. This also helps you to identify the record faster.

Example:

Let's assume that a company recruited many employees in various departments. In this case,
clustering indexing should be created for all employees who belong to the same dept.

It is considered in a single cluster, and index points point to the cluster as a whole. Here,
Department _no is a non-unique key.

What is Multilevel Index?

Multilevel Indexing is created when a primary index does not fit in memory. In this type of
indexing method, you can reduce the number of disk accesses to short any record and kept on a
disk as a sequential file and create a sparse base on that file.
B-Tree Index

B-tree index is the widely used data structures for Indexing. It is a multilevel index format
technique which is balanced binary search trees. All leaf nodes of the B tree signify actual data
pointers.

Moreover, all leaf nodes are interlinked with a link list, which allows a B tree to support both
random and sequential access.
 Lead nodes must have between 2 and 4 values.
 Every path from the root to leaf are mostly on an equal length.

 Non-leaf nodes apart from the root node have between 3 and 5 children nodes.

 Every node which is not a root or a leaf has between n/2] and n children.

Advantages of Indexing

Important pros/ advantage of Indexing are:

 It helps you to reduce the total number of I/O operations needed to retrieve that data, so
you don't need to access a row in the database from an index structure.
 Offers Faster search and retrieval of data to users.

 Indexing also helps you to reduce tablespace as you don't need to link to a row in a table,
as there is no need to store the ROWID in the Index. Thus you will able to reduce the
tablespace.

 You can't sort data in the lead nodes as the value of the primary key classifies it.

Disadvantages of Indexing

Important drawbacks/cons of Indexing are:

 To perform the indexing database management system, you need a primary key on the
table with a unique value.
 You can't perform any other indexes on the Indexed data.

 You are not allowed to partition an index-organized table.

 SQL Indexing Decrease performance in INSERT, DELETE, and UPDATE query.

B+ Tree

o The B+ tree is a balanced binary search tree. It follows a multi-level index format.
o In the B+ tree, leaf nodes denote actual data pointers. B+ tree ensures that all leaf nodes
remain at the same height.
o In the B+ tree, the leaf nodes are linked using a link list. Therefore, a B+ tree can support
random access as well as sequential access.
Structure of B+ Tree

o In the B+ tree, every leaf node is at equal distance from the root node. The B+ tree is of
the order n where n is fixed for every B+ tree.
o It contains an internal node and leaf node.

Internal node

o An internal node of the B+ tree can contain at least n/2 record pointers except the root
node.
o At most, an internal node of the tree contains n pointers.

Leaf node

o The leaf node of the B+ tree can contain at least n/2 record pointers and n/2 key values.
o At most, a leaf node contains n record pointer and n key values.
o Every leaf node of the B+ tree contains one block pointer P to point to next leaf node.

Searching a record in B+ Tree

Suppose we have to search 55 in the below B+ tree structure. First, we will fetch for the
intermediary node which will direct to the leaf node that can contain a record for 55.

So, in the intermediary node, we will find a branch between 50 and 75 nodes. Then at the end,
we will be redirected to the third leaf node. Here DBMS will perform a sequential search to find
55.
B+ Tree Insertion

Suppose we want to insert a record 60 in the below structure. It will go to the 3rd leaf node after
55. It is a balanced tree, and a leaf node of this tree is already full, so we cannot insert 60 there.

n this case, we have to split the leaf node, so that it can be inserted into tree without affecting the
fill factor, balance and order.

The 3rd leaf node has the values (50, 55, 60, 65, 70) and its current root node is 50. We will split
the leaf node of the tree in the middle so that its balance is not altered. So we can group (50, 55)
and (60, 65, 70) into 2 leaf nodes.

If these two has to be leaf nodes, the intermediate node cannot branch from 50. It should have 60
added to it, and then we can have pointers to a new leaf node.
This is how we can insert an entry when there is overflow. In a normal scenario, it is very easy to
find the node where it fits and then place it in that leaf node.

B+ Tree Deletion

Suppose we want to delete 60 from the above example. In this case, we have to remove 60 from
the intermediate node as well as from the 4th leaf node too. If we remove it from the
intermediate node, then the tree will not satisfy the rule of the B+ tree. So we need to modify it to
have a balanced tree.

After deleting node 60 from above B+ tree and re-arranging the nodes, it will show as follows:

B Tree Index Files B+ Tree Index Files

This is a binary tree structure similar to B+ This is a balanced tree with intermediary nodes and leaf
B Tree Index Files B+ Tree Index Files

tree. But here each node will have only two nodes. Intermediary nodes contain only pointers/address
branches and each node will have some to the leaf nodes. All leaf nodes will have records and
records. Hence here no need to traverse till leaf all are at same distance from the root.
node to get the data.

It has more height compared to width. Most width is more compared to height.

Number of nodes at any intermediary level 'l' Each intermediary node can have n/2 to n children. Only
is 2 l. Each of the intermediary nodes will have root node will have 2 children.
only 2 sub nodes.

Even a leaf node level will have 2 l nodes. Leaf node stores (n-1)/2 to n-1 values
Hence total nodes in the B Tree
are 2l+1−12l+1−1.

It might have fewer nodes compared to B+ tree Automatically Adjust the nodes to fit the new record.
as each node will have data. Similarly it re-organizes the nodes in the case of delete,
if required. Hence it does not alter the definition of B+
tree.

Since each node has record, there might not be Reorganization of the nodes does not affect the
required to traverse till leaf node. performance of the file. This is because, even after the
rearrangement all the records are still found in leaf
nodes and are all at equidistance. There is no change in
distance of records from neither root nor the time to
traverse till leaf node.

If the tree is very big, then we have to traverse If there is any rearrangement of nodes while insertion or
through most of the nodes to get the records. deletion, then it would be an overhead. It takes little
Only few records can be fetched at the effort, time and space. But this disadvantage can be
intermediary nodes or near to the root. Hence ignored compared to the speed of traversal
this method might be slower.

Yamaha R1 Service Manual 2007
100% (1)
Yamaha R1 Service Manual 2007
426 pages
Obiee Functions
No ratings yet
Obiee Functions
20 pages
Microsoft Access 2000
No ratings yet
Microsoft Access 2000
17 pages
Agitation Laboratory Report
100% (4)
Agitation Laboratory Report
34 pages
UNIT V Imp Questions
No ratings yet
UNIT V Imp Questions
12 pages
Query Processing and Optimization
No ratings yet
Query Processing and Optimization
28 pages
SQL Commands
No ratings yet
SQL Commands
48 pages
Multiple Choice Questions (MCQS) On Fundamental of Computers - Set 13 - Loksewa Exam
100% (1)
Multiple Choice Questions (MCQS) On Fundamental of Computers - Set 13 - Loksewa Exam
3 pages
SQL
No ratings yet
SQL
88 pages
Data Dictionary Tutorial
No ratings yet
Data Dictionary Tutorial
4 pages
Oracle Indexes
No ratings yet
Oracle Indexes
3 pages
Oracle Index Types
No ratings yet
Oracle Index Types
4 pages
Materialized View
No ratings yet
Materialized View
31 pages
Breitling - Histograms, Myths and Facts Oracle
No ratings yet
Breitling - Histograms, Myths and Facts Oracle
42 pages
Quiz Answers
No ratings yet
Quiz Answers
15 pages
SQL Commands
No ratings yet
SQL Commands
4 pages
Devices and Databases: o o o o o o
No ratings yet
Devices and Databases: o o o o o o
4 pages
MP Patwari Old Question Paper PDF
80% (10)
MP Patwari Old Question Paper PDF
174 pages
Guidelines For Application-Specific Indexes: See Also
No ratings yet
Guidelines For Application-Specific Indexes: See Also
10 pages
DBMS Unit-3
100% (1)
DBMS Unit-3
97 pages
Word 2011 Shortcut Keys
No ratings yet
Word 2011 Shortcut Keys
6 pages
Dbms New
No ratings yet
Dbms New
156 pages
Compiled by Raiz Maharjan, 2018
No ratings yet
Compiled by Raiz Maharjan, 2018
8 pages
New DBA
No ratings yet
New DBA
8 pages
(DDL) Create / Alter / Drop / Rename Table
No ratings yet
(DDL) Create / Alter / Drop / Rename Table
19 pages
Database Backup &amp Restore - MSSQL
No ratings yet
Database Backup &amp Restore - MSSQL
15 pages
UNIT-6 Important Questions & Answers
No ratings yet
UNIT-6 Important Questions & Answers
20 pages
Computer Networking Course Exam Paper
50% (2)
Computer Networking Course Exam Paper
3 pages
Advantages of An Integrated Database System With Regards To Expanding Website Capability
No ratings yet
Advantages of An Integrated Database System With Regards To Expanding Website Capability
1 page
Oracle SQL Sat PDF
No ratings yet
Oracle SQL Sat PDF
89 pages
Hashing in DBMS
No ratings yet
Hashing in DBMS
9 pages
Chapter 9 MySQL
No ratings yet
Chapter 9 MySQL
29 pages
Indexes in Database
100% (1)
Indexes in Database
38 pages
Oltp Vs Datawarehouse DSS
No ratings yet
Oltp Vs Datawarehouse DSS
2 pages
SQL Practice EMPLOYEE Table Created
No ratings yet
SQL Practice EMPLOYEE Table Created
20 pages
T - SQL Notes
No ratings yet
T - SQL Notes
46 pages
DCL Triggers
No ratings yet
DCL Triggers
22 pages
SQL Server Backup
100% (1)
SQL Server Backup
12 pages
Unit Iii SQL
No ratings yet
Unit Iii SQL
35 pages
Data Dictionary
No ratings yet
Data Dictionary
3 pages
Obtaining and Interpreting Execution Plans Using Dbms - Xplan: David Kurtz
No ratings yet
Obtaining and Interpreting Execution Plans Using Dbms - Xplan: David Kurtz
68 pages
Indexes
No ratings yet
Indexes
4 pages
SQL Tuning
No ratings yet
SQL Tuning
60 pages
Bitmap
No ratings yet
Bitmap
11 pages
Training Assignments: SQL Basics
No ratings yet
Training Assignments: SQL Basics
5 pages
Indexes in Oracle
No ratings yet
Indexes in Oracle
6 pages
SQL Bootcamp Intro
No ratings yet
SQL Bootcamp Intro
26 pages
PLSQL Interview Questions 1
No ratings yet
PLSQL Interview Questions 1
6 pages
Oracle Interview Preparation
No ratings yet
Oracle Interview Preparation
2 pages
Components of DBMS1
No ratings yet
Components of DBMS1
7 pages
List of SQL Commands: Background
No ratings yet
List of SQL Commands: Background
6 pages
Troubleshooting 'Latch Cache Buffers Chains' Wait Contention
No ratings yet
Troubleshooting 'Latch Cache Buffers Chains' Wait Contention
4 pages
Bitmap Index Internals
No ratings yet
Bitmap Index Internals
54 pages
SQL Interview Questions
100% (1)
SQL Interview Questions
10 pages
Materialized Views: Snapshots
No ratings yet
Materialized Views: Snapshots
3 pages
Database Management
No ratings yet
Database Management
5 pages
DBMS Concepts and SQL - Students Notes
No ratings yet
DBMS Concepts and SQL - Students Notes
28 pages
Office Automation Unit 5
No ratings yet
Office Automation Unit 5
17 pages
DBA Duites
No ratings yet
DBA Duites
3 pages
Dbms Lab Record 2 Sem All Solved Full
No ratings yet
Dbms Lab Record 2 Sem All Solved Full
9 pages
ORACLE 12C Complete Self-Assessment Guide
From Everand
ORACLE 12C Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
DBMS - Indexing: Dense Index
No ratings yet
DBMS - Indexing: Dense Index
5 pages
Unit - 1
No ratings yet
Unit - 1
34 pages
UNIT-3 Part 1 Requirement Analyis
No ratings yet
UNIT-3 Part 1 Requirement Analyis
16 pages
Unit - 2 Part 2 Principles
No ratings yet
Unit - 2 Part 2 Principles
7 pages
UNIT 5 File Organization in DBMS
No ratings yet
UNIT 5 File Organization in DBMS
22 pages
Unit - 1 Part 1 Introduction
No ratings yet
Unit - 1 Part 1 Introduction
14 pages
File Organization in DBMS
No ratings yet
File Organization in DBMS
13 pages
Normalization Problems
No ratings yet
Normalization Problems
5 pages
Normal Forms 1 2 3 BCNF
No ratings yet
Normal Forms 1 2 3 BCNF
9 pages
Database Objects in DBMS
No ratings yet
Database Objects in DBMS
3 pages
Database Users
No ratings yet
Database Users
2 pages
AEDT Icepak Intro 2019R1 L3 Flow and Thermal Boundary Conditions
No ratings yet
AEDT Icepak Intro 2019R1 L3 Flow and Thermal Boundary Conditions
20 pages
Losing Track of Time
No ratings yet
Losing Track of Time
2 pages
Chapter 2 Architectural Models
No ratings yet
Chapter 2 Architectural Models
44 pages
Candidate Registration Report
No ratings yet
Candidate Registration Report
2 pages
Charles Oman
No ratings yet
Charles Oman
49 pages
MIP GET VIEW BOQDripSystem
No ratings yet
MIP GET VIEW BOQDripSystem
6 pages
Cho em gần anh thêm chút nữa Sheet music for Piano (Solo)
No ratings yet
Cho em gần anh thêm chút nữa Sheet music for Piano (Solo)
1 page
ReadISACA QAE Databases On ISACA PERFORMTica
No ratings yet
ReadISACA QAE Databases On ISACA PERFORMTica
9 pages
Substitute Leadership
100% (1)
Substitute Leadership
1 page
Natwar Lal Joshi - Resume 2023
No ratings yet
Natwar Lal Joshi - Resume 2023
1 page
University of The East Caloocan Campus
No ratings yet
University of The East Caloocan Campus
5 pages
Yashaswini (DBMS)
No ratings yet
Yashaswini (DBMS)
8 pages
Car Safety Comprehension
100% (1)
Car Safety Comprehension
9 pages
Assignment 1 - KTEE203.1
No ratings yet
Assignment 1 - KTEE203.1
11 pages
HW8-smoother Tuning DIAL
100% (1)
HW8-smoother Tuning DIAL
5 pages
Pseudo Holday - Handle COVID 19 - Facebook Prophet
No ratings yet
Pseudo Holday - Handle COVID 19 - Facebook Prophet
27 pages
Appraisal Form
No ratings yet
Appraisal Form
12 pages
Geo 111 Cartography and Map Analysis
No ratings yet
Geo 111 Cartography and Map Analysis
2 pages
Alcohol Detection and Monitoring
No ratings yet
Alcohol Detection and Monitoring
11 pages
Transfer and Bevel Gears
No ratings yet
Transfer and Bevel Gears
3 pages
Calculation of Electrical Induction Near Power Lines
No ratings yet
Calculation of Electrical Induction Near Power Lines
22 pages
Geographical Investigations
No ratings yet
Geographical Investigations
10 pages
Vero, Krishia Ann G. (DRRR Week #2)
No ratings yet
Vero, Krishia Ann G. (DRRR Week #2)
3 pages
Midwifery Society of Nepal (MIDSoN)
No ratings yet
Midwifery Society of Nepal (MIDSoN)
5 pages
Lab Assignment 2
No ratings yet
Lab Assignment 2
7 pages
Module 1.session 3.ISCM.2021
No ratings yet
Module 1.session 3.ISCM.2021
18 pages
Elgamatic 100
No ratings yet
Elgamatic 100
1 page
Role of Principal
No ratings yet
Role of Principal
3 pages

Indexing

Uploaded by

Indexing

Uploaded by

Indexing

 Takes a search key as input

The primary Indexing is also further divided into two types.

ii. Sparse Index

Example of secondary Indexing

What is Multilevel Index?

Important pros/ advantage of Indexing are:

Important drawbacks/cons of Indexing are:

 You are not allowed to partition an index-organized table.

 SQL Indexing Decrease performance in INSERT, DELETE, and UPDATE query.

Searching a record in B+ Tree

B Tree Index Files B+ Tree Index Files

You might also like