0% found this document useful (0 votes)

239 views6 pages

Database Index PDF

Uploaded by

Ananya Getahun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

239 views6 pages

Database Index PDF

Uploaded by

Ananya Getahun

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Database index

A database index is a data structure that improves the speed of data retrieval operations on a database table at
the cost of additional writes and storage space to maintain the index data structure. Indexes are used to quickly
locate data without having to search every row in a database table every time a database table is accessed.
Indexes can be created using one or more columns of a database table, providing the basis for both rapid
random lookups and efficient access of ordered records.

An index is a copy of selected columns of data from a table, called a database key or simply key, that can be
searched very efficiently that also includes a low-level disk block address or direct link to the complete row of
data it was copied from. Some databases extend the power of indexing by letting developers create indexes on
functions or expressions. For example, an index could be created on upper(last_name), which would
only store the upper-case versions of the last_name field in the index. Another option sometimes supported
is the use of partial indices, where index entries are created only for those records that satisfy some conditional
expression. A further aspect of flexibility is to permit indexing on user-defined functions, as well as
expressions formed from an assortment of built-in functions.

Contents
Usage
Support for fast lookup
Policing the database constraints
Index architecture and indexing methods
Non-clustered
Clustered
Cluster
Column order
Applications and limitations
Types of indexes
Bitmap index
Dense index
Sparse index
Reverse index
Primary index
Secondary index
Index implementations
Index concurrency control
Covering index
Standardization
See also
References
Usage

Support for fast lookup

Most database software includes indexing technology that enables sub-linear time lookup to improve
performance, as linear search is inefficient for large databases.

Suppose a database contains N data items and one must be retrieved based on the value of one of the fields. A
simple implementation retrieves and examines each item according to the test. If there is only one matching
item, this can stop when it finds that single item, but if there are multiple matches, it must test everything. This
means that the number of operations in the average case is O(N) or linear time. Since databases may contain
many objects, and since lookup is a common operation, it is often desirable to improve performance.

An index is any data structure that improves the performance of lookup. There are many different data
structures used for this purpose. There are complex design trade-offs involving lookup performance, index
size, and index-update performance. Many index designs exhibit logarithmic (O(log(N))) lookup performance
and in some applications it is possible to achieve flat (O(1)) performance.

Policing the database constraints

Indexes are used to police database constraints, such as UNIQUE, EXCLUSION, PRIMARY KEY and
FOREIGN KEY. An index may be declared as UNIQUE, which creates an implicit constraint on the
underlying table. Database systems usually implicitly create an index on a set of columns declared PRIMARY
KEY, and some are capable of using an already-existing index to police this constraint. Many database systems
require that both referencing and referenced sets of columns in a FOREIGN KEY constraint are indexed, thus
improving performance of inserts, updates and deletes to the tables participating in the constraint.

Some database systems support an EXCLUSION constraint that ensures that, for a newly inserted or updated
record, a certain predicate holds for no other record. This can be used to implement a UNIQUE constraint
(with equality predicate) or more complex constraints, like ensuring that no overlapping time ranges or no
intersecting geometry objects would be stored in the table. An index supporting fast searching for records
satisfying the predicate is required to police such a constraint.[1]

Index architecture and indexing methods

Non-clustered

The data is present in arbitrary order, but the logical ordering is specified by the index. The data rows may be
spread throughout the table regardless of the value of the indexed column or expression. The non-clustered
index tree contains the index keys in sorted order, with the leaf level of the index containing the pointer to the
record (page and the row number in the data page in page-organized engines; row offset in file-organized
engines).

In a non-clustered index,

The physical order of the rows is not the same as the index order.
The indexed columns are typically non-primary key columns used in JOIN, WHERE, and
ORDER BY clauses.
There can be more than one non-clustered index on a database table.

Clustered

Clustering alters the data block into a certain distinct order to match the index, resulting in the row data being
stored in order. Therefore, only one clustered index can be created on a given database table. Clustered indices
can greatly increase overall speed of retrieval, but usually only where the data is accessed sequentially in the
same or reverse order of the clustered index, or when a range of items is selected.

Since the physical records are in this sort order on disk, the next row item in the sequence is immediately
before or after the last one, and so fewer data block reads are required. The primary feature of a clustered
index is therefore the ordering of the physical data rows in accordance with the index blocks that point to
them. Some databases separate the data and index blocks into separate files, others put two completely
different data blocks within the same physical file(s).

Cluster

When multiple databases and multiple tables are joined, it is called a cluster (not to be confused with clustered
index described previously). The records for the tables sharing the value of a cluster key shall be stored
together in the same or nearby data blocks. This may improve the joins of these tables on the cluster key, since
the matching records are stored together and less I/O is required to locate them.[2] The cluster configuration
defines the data layout in the tables that are parts of the cluster. A cluster can be keyed with a B-Tree index or
a hash table. The data block where the table record is stored is defined by the value of the cluster key.

Column order
The order that the index definition defines the columns in is important. It is possible to retrieve a set of row
identifiers using only the first indexed column. However, it is not possible or efficient (on most databases) to
retrieve the set of row identifiers using only the second or greater indexed column.

For example, in a phone book organized by city first, then by last name, and then by first name, in a particular
city, one can easily extract the list of all phone numbers. However, it would be very tedious to find all the
phone numbers for a particular last name. One would have to look within each city's section for the entries
with that last name. Some databases can do this, others just won't use the index.

In the phone book example with a composite index created on the columns (city, last_name,
first_name), if we search by giving exact values for all the three fields, search time is minimal—but if we
provide the values for city and first_name only, the search uses only the city field to retrieve all
matched records. Then a sequential lookup checks the matching with first_name. So, to improve the
performance, one must ensure that the index is created on the order of search columns.

Applications and limitations

Indexes are useful for many applications but come with some limitations. Consider the following SQL
statement: SELECT first_name FROM people WHERE last_name = 'Smith';. To
process this statement without an index the database software must look at the last_name column on every row
in the table (this is known as a full table scan). With an index the database simply follows the index data
structure (typically a B-tree) until the Smith entry has been found; this is much less computationally expensive
than a full table scan.
Consider this SQL statement: SELECT email_address FROM customers WHERE
email_address LIKE '%@wikipedia.org';. This query would yield an email address for every
customer whose email address ends with "@wikipedia.org", but even if the email_address column has been
indexed the database must perform a full index scan. This is because the index is built with the assumption that
words go from left to right. With a wildcard at the beginning of the search-term, the database software is
unable to use the underlying index data structure (in other words, the WHERE-clause is not sargable). This
problem can be solved through the addition of another index created on reverse(email_address)
and a SQL query like this: SELECT email_address FROM customers WHERE
reverse(email_address) LIKE reverse('%@wikipedia.org');. This puts the wild-
card at the right-most part of the query (now gro.aidepikiw@%), which the index on reverse(email_address)
can satisfy.

When the wildcard characters are used on both sides of the search word as %wikipedia.org%, the index
available on this field is not used. Rather only a sequential search is performed, which takes O(N) time.

Types of indexes

Bitmap index

A bitmap index is a special kind of indexing that stores the bulk of its data as bit arrays (bitmaps) and answers
most queries by performing bitwise logical operations on these bitmaps. The most commonly used indexes,
such as B+ trees, are most efficient if the values they index do not repeat or repeat a small number of times. In
contrast, the bitmap index is designed for cases where the values of a variable repeat very frequently. For
example, the sex field in a customer database usually contains at most three distinct values: male, female or
unknown (not recorded). For such variables, the bitmap index can have a significant performance advantage
over the commonly used trees.

Dense index

A dense index in databases is a file with pairs of keys and pointers for every record in the data file. Every key
in this file is associated with a particular pointer to a record in the sorted data file. In clustered indices with
duplicate keys, the dense index points to the first record with that key.[3]

Sparse index

A sparse index in databases is a file with pairs of keys and pointers for every block in the data file. Every key
in this file is associated with a particular pointer to the block in the sorted data file. In clustered indices with
duplicate keys, the sparse index points to the lowest search key in each block.

Reverse index

A reverse-key index reverses the key value before entering it in the index. E.g., the value 24538 becomes
83542 in the index. Reversing the key value is particularly useful for indexing data such as sequence numbers,
where new key values monotonically increase.

Primary index
The primary index contains the key fields of the table and a pointer to the non-key fields of the table. The
primary index is created automatically when the table is created in the database.

Secondary index

It is used to index fields that are neither ordering fields nor key fields (there is no assurance that the file is
organized on key field or primary key field). One index entry for every tuple in the data file (dense index)
contains the value of the indexed attribute and pointer to the block/record.

Index implementations
Indices can be implemented using a variety of data structures. Popular indices include balanced trees, B+ trees
and hashes.[4]

In Microsoft SQL Server, the leaf node of the clustered index corresponds to the actual data, not simply a
pointer to data that resides elsewhere, as is the case with a non-clustered index.[5] Each relation can have a
single clustered index and many unclustered indices.[6]

Index concurrency control

An index is typically being accessed concurrently by several transactions and processes, and thus needs
concurrency control. While in principle indexes can utilize the common database concurrency control
methods, specialized concurrency control methods for indexes exist, which are applied in conjunction with the
common methods for a substantial performance gain.

Covering index
In most cases, an index is used to quickly locate the data record(s) from which the required data is read. In
other words, the index is only used to locate data records in the table and not to return data.

A covering index is a special case where the index itself contains the required data field(s) and can answer the
required data.

Consider the following table (other fields omitted):

ID Name Other Fields

12 Plug ...
13 Lamp ...
14 Fuse ...

To find the Name for ID 13, an index on (ID) is useful, but the record must still be read to get the Name.
However, an index on (ID, Name) contains the required data field and eliminates the need to look up the
record.

Covering indexes are each for a specific table. Queries which JOIN/ access across multiple tables, may
potentially consider covering indexes on more than one of these tables.[7]
A covering index can dramatically speed up data retrieval but may itself be large due to the additional keys,
which slow down data insertion & update. To reduce such index size, some systems allow including non-key
fields in the index. Non-key fields are not themselves part of the index ordering but only included at the leaf
level, allowing for a covering index with less overall index size.

Standardization
No standard defines how to create indexes, because the ISO SQL Standard does not cover physical aspects.
Indexes are one of the physical parts of database conception among others like storage (tablespace or
filegroups). RDBMS vendors all give a CREATE INDEX syntax with some specific options that depend on
their software's capabilities.

See also
Index locking
Index (search engine)
Inverted index

References
1. PostgreSQL 9.1.2 Documentation: CREATE TABLE (http://www.postgresql.org/docs/9.1/static/s
ql-createtable.html)
2. Overview of Clusters (http://download.oracle.com/docs/cd/B12037_01/server.101/b10743/sche
ma.htm#sthref1069) Oracle® Database Concepts 10g Release 1 (10.1)
3. Database Systems: The Complete Book. Hector Garcia-Molina, Jeffrey D. Ullman, Jennifer D.
Widom
4. Gavin Powell (2006). Chapter 8: Building Fast-Performing Database Models (http://searchsecur
ity.techtarget.com/generic/0,295582,sid87_gci1184450,00.html). Beginning Database Design.
Wrox Publishing. ISBN 978-0-7645-7490-0.
5. "Clustered Index Structures" (http://msdn2.microsoft.com/en-us/library/ms177443.aspx). SQL
Server 2005 Books Online (September 2007).
6. Daren Bieniek; Randy Dess; Mike Hotek; Javier Loria; Adam Machanic; Antonio Soto; Adolfo
Wiernik (January 2006). "Chapter 4: Creating Indices" (http://www.microsoft.com/mspress/book
s/9364.aspx). SQL Server 2005 Implementation and Management. Microsoft Press.
7. Covering Indexes for Query Optimization (http://literatejava.com/sql/covering-indexes-query-opt
imization/)

Retrieved from "https://en.wikipedia.org/w/index.php?title=Database_index&oldid=984197027"

This page was last edited on 18 October 2020, at 19:29 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this
site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia
Foundation, Inc., a non-profit organization.

Hostel Management System Project Report
20% (5)
Hostel Management System Project Report
98 pages
Be 03000091
No ratings yet
Be 03000091
4 pages
Heap File Vs Sorted Files
No ratings yet
Heap File Vs Sorted Files
35 pages
RGO-MD070 - Item Codification Utility
No ratings yet
RGO-MD070 - Item Codification Utility
16 pages
Cit208 Calculus Educational Consult Eze-Ego QQQZZZW Updated
No ratings yet
Cit208 Calculus Educational Consult Eze-Ego QQQZZZW Updated
31 pages
Indexing Hashing Files
No ratings yet
Indexing Hashing Files
68 pages
CS 6312 DBMS Lab Manual Final
100% (1)
CS 6312 DBMS Lab Manual Final
74 pages
First Term SS3 Data Processing
No ratings yet
First Term SS3 Data Processing
22 pages
Oracle Advanced Techniques PL SQL
No ratings yet
Oracle Advanced Techniques PL SQL
269 pages
CS614 Finalterm Subjective Referencefile
No ratings yet
CS614 Finalterm Subjective Referencefile
27 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
100% (1)
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
7 pages
DP Ss3 Note First Term
100% (2)
DP Ss3 Note First Term
43 pages
DBMS External Internal Question Bank
No ratings yet
DBMS External Internal Question Bank
10 pages
Table Control Using Wizard in Module Pool Programming
83% (6)
Table Control Using Wizard in Module Pool Programming
90 pages
OrientDB Manual
No ratings yet
OrientDB Manual
1,490 pages
Database Indexing
No ratings yet
Database Indexing
4 pages
Document 4
No ratings yet
Document 4
20 pages
Advantage of Selenium Over QTP: 1 Free (No License Fees Is Required) 2 Support Large Number of Browser (QTP
No ratings yet
Advantage of Selenium Over QTP: 1 Free (No License Fees Is Required) 2 Support Large Number of Browser (QTP
66 pages
7-MongoDB Storage Engine
No ratings yet
7-MongoDB Storage Engine
32 pages
3.unit 3
No ratings yet
3.unit 3
19 pages
Lec 8 Indexing & Data Structures For Query Processing
No ratings yet
Lec 8 Indexing & Data Structures For Query Processing
51 pages
File Organization
No ratings yet
File Organization
41 pages
CIT 401 Lecture Note
No ratings yet
CIT 401 Lecture Note
46 pages
DBMS Unit 5
No ratings yet
DBMS Unit 5
24 pages
Database Management Systems, R. Ramakrishnan and J. Gehrke 1
No ratings yet
Database Management Systems, R. Ramakrishnan and J. Gehrke 1
32 pages
Dbms Unit III Notes 2022-23
No ratings yet
Dbms Unit III Notes 2022-23
18 pages
MSSQL - InDEX Creation and Usage
No ratings yet
MSSQL - InDEX Creation and Usage
8 pages
MIS Chapter 1
No ratings yet
MIS Chapter 1
42 pages
DeadLock On Adding Foreign Key Constraint With DDL - LOCK - TIMEOUT
No ratings yet
DeadLock On Adding Foreign Key Constraint With DDL - LOCK - TIMEOUT
6 pages
Unit - 4
No ratings yet
Unit - 4
42 pages
Lecture 14. Indexes and Constraints - MSTeams
No ratings yet
Lecture 14. Indexes and Constraints - MSTeams
5 pages
Screenshot 2025-03-12 at 9.41.04 AM
No ratings yet
Screenshot 2025-03-12 at 9.41.04 AM
41 pages
Dbms r18 Unit 5 Notes
No ratings yet
Dbms r18 Unit 5 Notes
24 pages
Hashing & Indexing Structures - Single Level & Multi Level Indices
No ratings yet
Hashing & Indexing Structures - Single Level & Multi Level Indices
1 page
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
SQL Server Index Basics
No ratings yet
SQL Server Index Basics
5 pages
Increasing Database Performance Using Indexes
No ratings yet
Increasing Database Performance Using Indexes
10 pages
Co2 - Index in DBMS 1
No ratings yet
Co2 - Index in DBMS 1
29 pages
Unit5 File Organization
No ratings yet
Unit5 File Organization
112 pages
DBMS Experiment - Lab 7
No ratings yet
DBMS Experiment - Lab 7
24 pages
Report Merged
No ratings yet
Report Merged
62 pages
7-Indexing and Block
No ratings yet
7-Indexing and Block
20 pages
Indexing Lecture Nov 2023 Detailed
No ratings yet
Indexing Lecture Nov 2023 Detailed
37 pages
Lec20Indexing v1
No ratings yet
Lec20Indexing v1
57 pages
CLL F399 Technical Reference Manual
No ratings yet
CLL F399 Technical Reference Manual
33 pages
UNIT1 Notes ABDA
No ratings yet
UNIT1 Notes ABDA
7 pages
Top 50 Mainframe Interview Questions
No ratings yet
Top 50 Mainframe Interview Questions
7 pages
CS 345: Topics in Data Warehousing: Thursday, October 21, 2004
No ratings yet
CS 345: Topics in Data Warehousing: Thursday, October 21, 2004
29 pages
Object-Oriented Programming: This Self
No ratings yet
Object-Oriented Programming: This Self
18 pages
DBMS Seminar
No ratings yet
DBMS Seminar
12 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
Inheritance (Object-Oriented Programming)
No ratings yet
Inheritance (Object-Oriented Programming)
9 pages
Dbms Mod3
No ratings yet
Dbms Mod3
54 pages
SQL Indexes 2
No ratings yet
SQL Indexes 2
10 pages
Introduction To Indexing in Database Management Systems Print
No ratings yet
Introduction To Indexing in Database Management Systems Print
12 pages
Indexes
No ratings yet
Indexes
70 pages
Dbms Suggestion For Semester 5
No ratings yet
Dbms Suggestion For Semester 5
5 pages
11.2 Indexing
No ratings yet
11.2 Indexing
26 pages
DBMS Unit9
No ratings yet
DBMS Unit9
44 pages
DBMS
No ratings yet
DBMS
3 pages
DBMS A1
No ratings yet
DBMS A1
10 pages
Index Architecture: Febriliyan Samopa
No ratings yet
Index Architecture: Febriliyan Samopa
110 pages
Introduction To Indexes
No ratings yet
Introduction To Indexes
35 pages
Indexes
No ratings yet
Indexes
4 pages
Primary Indexing
No ratings yet
Primary Indexing
7 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
7 pages
DBMS Unit-5
No ratings yet
DBMS Unit-5
5 pages
DBMS-U5 Notes
No ratings yet
DBMS-U5 Notes
16 pages
Indexing
No ratings yet
Indexing
62 pages
Indexing and Hashing: Basic Concept, Ordered Indices: Adbms
No ratings yet
Indexing and Hashing: Basic Concept, Ordered Indices: Adbms
22 pages
C# Overloaded Operators: by Glen Mccluskey
No ratings yet
C# Overloaded Operators: by Glen Mccluskey
3 pages
Indexing - DBMS
No ratings yet
Indexing - DBMS
20 pages
Lesson 4 - Indexing
No ratings yet
Lesson 4 - Indexing
6 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
DBMS - R2017 - Anna University
No ratings yet
DBMS - R2017 - Anna University
20 pages
Indexing
No ratings yet
Indexing
6 pages
S - UNIT VII Indexing in Database
No ratings yet
S - UNIT VII Indexing in Database
9 pages
SAP HANA Course Content PDF
0% (1)
SAP HANA Course Content PDF
4 pages
Bda Unit Iv
No ratings yet
Bda Unit Iv
4 pages
Final Petrol PPM - Merged
No ratings yet
Final Petrol PPM - Merged
36 pages
Sample Questions - DBMS
No ratings yet
Sample Questions - DBMS
5 pages
Indexing
No ratings yet
Indexing
6 pages
How Does Database Indexing Work
No ratings yet
How Does Database Indexing Work
4 pages
An in Depth Look at Database Indexing
No ratings yet
An in Depth Look at Database Indexing
3 pages
Zabbix1 8manual
No ratings yet
Zabbix1 8manual
86 pages
Best - Practices - For - Minimizing - Oracle - EBS - R12.1.3 - Upgrade - Downtime - Oct - 14 (1) F PDF
No ratings yet
Best - Practices - For - Minimizing - Oracle - EBS - R12.1.3 - Upgrade - Downtime - Oct - 14 (1) F PDF
82 pages
PostgreSQL IQ
No ratings yet
PostgreSQL IQ
27 pages
R22 Unit 5
No ratings yet
R22 Unit 5
23 pages
Database PDF
No ratings yet
Database PDF
22 pages
22040007-3w 2 PDF
No ratings yet
22040007-3w 2 PDF
1 page
IBM Exam 000-612 Questions
No ratings yet
IBM Exam 000-612 Questions
4 pages
Data Structures I Essentials
From Everand
Data Structures I Essentials
Dennis Smolarski
No ratings yet
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet

Database Index PDF

Uploaded by

Database Index PDF

Uploaded by

Database index

Support for fast lookup

Policing the database constraints

Index architecture and indexing methods

Applications and limitations

Index concurrency control

Consider the following table (other fields omitted):

ID Name Other Fields

Retrieved from "https://en.wikipedia.org/w/index.php?title=Database_index&oldid=984197027"

This page was last edited on 18 October 2020, at 19:29 (UTC).

You might also like