0% found this document useful (0 votes)

59 views7 pages

An Efficient Approach For Data Indexing in Datawarehousing and Datamining

This document summarizes an approach for efficient data indexing in data warehousing and data mining. It discusses different indexing techniques like sequential indexing, B-tree indexing, and advanced techniques like inverted list indexing and bitmap indexing. Inverted list indexing improves over B-tree indexing by allowing keyword lookups and supporting multi-dimensional queries. The paper proposes a bitmap indexing technique that further increases efficiency and reduces data retrieval time compared to other approaches.

Uploaded by

Luna Kim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views7 pages

An Efficient Approach For Data Indexing in Datawarehousing and Datamining

Uploaded by

Luna Kim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

International Journal of Innovations in Engineering and Technology (IJIET)

An Efficient Approach for Data Indexing in

Datawarehousing and Datamining
Naveen Garg
Ph D Scholar (Computer Science)
Sai Nath University, Jharkhand, India

Dr. H.M. Rai

Professor
N.C. College of Engineering, Panipat, India

Abstract - Today, many tools and techniques have increased in the performance of database management as data
warehousing and data mining. These warehouses provide storage functionality and responsiveness to queries beyond
capacity of transaction oriented database. This paper focuses on Indexing of the data warehouse and omits the
requirement of significant manual intervention such as the data acquisition, data quality management, and functionality
and performance optimization. Many approaches are used for data indexing such as sequential, B-tree key and some
other advanced techniques. A comparison is done using some characteristics between theses indexing techniques. Based
on the results of comparison, a Bitmapped indexing technique is created which increases the efficiency and reduces the
time for retrieval of data.

Keywords: OLTP, OLAP, DSS

1. INTRODUCTION
Growth of humanity is based on the growth of knowledge which itself is based on growth of knowledge bases.
Knowledge bases are derived from databases using knowledge of its experts. Knowledge, such extracted helps in the
development of experts through proper decisions and in turn results in humanity. Hence, for development of
knowledge bases it is essential to identify, analyze and stipulate data elements and simulate process using them.
Actual data/information available in real world are non linear in nature and analog in characteristics. Depending on
their analogous nature and types of their characteristics they are further classified as visual, audio or text data etc.
When they are kept in the actual form in storage systems they are called analog data bases. For quick retrieval and
processing they must be converted into digital equivalents.

A study of actual forms of data has revealed that the data processing of actual or analog data is very difficult and
complicated. This is a proven fact that the digital data processing is the only easiest, simplest, fastest, accurate most
efficient type of processing. Fortunately, it is possible to convert all analog form of data into equivalent digital form,
which are suitable to great extent for their development as knowledge bases.
The analog data when converted to digital equivalent possess only the approximate replica as the conversion itself is
based on the successive approximation methodology and the Niquist sampling rate [1].

Processing of digital data becomes faster, more accurate and gives better efficiency and that too at the minimum cost
resulting in most effective methodology. Thus, it becomes essential to convert acquired actual data in analog form to
digital equivalents which in turn requires large storage space. So, the structured storage methodology involves
handling of large as well as very large amount of data. The storage space is almost directly proportional to the
volume of data.

The proper methodology must be evolved to store and extract data effectively [4]. This has resulted in the concept of
data bases and data storage systems [2].

1.1 Data Indexing

Vol. 1 Issue 4 December 2012 108 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

Once data has been cleaned properly they are to be stored in large storages such as functional data bases i.e. Data
ware houses. Only to dump data in storage will again create jumbled type of data. In both cases faster search
techniques need to be evolved for better query processing. To arrange data in a database in such a way that
retrieval/accessing and updating becomes easier and faster, a process known as indexing comes in. An index access
structure is similar to the index used in a text book which lists important terms at the end of the book in alphabetical
order along with a list of address page numbers. In this case we use these addresses to locate a term in the text book
by searching the specified pages. Alternatively, if no other guidance is given, the whole text book has to be searched
word by word to find the term the user is interested in. This corresponds to linear search on a file. Of course, most
books do have additional informatics such as chapter and section-titles that can help users to find a term without
having to search through the whole book. However, the index is the only exact indication of where each term occurs
in the book.
With the emergence of powerful new indexing technologies, instant and ad-hoc queries and fast data analysis are
possible using existing databases. Despite the good customer service and data analysis capabilities, many customer
services and data warehousing applications lack good performance. To meet this challenge in business applications
such as customer services, e-commerce etc., data warehouses must deliver data quickly through user friendly
methods [5].
The main purpose of data is to access and use it. Quick access of information requires storage of data in structured
form. The storage of data in structured form helps develop efficient and faster search technique to handle more
complex queries and retrieve data with maximum precision.
The evolution of storage and access technique starts with the evolution of flat files. This was suitable, when files are
small. As flat files require sequential scan of all the records in the file, the data access/retrieval time increases with
the increase in the volume of data and thus results in more costly processing.
II. B-TREE INDEXES
Like the white pages of a telephone book, which lists people names in alphabetical order, B-tree allows users to
perform partial key lookups and view the data in sorted order.
B-trees offered a vast improvement over hashed keys in terms of flexibilty and were a great boon to OLTP systems
because they do not require unique and arbitrary key values.
However, B-tree indexes are limited in that they are case sensitive and require a left to right match between the
criteria entered and the values in the index. For example to find a record of ‘Chandragupta Maurya’, it is to be
noticed that his name was entered as ‘Maurya Chandragupta’ including exact capitalization, punctuation and
spacing. Most databases still utilize B-tree indexes, even in some RDBMs as their primary indexes.
The first revolution in end user data access came in the 1980s with the development of 4GLs tools. Fourth
Generation Languages have made it possible to develop new applications in a fraction of time required by
conventional programming techniques, enabling milions of users to be brought online.
But it was still not enough. Organizations could not catch hold of the return on investment they had expected from
their investments, in 4GL technology due to following reasons [6].
• While the number of users and frequency of data access continued to expand, the database kept growing
larger as well.
• Although new applications allowed users access to corporate data, it was in a rigid, predefined manner that
protected system resources.
• Users who were not comfortable with a character based application environment or who dit not take time to
learn the cryptic commands and data layouts were still dependent on the I.T. department for their data
access needs.
• To get information that went beyond the pre-established reports could take weeks by the time I.T.
scheduled time to write a new report.
• Individual access was limited to one system at a time and access to multiple data sources on various hosts
from the same terminal was virtually impossible.
III. ADVANCE INDEXING TECHNIQUES
Although the indexing has been around since the early days of computers, there have been great advances in
indexing technology over the years. Advanced indexing technology is the most effective way to reduce the disk I/O
required to query, analyze, summarize and retrieve data. Following advanced indexes deliver dramatic performance
improvements without major investments in hardware.
• Inverted List Indexes

Vol. 1 Issue 4 December 2012 109 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

• Bitmapped Indexes
• Aggregation Indexes

2.1. Inverted List Indexes

Inverted list indexes [3] provide much greater functionality and flexibility than B-tree indexes. Inverted list indexes
reverse the structure with its pointers. They store the data from the database as keys, so the data content can be
quickly searched on with pointers back to the database as data in the index, resulting in quick retrieval of data
records.
Inverted list indexes are far superior to B-tree indexes. It can perform keyword lookups, provide an instants up-front
qualifying count and support unlimited multicolumn and multidimensional queries. They enhance data access in
both online (OLTP) and decision support (OLAP) environments.
Added advantage in this system is that both users and I.T. professionals benefit from the added functionality and
enhanced performance gained users can intuitively search through data, finding records in a way that is obvious and
logical.

2.2. Bit Mapped Indexing

Another type of advanced indexing technique is a bitmap or bitmapped indexing.

Table 1. Bit-Map Table

Bitmap indexes provide high speed index-only query processing for instant counts, keyword searches and
multicolumn combinations using multiple criteria without concatenating the columns into multipart keys.The
structure of a bitmapped index resembles a spreadsheet. The possible values go across the top, the record numbers
down one side, and a flag or a ‘bit’ is set to ON or OFF in each cell, depending for a Y/N flag might resemble the
table 1.
Initially, bitmaps were limited to low cardinality columns or coded data with few values such as ‘Y/N’ or 0 to 1
because they grew unmanageably large for high cardinality columns with many possible values especially with large
amount of data. The early bitmapped indexes could not efficiently handle high cardinality data such as textual name
and descriptive fields or numeric data with many values because the bit map that must be created and maintained
becomes enormous.
But, the present data warehousing solutions rely solely on bitmapped indexes as their indexing methodology due to
its faster indexing rate. The performance impact of high cardinality data is achieved. Early concept of data being
fairly static in nature and low maintainability has changed considerably.

Vol. 1 Issue 4 December 2012 110 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

2.3. Aggregation Indexes

Data warehousing or Decision support applications contain millions of rows of data that users want to summarize for
business intelligence. One method of summarizing information for data analysis is to perform a table scan of the
famous personalities of Haryana in Sports and sort of data, but that can take enormous amount of time of CPU for
each query even with parallel processing. For example some sorted source data for ‘famous personalities of
Haryanain Sports’ for Sex, Category, Place, and year are shown in table 2.
Table 2: Aggregation Index
Sex Category Place Year
F Hockey Rohtak 2007
F Gymnastics Sonipat 2004
M Golf Gurgaon 2010
M Wrestling Rohtak 2006
M Vollyball Karnal 2007

Based on this data, the numbers of possible lines in a report or bar chart is shown in table 3.

Table 3: Summary Indexing

Aggregate By Number of lines in report
Sex 2
Category 5
Place 4
Year 5

Instead of sequentially reading the raw data the more common method of aggregating data is for the I.T. staff to pre
build summary tables that contain the rolled up data aggregations that users want. Summary tables for predictable
queries are fast at query time, but it still makes a full table scan of the large fact table to build one.
What makes it more complex is no matter how many summary tables IT builds, users always need to query in a new
way given the inherent nature of data warehouse, which is to look for new information and patterns. Also there is no
drill down to view the raw detailed data that makes up the summary table which is a table scan of the large related
table. As an alternative, aggregation indexes can quickly summarize categories on the fly. They can dynamically
calculate the number of wrestlers, in a particular period and a specific category instantly.
Aggregation indexes eliminate the needs for a summary table to match each possible user aggregation. Aggregation
indexes allow the user complete flexibility in the selection criteria based on inverted list indexing then dynamically
summarize the metric data at high rates of speeds.
More significant is the fact that, aggregation table gives instant access back to the detail data because they contain
pointers (row ids) to the raw details data. After viewing summary one can instantly drill down and view the detail
data.
IV. ACCESS METHOD COMPARISON
Advanced search techniques have lot of characteristics over the sequential scan and Relational (B-tree) key. A
comparison is shown using some special characteristics of indexing methodology in table 4.
Table 4: Comparison of Accessing Methods
Characteristics Sequential Scan Relational (B-tree) Key Inverted list Index

Key Word search Yes - Yes

Partial Key Searches Yes - Yes

Progressive searches (drill - - Yes

through)

Vol. 1 Issue 4 December 2012 111 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

Multiple key combination - Yes Yes

Automated quantifying - - Yes

Count

Case Insensitive - - Yes

Position insensitive - - Yes

Pre-join indexes - - Yes

Relational Logic Yes Yes Yes

Boolean Logic Yes Yes Yes

Soundex - - Yes

Excluded word - - Yes

Concatenated key - - Yes

Composition Keys - - Yes

Grouping Constants - - Yes

Batch Indexing - - Yes

V. CREATION OF BIT MAP INDEXING

Bit mapped indexes are useful in processing complex queries in decision support systems (DSS) and they have been
implemented successfully in several commercial database systems. Major data structures used in this type of
indexing is B-tree and B-tree string extension.
The indicated column of the data file is scanned to identify the unique value. These unique values are stored in the
code array is used while populating the bitmaps. The index returned by the code array for a key is used to index into
the bit table to locate the bitmap for the key.
A bit table is constructed to hold the bitmaps for each of the unique keys in the data file. The bitmap is a character
array. This bit table is used to create the b-tree index. Each of the unique key values is picked up from the code array
and the bitmap is picked from the bit table value encoding.
The simple algorithm used for creation of bitmap indexes and retrieval of data is shown in figure 1 and figure 2 in
the form of flow charts.

Vol. 1 Issue 4 December 2012 112 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

Fig 1: Flow Chart for Bitmap Index Creation

Fig. 2: Flow Chart for Retrieving Data

VI. CONCLUSION
The bit map index algorithm developed is used and the time consumed in indexing at different cardinality is shown
in the table 5.

Vol. 1 Issue 4 December 2012 113 ISSN: 2319-1058

International Journal of Innovations in Engineering and Technology (IJIET)

Table 5: Observations

REFERENCES
[1] K. Doris, E. Janssen, C. Nani and Athon Z., “A 480 mW 2.6 GS/s 10b Time-Interleaved ADC With 48.5 dB SNDR up to Nyquist in 65 nm
CMOS” in IEEE Journal of Solid-state Circuits, vol. 46 No. 12, December 2011, pp. 2821-2833.
[2] You, J. Dillon, T. Liu, J., “An integration of data mining and data warehousing for hierarchical multimedia information retrieval”, in
International Symposium on Intelligent Multimedia, Video and Speech Processing, August 2002, pp. 373-376.
[3] Tomasic A., Garcia-Nolina, H., and Soen K., “Incremental Updates of Inverted List for Text Retrieval”, proc. ACM SIGMOD cont on
management of data, Minnapolis, pp 289-300.
[4] Lawyer, J.; Chowdhury, S.; Walter E., “Best practices in data warehousing to support business initiatives and needs ”, 37th Annual
Hawaii IEEE International Conference on System Sciences, Jan 2004.
[5] Jamil, S.; Ibrahim, R, “Performance analysis of indexing techniques in Data warehousing ”, in IEEE International Conference on
Emerging Technologies, Islamabad on Dec 2009.
[6] Graefe, G.; Kuno, H, “Modern B-tree techniques”, in 27th IEEE International Conference on Data Engineering, Hannover, on May 2011.

Vol. 1 Issue 4 December 2012 114 ISSN: 2319-1058

Index On The Search Key, and Heap Files With An Unclusted Hash Index. Briefly Discuss The
No ratings yet
Index On The Search Key, and Heap Files With An Unclusted Hash Index. Briefly Discuss The
5 pages
An Elasticsearch Crash Course Presentation PDF
No ratings yet
An Elasticsearch Crash Course Presentation PDF
81 pages
Introduction To Database Management System Second Edition PDF
100% (2)
Introduction To Database Management System Second Edition PDF
553 pages
Hotel Billing System
100% (3)
Hotel Billing System
65 pages
Advanced Database Indexing
No ratings yet
Advanced Database Indexing
17 pages
Fs Mini Project Report
No ratings yet
Fs Mini Project Report
25 pages
An Introduction To Database Systems Bipin C.desaI
No ratings yet
An Introduction To Database Systems Bipin C.desaI
849 pages
Chapter 11. File Organisation and Indexes
No ratings yet
Chapter 11. File Organisation and Indexes
56 pages
0801 2378 PDF
No ratings yet
0801 2378 PDF
63 pages
Recent Progress On Selected Topics in Database Research
No ratings yet
Recent Progress On Selected Topics in Database Research
15 pages
Kuvempu University Data Warehousing
No ratings yet
Kuvempu University Data Warehousing
6 pages
Engr Ass! One
No ratings yet
Engr Ass! One
14 pages
Data Mining MCA 3 Sem
No ratings yet
Data Mining MCA 3 Sem
51 pages
1 Indexing Techniques
No ratings yet
1 Indexing Techniques
30 pages
Chapter 3
No ratings yet
Chapter 3
40 pages
Infosys Certified Software Programmer-Python
No ratings yet
Infosys Certified Software Programmer-Python
5 pages
UNIT 1 On Databases
No ratings yet
UNIT 1 On Databases
66 pages
Study of Knowledge Discovery On The Web Using Fuzzy Approach
No ratings yet
Study of Knowledge Discovery On The Web Using Fuzzy Approach
7 pages
Compusoft, 3 (10), 1108-115 PDF
No ratings yet
Compusoft, 3 (10), 1108-115 PDF
8 pages
Data Warehouse - Bitmap Indexing
No ratings yet
Data Warehouse - Bitmap Indexing
24 pages
TERM PAPER - DBMS N
No ratings yet
TERM PAPER - DBMS N
5 pages
Readings in Database Systems: Fifth Edition
No ratings yet
Readings in Database Systems: Fifth Edition
54 pages
Query Recommender System Using Hierarchical Classification
No ratings yet
Query Recommender System Using Hierarchical Classification
4 pages
Sap Hana and Its Performance Benefits
No ratings yet
Sap Hana and Its Performance Benefits
9 pages
Database Guides
No ratings yet
Database Guides
4 pages
File Organization
No ratings yet
File Organization
41 pages
Lt20 21 Index
No ratings yet
Lt20 21 Index
28 pages
Computer Applications Topic
No ratings yet
Computer Applications Topic
3 pages
Associate Cloud Engineer (How To Prepare For Exams)
No ratings yet
Associate Cloud Engineer (How To Prepare For Exams)
7 pages
A Brief History of Database Systems
100% (1)
A Brief History of Database Systems
4 pages
This Tutorial Teaches ASP PDF
No ratings yet
This Tutorial Teaches ASP PDF
223 pages
Indexing in Relational Databases
No ratings yet
Indexing in Relational Databases
2 pages
DWH Indexes
No ratings yet
DWH Indexes
11 pages
Overview of File Systems
No ratings yet
Overview of File Systems
13 pages
Indexing Techniquesto Enhancethe Performance
No ratings yet
Indexing Techniquesto Enhancethe Performance
10 pages
Database Management Systems: (Revised by Jiin-Feng Chen, National Chengchi University For Classroom Use)
No ratings yet
Database Management Systems: (Revised by Jiin-Feng Chen, National Chengchi University For Classroom Use)
40 pages
G.C.E. (Advanced) Level Information & Communication Technology
No ratings yet
G.C.E. (Advanced) Level Information & Communication Technology
12 pages
Range of Quiz 2: Database System Implementation
No ratings yet
Range of Quiz 2: Database System Implementation
3 pages
01 First
No ratings yet
01 First
11 pages
Basis Midterm Database
No ratings yet
Basis Midterm Database
19 pages
Student Attendance Management System
No ratings yet
Student Attendance Management System
9 pages
Building Data Pipeline With Pentaho Lab Guide
No ratings yet
Building Data Pipeline With Pentaho Lab Guide
18 pages
Data Mining and Data Warehouse: Qis College of Engineering & Technology Ongole
No ratings yet
Data Mining and Data Warehouse: Qis College of Engineering & Technology Ongole
10 pages
Managing Database Systems
No ratings yet
Managing Database Systems
14 pages
Dbms Ishan
No ratings yet
Dbms Ishan
2 pages
DBMS Unit-4
No ratings yet
DBMS Unit-4
35 pages
Database
No ratings yet
Database
15 pages
Cosmos DB 4-12
No ratings yet
Cosmos DB 4-12
9 pages
Types of Indexes
No ratings yet
Types of Indexes
9 pages
DBMS Unit1 Notes
No ratings yet
DBMS Unit1 Notes
40 pages
Data Mining and Data Warehouse
No ratings yet
Data Mining and Data Warehouse
11 pages
DBMS A1
No ratings yet
DBMS A1
10 pages
DWDM Material
No ratings yet
DWDM Material
175 pages
Unit 3 DBMS
No ratings yet
Unit 3 DBMS
114 pages
CS614 Short Notes Midterm
No ratings yet
CS614 Short Notes Midterm
18 pages
Implementing A Generic Data Access Layer Using Entity Framework - Magnus Montin
100% (2)
Implementing A Generic Data Access Layer Using Entity Framework - Magnus Montin
23 pages
Azure DE Interview Que
100% (1)
Azure DE Interview Que
25 pages
Chapter 7 8
No ratings yet
Chapter 7 8
50 pages
A Survey On Techniques For Indexing and Hashing in Big Data
No ratings yet
A Survey On Techniques For Indexing and Hashing in Big Data
6 pages
Assignment 1
No ratings yet
Assignment 1
11 pages
SQL DBA Mod 1 Intro
No ratings yet
SQL DBA Mod 1 Intro
27 pages
SAP Business Data Cloud
100% (2)
SAP Business Data Cloud
30 pages
Columstore Index
No ratings yet
Columstore Index
6 pages
MCQ - Hadoop - Javaguides
No ratings yet
MCQ - Hadoop - Javaguides
3 pages
SS 2 Data Processing 1ST Term 20172018 Exam
No ratings yet
SS 2 Data Processing 1ST Term 20172018 Exam
9 pages
Hash Table
No ratings yet
Hash Table
36 pages
DINLect 1
No ratings yet
DINLect 1
69 pages
Book Management User Management Issue/Return Tracking Reports
No ratings yet
Book Management User Management Issue/Return Tracking Reports
51 pages
Internship: Java Full Stack
No ratings yet
Internship: Java Full Stack
32 pages
PostgreSQL Operator For Kubernetes
No ratings yet
PostgreSQL Operator For Kubernetes
7 pages
Microsoft Access Help Desk (Edited)
No ratings yet
Microsoft Access Help Desk (Edited)
9 pages
Indexing and Hashing: Basic Concept, Ordered Indices: Adbms
No ratings yet
Indexing and Hashing: Basic Concept, Ordered Indices: Adbms
22 pages
UNIT 4 Updated - 121124
No ratings yet
UNIT 4 Updated - 121124
52 pages
Data Modeler Release Notes
No ratings yet
Data Modeler Release Notes
92 pages
C Dbadm 2404
No ratings yet
C Dbadm 2404
2 pages
2223L - BZBD-NS-W04 - SANS-NoSQL Injection
No ratings yet
2223L - BZBD-NS-W04 - SANS-NoSQL Injection
25 pages
Unit 1-1
No ratings yet
Unit 1-1
9 pages
Income Tax Project Selva
No ratings yet
Income Tax Project Selva
61 pages
The Study of Building the Data Warehouse
From Everand
The Study of Building the Data Warehouse
venkateswara Rao
No ratings yet
Building and Operating Data Hubs: Using a practical Framework as Toolset
From Everand
Building and Operating Data Hubs: Using a practical Framework as Toolset
Georg Graner
No ratings yet
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
From Everand
PYTHON DATA ANALYTICS: Mastering Python for Effective Data Analysis and Visualization (2024 Beginner Guide)
FLOYD BAX
No ratings yet
Learn Data Warehousing in 24 Hours
From Everand
Learn Data Warehousing in 24 Hours
Alex Nordeen
No ratings yet
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Application Design: Key Principles For Data-Intensive App Systems
From Everand
Application Design: Key Principles For Data-Intensive App Systems
Rob Botwright
No ratings yet
Data Structures Explained: A Practical Guide with Examples
From Everand
Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
From Everand
The InfluxDB Handbook: Deploying, Optimizing, and Scaling Time Series Data
Robert Johnson
No ratings yet
Image Retrieval: Fundamentals and Applications
From Everand
Image Retrieval: Fundamentals and Applications
Fouad Sabry
No ratings yet
Image Retrieval: Unlocking the Power of Visual Data
From Everand
Image Retrieval: Unlocking the Power of Visual Data
Fouad Sabry
No ratings yet
InfluxDB Essentials: Definitive Reference for Developers and Engineers
From Everand
InfluxDB Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet

An Efficient Approach For Data Indexing in Datawarehousing and Datamining

Uploaded by

An Efficient Approach For Data Indexing in Datawarehousing and Datamining

Uploaded by

International Journal of Innovations in Engineering and Technology (IJIET)

An Efficient Approach for Data Indexing in

Dr. H.M. Rai

Keywords: OLTP, OLAP, DSS

1.1 Data Indexing

Vol. 1 Issue 4 December 2012 108 ISSN: 2319-1058

Vol. 1 Issue 4 December 2012 109 ISSN: 2319-1058

2.1. Inverted List Indexes

2.2. Bit Mapped Indexing

Another type of advanced indexing technique is a bitmap or bitmapped indexing.

Table 1. Bit-Map Table

Vol. 1 Issue 4 December 2012 110 ISSN: 2319-1058

2.3. Aggregation Indexes

Table 3: Summary Indexing

Key Word search Yes - Yes

Partial Key Searches Yes - Yes

Progressive searches (drill - - Yes

Vol. 1 Issue 4 December 2012 111 ISSN: 2319-1058

Multiple key combination - Yes Yes

Automated quantifying - - Yes

Case Insensitive - - Yes

Position insensitive - - Yes

Pre-join indexes - - Yes

Relational Logic Yes Yes Yes

Boolean Logic Yes Yes Yes

Excluded word - - Yes

Concatenated key - - Yes

Composition Keys - - Yes

Grouping Constants - - Yes

Batch Indexing - - Yes

V. CREATION OF BIT MAP INDEXING

Vol. 1 Issue 4 December 2012 112 ISSN: 2319-1058

Fig 1: Flow Chart for Bitmap Index Creation

Fig. 2: Flow Chart for Retrieving Data

Vol. 1 Issue 4 December 2012 113 ISSN: 2319-1058

Vol. 1 Issue 4 December 2012 114 ISSN: 2319-1058

You might also like