Unit - 3 Data Cube Technology
Unit - 3 Data Cube Technology
What is OLAP?
Online Analytical Processing (OLAP) is a category of software that allows users to analyze information
from multiple database systems at the same time. It is a technology that enables analysts to extract and view
business data from different points of view.
Analysts frequently need to group, aggregate and join data. These OLAP operations in data mining are
resource intensive. With OLAP data can be pre-calculated and pre-aggregated, making analysis faster.
OLAP databases are divided into one or more cubes. The cubes are designed in such a way that creating and
viewing reports become easy. OLAP stands for Online Analytical Processing.
OLAP cube:
At the core of the OLAP concept, is an OLAP Cube. The OLAP cube
is a data structure optimized for very quick data analysis.
OLAP Cube
Data cube operations are used to manipulate data to meet the needs of users. These operations help to
select particular data for the analysis purpose. There are mainly 5 operations listed below-
Roll-up: Roll-up is also known as “consolidation” or “aggregation.” The Roll-up operation can be
performed in 2 ways
1. Reducing dimensions
2. Climbing up concept hierarchy. Concept hierarchy is a system of grouping things based on their
order or level.
Operation and aggregate certain similar data attributes having the same dimension together. For
example, if the data cube displays the daily income of a customer, we can use a roll-up operation to
find the monthly income of his salary.
Consider the following diagram
Drill-down: In drill-down data is fragmented into smaller parts. It is the opposite of the rollup process. It
can be done via
Moving down the concept hierarchy
Increasing a dimension
Slicing:
Here, one dimension is selected, and a new sub-cube is created.
Following diagram explain how slice operation performed:
Dicing:
This operation is similar to a slice. The difference in dice is you select 2 or more dimensions that
result in the creation of a sub-cube.
Pivot:
In Pivot, you rotate the data axes to provide a substitute presentation of data.
In the following example, the pivot is based on item types.
Advantages of OLAP
OLAP is a platform for all type of business includes planning, budgeting, reporting, and analysis.
Information and calculations are consistent in an OLAP cube. This is a crucial benefit.
Quickly create and analyze “What if” scenarios
Easily search OLAP database for broad or specific terms.
OLAP provides the building blocks for business modeling tools, Data mining tools, performance
reporting tools.
Allows users to do slice and dice cube data all by various dimensions, measures, and filters.
Hybrid OLAP
Hybrid OLAP is a mixture of both ROLAP and MOLAP. It offers fast computation of MOLAP and higher
scalability of ROLAP. HOLAP uses two databases.
1. Aggregated or computed data is stored in a multidimensional OLAP cube
2. Detailed information is stored in a relational database.
Benefits of Hybrid OLAP:
This kind of OLAP helps to economize the disk space, and it also remains compact which helps to
avoid issues related to access speed and convenience.
Hybrid HOLAP’s uses cube technology which allows faster performance for all types of data.
ROLAP are instantly updated and HOLAP users have access to this real-time instantly updated data.
MOLAP brings cleaning and conversion of data thereby improving data relevance. This brings best
of both worlds.
Drawbacks of Hybrid OLAP:
Greater complexity level: The major drawback in HOLAP systems is that it supports both ROLAP
and MOLAP tools and applications. Thus, it is very complicated.
Potential overlaps: There are higher chances of overlapping especially into their functionalities.