Lecture 8
Mamuna Fatima
DWH & OLAP
Supporting the human thought process
THOUGHT PROCESS QUERY SEQUENCE
• Analysis is directional
– Drill Down
– Roll Up
– Pivot
(More in subsequent slides)
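The three directions listed above can be illustrated on a tiny in-memory fact table. This is a minimal sketch using only the Python standard library; the rows, the column names (year, quarter, region) and the helper `aggregate` are hypothetical and are not taken from the lecture.

```python
from collections import defaultdict

# Hypothetical fact rows: (year, quarter, region, sales)
FACTS = [
    (2023, "Q1", "North", 100),
    (2023, "Q1", "South", 80),
    (2023, "Q2", "North", 120),
    (2023, "Q2", "South", 90),
    (2024, "Q1", "North", 110),
    (2024, "Q1", "South", 70),
]
COLUMNS = ("year", "quarter", "region")


def aggregate(facts, dims):
    """Sum sales grouped by the named dimensions."""
    idx = [COLUMNS.index(d) for d in dims]
    totals = defaultdict(int)
    for row in facts:
        key = tuple(row[i] for i in idx)
        totals[key] += row[3]
    return dict(totals)


# Roll up: the coarser view, sales per year.
by_year = aggregate(FACTS, ["year"])

# Drill down: add the quarter level beneath year.
by_year_quarter = aggregate(FACTS, ["year", "quarter"])

# Pivot: the same totals rearranged, region on rows and year on columns.
pivot = defaultdict(dict)
for (region, year), total in aggregate(FACTS, ["region", "year"]).items():
    pivot[region][year] = total

print(by_year)
print(by_year_quarter)
print(dict(pivot))
```

Each step answers the question that the previous result raises, which is the sense in which the analysis is directional.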
“All” possible queries (level aggregates)
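One common reading of "all possible queries" is that every subset of the dimensions defines one aggregate, with the omitted dimensions collapsed to ALL. The sketch below enumerates those combinations under that assumption; the dimension names and the function `group_by_sets` are hypothetical.

```python
from itertools import combinations

# Hypothetical dimensions of a cube.
DIMENSIONS = ("product", "region", "time")


def group_by_sets(dimensions):
    """List every level aggregate, from full detail down to (ALL, ..., ALL)."""
    result = []
    for k in range(len(dimensions), -1, -1):
        for kept in combinations(dimensions, k):
            # Dimensions that are not kept are collapsed to the ALL level.
            result.append(tuple(d if d in kept else "ALL" for d in dimensions))
    return result


cuboids = group_by_sets(DIMENSIONS)
for cuboid in cuboids:
    print(cuboid)
```

With n dimensions this yields 2^n aggregates (8 for the three used here), which is why pre-computing all of them grows expensive quickly.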
[Figure: data flow diagram with the labels Transaction Data, Data Loading, Data Cube (MOLAP), OLAP Presentation Tools, Reports, Decision Maker]
OLTP vs. OLAP
Feature                          OLTP                      OLAP
Level of data                    Detailed                  Aggregated
Amount of data per transaction   Small                     Large
Views                            Pre-defined               User-defined
Typical write operation          Update, insert, delete    Bulk insert
“Age” of data                    Current (60-90 days)      Historical (5-10 years) and also current
Number of users                  High                      Low-Med
Tables                           Flat tables               Multi-dimensional tables
Database size                    Med (10^9 B – 10^12 B)    High (10^12 B – 10^15 B)
Query optimizing                 Requires experience       Already “optimized”
Data availability                High                      Low-Med
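The difference in level of data and typical operation can be seen on a single table. The following is a minimal sketch using Python's built-in `sqlite3` module; the `sales` table and its rows are hypothetical, and a real OLAP system would not normally run on the transactional store itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales ("
    "id INTEGER PRIMARY KEY, region TEXT, year INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO sales (region, year, amount) VALUES (?, ?, ?)",
    [
        ("North", 2023, 100.0),
        ("South", 2023, 80.0),
        ("North", 2024, 110.0),
        ("South", 2024, 70.0),
    ],
)

# OLTP-style operation: touch one detailed row by its key.
conn.execute("UPDATE sales SET amount = ? WHERE id = ?", (105.0, 1))
conn.commit()

# OLAP-style operation: scan many rows and return aggregated values.
olap_rows = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(olap_rows)
```

The update reads and writes a small amount of detailed data, while the aggregate query reads every row and returns a summary.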
OLAP FASMI Test
• Fast Analysis of Shared Multidimensional
Information (FASMI) Test.
• Developed to evaluate OLAP products.
Fast: Delivers information to the user at a fairly
constant rate. Most queries are answered in under five
seconds.
Analysis: Performs basic numerical and statistical
analysis of the data, either pre-defined by an application
developer or defined ad hoc by the user.
Shared: Implements the security requirements
necessary for sharing potentially confidential data
across a large user population.
Multi-dimensional: The essential characteristic of
OLAP.
Information: Accesses all the data and information
necessary and relevant for the application, wherever
it may reside and not limited by volume.
...from the OLAP Report by Pendse and Creeth.
Data Warehouses and OLAP for Data Mining
A DW integrated with an OLAP tool can be used to perform these
important tasks:
– Information processing: querying, basic statistical analysis, and
reporting using tables, charts, and graphs.
– Analytical processing: multidimensional analysis of the data using
basic OLAP operations such as slice and dice, drilling, and pivoting.
– Data mining: finding new and interesting hidden patterns in the
data. Some DWs are furnished with, or can be integrated with, data
mining tools that support discovery of data associations,
construction of analytical models, classification and prediction,
and presentation of the results using visualization tools.
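Slice and dice, named above among the basic OLAP operations, can be sketched on a small cube stored as a dictionary. The cells, the axis order (product, region, year) and the helpers `slice_cube` and `dice_cube` are hypothetical and serve only as an illustration.

```python
# Hypothetical cube cells keyed by (product, region, year) -> sales.
CUBE = {
    ("Laptop", "North", 2023): 100,
    ("Laptop", "South", 2023): 80,
    ("Phone", "North", 2023): 60,
    ("Phone", "South", 2024): 75,
    ("Laptop", "North", 2024): 110,
    ("Tablet", "North", 2024): 40,
}


def slice_cube(cube, axis, value):
    """Slice: fix one dimension to a single value and drop it from the key."""
    return {
        key[:axis] + key[axis + 1:]: sales
        for key, sales in cube.items()
        if key[axis] == value
    }


def dice_cube(cube, allowed):
    """Dice: keep the sub-cube whose coordinates lie in the allowed sets.

    `allowed` maps an axis index to the set of values kept on that axis.
    """
    return {
        key: sales
        for key, sales in cube.items()
        if all(key[axis] in values for axis, values in allowed.items())
    }


slice_2023 = slice_cube(CUBE, 2, 2023)
diced = dice_cube(CUBE, {0: {"Laptop", "Phone"}, 1: {"North"}})
print(slice_2023)
print(diced)
```

A slice lowers the dimensionality by one, while a dice keeps all dimensions and only restricts the values on some of them.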
Examples of OLAP Tools
• Pentaho BI
• Mondrian
• IBM Cognos
• MicroStrategy
• icCube