Day 1.4 Data Warehousing
Introduction
• Data warehousing’s goal is to make the right
information available at the right time
• Data warehousing is both a data store (e.g., a database of
some sort) and a process for bringing together
disparate data from throughout an organization for
decision-support purposes
• Relational database management systems (RDBMS),
such as Oracle, DB2, Sybase, Informix, Focus, SQL
Server, etc. are often used for data warehousing
Definitions of a Data Warehouse
[Figure: source data (Customers, Orders, Transactions, Vendors, etc.) is copied, organized, and summarized into the Data Warehouse, which feeds several Data Marts; each Data Mart delivers decision-support information.]
Data Miners:
• “Farmers” – they know
• “Explorers” – unpredictable
Generic Architecture of Data Warehousing
• Source systems
• Extraction, (Clean), Transformation, & Load (ETL)
• Central repository
• Metadata repository
• Data marts
• Operational feedback
• End users (business)
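The flow through these components (source systems, ETL, central repository, data marts) can be sketched in a few lines. This is only a minimal illustration, not a real warehouse: the source records, field names, the cleaning rule, and the function names extract, transform, load, and build_data_mart are all hypothetical, and an in-memory SQLite database stands in for the central repository.

```python
import sqlite3

# Hypothetical rows from two source systems; all names and values are invented.
ORDERS_SYSTEM = [
    {"customer": " Acme ", "region": "north", "amount": "120.50"},
    {"customer": "Globex", "region": "SOUTH", "amount": "80"},
    {"customer": "", "region": "north", "amount": "15"},  # dirty row: no customer
]
BILLING_SYSTEM = [
    {"customer": "Initech", "region": "South", "amount": "42.25"},
]


def extract():
    """Extraction: pull raw rows from every source system."""
    return ORDERS_SYSTEM + BILLING_SYSTEM


def transform(rows):
    """Cleaning and transformation: drop unusable rows, normalize the rest."""
    cleaned = []
    for row in rows:
        customer = row["customer"].strip()
        if not customer:
            continue  # cleaning step: skip rows without a customer
        region = row["region"].strip().title()
        cleaned.append((customer, region, float(row["amount"])))
    return cleaned


def load(conn, rows):
    """Load: write the cleaned rows into the central repository."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


def build_data_mart(conn):
    """Data mart: a summarized, subject-specific view of the repository."""
    cur = conn.execute(
        "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
    )
    return dict(cur.fetchall())


warehouse = sqlite3.connect(":memory:")  # stands in for the central repository
load(warehouse, transform(extract()))
sales_by_region = build_data_mart(warehouse)
print(sales_by_region)
```

The metadata repository and operational feedback are left out of the sketch; in practice they would record where each column came from and push findings back to the source systems.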
Where does OLAP fit in?
OLAP Overview
• Interactive, exploratory analysis of
multidimensional data to discover patterns
[Figure: data cube with axes gender, age, and accidents]
OLAP Architecture
Server Options
• Single processor
• Symmetric
multiprocessor (SMP)
• Massively parallel
processor (MPP)
OLAP Server Options
• ROLAP (Relational)
• MOLAP (Multidimensional)
• HOLAP (Hybrid)
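The three server options differ mainly in where the aggregation happens. As a rough sketch under simplifying assumptions: a ROLAP server keeps the facts in relational tables and aggregates with SQL at query time, a MOLAP server answers from a pre-aggregated multidimensional structure, and a HOLAP server mixes the two. The fact rows, the "*" marker for "all members", and the function names below are hypothetical.

```python
import sqlite3
from collections import defaultdict

# Hypothetical fact rows: (region, year, units). Values are invented.
FACTS = [
    ("North", 1996, 10),
    ("North", 1997, 14),
    ("South", 1996, 7),
    ("South", 1997, 9),
]

# ROLAP style: keep facts in relational tables and aggregate with SQL per query.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE facts (region TEXT, year INTEGER, units INTEGER)")
conn.executemany("INSERT INTO facts VALUES (?, ?, ?)", FACTS)


def rolap_units(region):
    row = conn.execute(
        "SELECT SUM(units) FROM facts WHERE region = ?", (region,)
    ).fetchone()
    return row[0]


# MOLAP style: pre-aggregate every combination into a structure keyed by
# coordinates. "*" is a hypothetical marker for "all members of this dimension".
cube = defaultdict(int)
for region, year, units in FACTS:
    for r in (region, "*"):
        for y in (year, "*"):
            cube[(r, y)] += units


def molap_units(region):
    return cube[(region, "*")]  # a lookup; no aggregation at query time


# HOLAP style: summaries come from the cube, detail from the relational store.
def holap_units(region, year=None):
    if year is None:
        return molap_units(region)
    row = conn.execute(
        "SELECT SUM(units) FROM facts WHERE region = ? AND year = ?",
        (region, year),
    ).fetchone()
    return row[0]


print(rolap_units("North"), molap_units("North"), holap_units("North", 1997))
```

The trade-off the sketch hints at: the MOLAP structure answers summary queries with a single lookup but must be rebuilt when facts change, while the ROLAP path always reads current rows at the cost of aggregating on every query.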
OLAP – Online Analytical Processing
• A definition:
The Cube
OLAP Cube - 5
Three-Dimensional Cube
Page: Region (North)
Columns: Sales (Red, Blue, Total)
Display Rows: Year (1996, 1997, Total)
          Red     Blue    Total
1996      blob    blob
1997
Total
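A three-dimensional cube like this one can be pictured as a lookup keyed by (region, year, color), where choosing a page fixes the region and leaves a year-by-color grid with totals. The sketch below assumes exactly that layout; the slide gives no numbers, so every value is invented, and the South region and the function name page are hypothetical.

```python
# Hypothetical sales cube keyed by (region, year, color); the slide shows no
# figures ("blob"), so every value here is invented for illustration.
SALES = {
    ("North", 1996, "Red"): 5, ("North", 1996, "Blue"): 3,
    ("North", 1997, "Red"): 6, ("North", 1997, "Blue"): 4,
    ("South", 1996, "Red"): 2, ("South", 1996, "Blue"): 8,
    ("South", 1997, "Red"): 1, ("South", 1997, "Blue"): 9,
}
YEARS = [1996, 1997]
COLORS = ["Red", "Blue"]


def page(region):
    """Return the 2-D slice for one page (region), with Total row and column."""
    grid = {}
    for year in YEARS:
        row = {color: SALES[(region, year, color)] for color in COLORS}
        row["Total"] = sum(row.values())  # row total across colors
        grid[year] = row
    # Column totals across years, including the grand total.
    grid["Total"] = {
        col: sum(grid[year][col] for year in YEARS) for col in COLORS + ["Total"]
    }
    return grid


north = page("North")
for label, row in north.items():
    print(label, row)
```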
OLAP Cube - 6
Six-Dimensional Cube
Dimension           Example
Brand               Mt. Airy
Store               Atlanta
Customer segment    Business
Product group       Desks
Period              January
Variable            Units sold
Rotation (Pivot Table)
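Rotation can be thought of as swapping which dimensions sit on the rows and which on the columns, without changing the underlying data. A minimal sketch, assuming facts stored as simple records; the dimension names, the values, and the function name pivot are hypothetical.

```python
from collections import defaultdict

# Hypothetical facts; dimension names and values are invented.
FACTS = [
    {"region": "North", "year": 1996, "sales": 8},
    {"region": "North", "year": 1997, "sales": 10},
    {"region": "South", "year": 1996, "sales": 10},
    {"region": "South", "year": 1997, "sales": 10},
]


def pivot(facts, rows, cols, measure="sales"):
    """Lay the facts out as table[row_member][col_member] = summed measure."""
    table = defaultdict(lambda: defaultdict(int))
    for fact in facts:
        table[fact[rows]][fact[cols]] += fact[measure]
    return {r: dict(c) for r, c in table.items()}


by_region = pivot(FACTS, rows="region", cols="year")
rotated = pivot(FACTS, rows="year", cols="region")  # same data, axes swapped
print(by_region)
print(rotated)
```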
Drill Down
Region           Sales variance
Africa           105%
Asia             57%
Europe           122%
North America    97%
Pacific          85%
South America    163%
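Drilling down means moving from a summary row such as Asia to the detail beneath it. The sketch below assumes a region-to-country hierarchy and assumes that variance means actual sales divided by planned sales; neither is stated on the slide. The country rows are invented, chosen only so that the Asia and Europe totals echo the table, and the function name variance_by is hypothetical.

```python
# Hypothetical detail rows: (region, country, actual, planned). All figures are
# invented, and "variance = actual / planned" is an assumed definition.
DETAIL = [
    ("Asia", "Japan", 30, 50),
    ("Asia", "India", 27, 50),
    ("Europe", "France", 61, 50),
    ("Europe", "Germany", 61, 50),
]


def variance_by(level, rows, within=None):
    """Summarize variance (in percent) at 'region' or 'country' level."""
    index = {"region": 0, "country": 1}[level]
    totals = {}
    for row in rows:
        if within is not None and row[0] != within:
            continue  # keep only the region being drilled into
        actual, planned = totals.get(row[index], (0, 0))
        totals[row[index]] = (actual + row[2], planned + row[3])
    return {key: round(100 * a / p) for key, (a, p) in totals.items()}


summary = variance_by("region", DETAIL)                # top-level view
asia = variance_by("country", DETAIL, within="Asia")   # drill down into Asia
print(summary)
print(asia)
```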