Module-3 Data Warehousing
Module-3 Data Warehousing
Warehousing
MODULE 2: Introduction Data Warehouse
1
Introduction to Data Warehouse
2
What is Data
Warehouse?
3
Data Warehouse—Subject-
Oriented
4
Data Warehouse—
Integrated
5
Data Warehouse—Time
Variant
6
Data Warehouse—
Nonvolatile
7
Data Warehouse vs.
Heterogeneous DBMS
8
Heterogeneous
DBMS
9
Why Separate Data
Warehouse?
10
From Tables and
Spreadsheets to Data
Cubes
A data warehouse is based on a multidimensional data model
which views data in the form of a data cube
A data cube, such as sales, allows data to be modeled and
viewed in multiple dimensions
Dimension tables, such as item (item_name, brand, type), or
time(day, week, month, quarter, year)
Fact table contains measures (such as dollars_sold) and keys
to each of the related dimension tables
In data warehousing literature, an n-D base cube is called a base
cuboid. The top most 0-D cuboid, which holds the highest-level
of summarization, is called the apex cuboid. The lattice of
cuboids forms a data cube.
11
Multidimensional
Data Model
12
Multidimensional
Data Model
2D
view
13
Multidimensional
Data Model
3D view
14
Multidimensional
Data Model
4D view
15
Cube: A Lattice of Cuboids
16
Conceptual Modeling
of Data Warehouses
17
Example:
Star Schema
18
Example:
Snowflake Schema
19
Example:
Fact Constellation
20
Cube: Three
Categories
21
Hierarchy:
Dimension (location)
22
Warehouses &
Hierarchies
23
Warehouses &
Hierarchies
24
A Sample Data Cube
Date
1Qtr 2Qtr 3Qtr 4Qtr sum
t
uc
TV
od
PC U.S.A
Pr
VCR
Country
sum
Canada
Mexico
sum
25
Typical OLAP
Operations
26
Roll Up
27
Drill
Down
28
Slice
29
Dice
30
Pivot
31
A Star-Net Query
Model
32
Warehouse: A Business
Analysis Framework
33
Data Warehouse
Design Process
34
Data Warehouse: A Multi-Tiered Architecture
35
Three Data
Warehouse Models
Enterprise warehouse
collects all of the information about subjects spanning the
entire organization
Data Mart
a subset of corporate-wide data that is of value to a specific
groups of users. Its scope is confined to specific, selected
groups, such as marketing data mart
Independent vs. dependent (directly from warehouse) data mart
Virtual warehouse
A set of views over operational databases
Only some of the possible summary views may be
materialized
36
Development:
A Recommended Approach
37
Back-End Tools and
Utilities
38
Metadata Repository
39
OLAP Server
Architectures
40
ROLAP Datastore
(Example)
41
Data Warehouse
Usage
42
OLAP & OLAM
43
Integrated OLAM & OLAP
44