DWH Architecture & Concepts
DWH Architecture & Concepts
Source 1
Canned Reports
Source 2
Ad-hoc analysis
Source 3
Summaries
/
Aggregatio
ns
Source n
Sources
Staging Data Reporting
Layer Warehouse Layer
Source 1
Canned Reports
Source 2
Ad-hoc analysis
Source 3 Summaries
Detail Data /
Aggregatio
ns
Source n Transformatio
n
Summarization
Aggregation
Sources Staging ODS Data Reporting
Layer Warehouse Layer
Source 1
Canned Reports
Source 2
Ad-hoc analysis
Source 3 Summaries
Detail Data / Cubes-
Aggregatio Conformed
ns Dimension
s
Source n Transformatio
n
Summarization
Aggregation
Sources Staging ODS Data Data Reporting
Layer Warehouse Marts Layer
Requirement Gathering
Analysis
Poor Performance
Tend to be very complex and difficult to
navigate.
Dimensional Modeling
Dimensional data modeling comprises of one or more dimension tables and fact tables.
Must identify
Business process to be supported
Grain (level of detail)
Dimensions
Facts
What is Fact?
A fact is a collection of related data items,
consisting of measures and context data.
Each fact typically represents a business item, a
business transaction, or an event that can be used
in analyzing the business or business process.
Facts are measured, “continuously valued”,
rapidly changing information. Can be calculated
and/or derived.
Types of Facts
Additive
Additive facts are facts that can be summed up through all of the
dimensions in the fact table.
Able to add the facts along all the dimensions
Eg. Retail sales in $ (or) A sales fact
Semi-Additive
Semi-Additive facts are facts that can be summed up for some of the dimensions
in the fact table, but not the others.
Eg. Daily balances fact can be summed up through the customers dimension
but not through the time dimension.
Non-Additive
Non-Additive facts cannot be summed up for any of the dimensions present in
the fact table.
Eg. %(Percentages) , Ratios etc.,
Classification of Facts
Based on the classification , there are 2 types of Fact tables.
Cumulative Facts
Snapshot Facts
Cumulative Facts - This type of fact table describes what has happened over period
of time.
Eg. Additive Facts , Total sales by product by store by day or week or month or year .
Snapshot Facts – This type of fact table describes the state of things in a particular
instance of time.
Conformed Dimension
Junk Dimension
Degenerated Dimension
Dimensions Types
Conformed Dimension
Junk Dimension
Degenerated Dimension
• Dependent data marts are marts that are fed directly by the DW,
sometimes supplemented with other feeds, such as external data.
• Independent data marts are marts that are fed directly by external
sources and do not use the DW.
• Embedded data marts are marts that are stored within the central DW.
They can be stored relationally as files or cubes.
Operational Data Store (ODS)
An ODS
Characteristics
Integrated
Subject-oriented
Volatile (including update)
Current valued
Types of Schemas
- Star schema
- Snowflake schema