Jukic Chapter07
Warehousing Concepts
Database Systems - Introduction to Databases and Data Warehouses
Business Intelligence and Analytics
[Figure labels: Packaged application; Transient data source; Legacy system; Other internal applications; Extract, Transform, Cleanse, Load; Data warehouse; Data mart]
Technical Differences
Operational database                  Data warehouse
Small amounts used in a process       Large amounts used in a process
High frequency of access              Low/modest frequency of access
Can be updated                        Read (and append) only
Non-redundant                         Redundancy not an issue
Functional Differences
Operational database                  Data warehouse
Used by all types of employees        Used by a narrower set of users
for tactical purposes                 for decision making
Application oriented                  Subject oriented
Example: an application-oriented database serving the Vitality Health Club Visits and Payments Application
• Subject-oriented
– Refers to the fundamental difference in purpose between an
operational database system and a data warehouse
– Operational database system: developed to support a specific
business operation
– Data warehouse: developed to analyze specific business
subject areas
• Historical
– Refers to the larger time horizon of a DW compared to operational
databases
– Operational databases typically hold 1 year of data
– A DW holds many years of data
Database Systems - Jukić, Vrbsky, Nestorov Ch. 7, slide 18
THE DATA WAREHOUSE (DW) DEFINITION
• Time variant
– A data warehouse contains slices or snapshots of data from
different periods of time across the time horizon
– With the data slices, the user can create reports for various
periods of time within the time horizon
• Detailed and/or summarized data
– A DW may include detailed data or summary data or both
– A DW that contains data at the finest level of detail is the most
powerful
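The time-variant and detailed/summarized properties above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the row layout (period, member, payment), the sample values, and the function names `summarize_by_period` and `report_for` are hypothetical and do not come from the slides.

```python
from collections import defaultdict

# Hypothetical detailed rows: (snapshot period, member id, payment amount).
# Each period label marks one time slice within the warehouse's time horizon.
detailed_rows = [
    ("2022-Q4", "M1", 50.0),
    ("2022-Q4", "M2", 75.0),
    ("2023-Q1", "M1", 50.0),
    ("2023-Q1", "M3", 120.0),
]

def summarize_by_period(rows):
    """Roll detailed rows up to one total per time slice (summary data)."""
    totals = defaultdict(float)
    for period, _member, amount in rows:
        totals[period] += amount
    return dict(totals)

def report_for(rows, periods):
    """Keep only the rows that belong to the requested time slices."""
    wanted = set(periods)
    return [row for row in rows if row[0] in wanted]

summary = summarize_by_period(detailed_rows)
q1_only = report_for(detailed_rows, ["2023-Q1"])
```

Detailed rows can always be rolled up into a summary, but the summary alone cannot be broken back down into per-member rows, which is one way to read the claim that the finest level of detail is the most powerful.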
Example: core components of a data warehousing system
• Source systems
– Operational databases and repositories that provide
analytically useful information in DW subject areas
– Each operational data store has two purposes:
• Original operational purpose
• Source system for the data warehouse
– Source systems can include external data sources
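As a rough sketch of how source systems might feed a warehouse, the Python below runs extract, transform, cleanse, and load steps over one internal and one external source. Everything here is assumed for illustration: the two sample sources, their field names, and the cleansing rule (drop rows whose fee is not numeric) are hypothetical, not taken from the slides.

```python
# Hypothetical internal operational source (still serves its original purpose).
operational_visits = [
    {"member": " alice ", "fee": "15.00"},
    {"member": "Bob", "fee": "n/a"},  # dirty value, removed during cleansing
]

# Hypothetical external source supplying extra analytical context.
external_demographics = [
    {"member": "Alice", "region": "North"},
    {"member": "Bob", "region": "South"},
]

def extract(source):
    """Copy rows out of a source system without modifying the source."""
    return [dict(row) for row in source]

def transform(rows):
    """Standardize member names and convert fees to numbers where possible."""
    out = []
    for row in rows:
        try:
            fee = float(row["fee"])
        except ValueError:
            fee = None  # mark as unusable; cleanse() decides what to do
        out.append({"member": row["member"].strip().title(), "fee": fee})
    return out

def cleanse(rows):
    """Drop rows whose fee could not be parsed (assumed cleansing rule)."""
    return [row for row in rows if row["fee"] is not None]

def load(warehouse, rows, regions):
    """Append cleansed rows to the warehouse, enriched with external data."""
    for row in rows:
        warehouse.append({**row, "region": regions.get(row["member"])})
    return warehouse

regions = {r["member"]: r["region"] for r in extract(external_demographics)}
warehouse = load([], cleanse(transform(extract(operational_visits))), regions)
```

Because `extract` copies rows instead of editing them in place, the operational source keeps serving its original purpose while also acting as a source system for the warehouse.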
Example: a data warehouse with internal and external source systems
Example: a data warehouse with front-end applications