Introduction To Data Warehousing
Data Warehousing
From DBMS to Decision Support
• DBMSs are widely used to maintain transactional
data
• Attempts to use these data for analysis,
exploration, identification of trends etc. have led
to Decision Support Systems
• Rapid growth since the mid-1970s
• DBMS vendors have answered this trend by
adding new features to existing products
• This is rarely enough
DBs for Decision Support
• Trend towards Data Warehousing
• Data Warehousing – consolidation of data
from several databases, each maintained by
an individual business unit, together with
historical and summary information
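A minimal sketch of the consolidation idea, assuming two hypothetical business-unit extracts (`north_unit`, `south_unit`) held as lists of rows. The data, field names and the monthly summary are illustrative assumptions, not taken from the slides.

```python
from collections import defaultdict

# Hypothetical extracts from two business-unit databases (illustrative data).
north_unit = [
    {"date": "2023-01-05", "product": "widget", "revenue": 120.0},
    {"date": "2023-02-11", "product": "widget", "revenue": 80.0},
]
south_unit = [
    {"date": "2023-01-20", "product": "widget", "revenue": 200.0},
    {"date": "2023-02-02", "product": "gadget", "revenue": 50.0},
]

def consolidate(sources):
    """Merge rows from several unit databases, tagging each row with its source."""
    warehouse = []
    for unit_name, rows in sources.items():
        for row in rows:
            warehouse.append({**row, "unit": unit_name})
    return warehouse

def monthly_summary(warehouse):
    """Summary information: total revenue per (month, product)."""
    totals = defaultdict(float)
    for row in warehouse:
        month = row["date"][:7]  # "YYYY-MM"
        totals[(month, row["product"])] += row["revenue"]
    return dict(totals)

warehouse = consolidate({"north": north_unit, "south": south_unit})
summary = monthly_summary(warehouse)
```

Keeping the detailed rows alongside the summary mirrors the bullet: the warehouse holds both the consolidated history and the derived totals.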
Characteristics of TPSs
Characteristic OLTP
Screens Unchanging
Orientation Records
TPS vs Decision Support
[Figure labels: Ad hoc access; Production platforms; Operational reports]
Data Extract Processing
Extract explosion
• Duplicated effort
• Multiple technologies
• Obsolete reports
• No metadata
Data Quality Issues
• No common time basis
• Different calculation algorithms
• Different levels of extraction
• Different levels of granularity
• Different data field names
• Different data field meanings
• Missing information
• No data correction rules
• No drill-down capability
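Two of the issues listed above, different field names and different levels of granularity, can be illustrated with a small sketch. The source names (`billing`, `orders`), the field mapping and the sample rows are hypothetical, chosen only for illustration.

```python
# Hypothetical mapping from each source's field names to one shared vocabulary.
FIELD_MAP = {
    "billing": {"cust_no": "customer_id", "amt": "amount", "day": "date"},
    "orders": {"customer": "customer_id", "total": "amount", "order_date": "date"},
}

def normalise(source, row):
    """Rename fields to the shared names; report missing information instead of guessing."""
    mapping = FIELD_MAP[source]
    out = {mapping[k]: v for k, v in row.items() if k in mapping}
    missing = sorted(set(mapping.values()) - set(out))
    return out, missing

def to_month(date_str):
    """Bring daily records to a common monthly granularity ("YYYY-MM")."""
    return date_str[:7]

row, missing = normalise("billing", {"cust_no": 7, "amt": 19.5, "day": "2023-03-14"})
row["month"] = to_month(row["date"])

# A row that lacks its date field is flagged rather than silently accepted.
partial, partial_missing = normalise("orders", {"customer": 7, "total": 5.0})
```

Without such shared rules, each extract would apply its own names and granularity, which is the situation the list describes.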
From Extract to Warehouse DSS
• Controlled
• Reliable
• Quality information
• Single source of
data
Data Warehousing Architecture
[Figure labels: External data sources; Extract, Clean, Transform, Load, Refresh; Metadata repository; Serves; OLAP; Visualisation; Distribution; Analyst]
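The Extract, Clean, Transform and Load steps named in the architecture figure can be sketched as a small pipeline. This is an illustration under stated assumptions: an in-memory SQLite database stands in for the warehouse, and the `sales` table, the cleaning rules and the sample rows are all hypothetical.

```python
import sqlite3

def extract(source_rows):
    """Extract: in practice this would read from an operational or external source."""
    return list(source_rows)

def clean(rows):
    """Clean: drop rows with missing amounts and strip stray whitespace."""
    return [
        {"region": r["region"].strip(), "amount": r["amount"]}
        for r in rows
        if r.get("amount") is not None
    ]

def transform(rows):
    """Transform: standardise region codes to upper case and amounts to float."""
    return [{"region": r["region"].upper(), "amount": float(r["amount"])} for r in rows]

def load(conn, rows):
    """Load: append the prepared rows to a warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (:region, :amount)", rows)
    conn.commit()

conn = sqlite3.connect(":memory:")  # stand-in for the warehouse
raw = [
    {"region": " north ", "amount": 10},
    {"region": "south", "amount": None},  # dirty row, removed by clean()
    {"region": "North", "amount": 5},
]
load(conn, transform(clean(extract(raw))))
totals = dict(conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region"))
```

A Refresh step would rerun the same pipeline on newly arrived source rows; it is omitted here for brevity.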
Tiered data warehouse
[Figure label: Mainframe]
[Figure: fact table with surrounding dimensions]
Facts: Product, Region, Time, Channel, Revenue, Expenses, Units
Product: Model, Type, Color
Region: Nation, District, Dealer
Time: Week, Year
Channel
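The labels in the figure above suggest a central fact table (Revenue, Expenses, Units) keyed by Product, Region, Time and Channel dimensions. A minimal sketch of that layout in SQLite follows. The table and column names are derived from the figure labels, but the sample rows are hypothetical, and the Time and Channel dimensions are folded into the fact table as plain columns to keep the example short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE product (product_id INTEGER PRIMARY KEY, model TEXT, type TEXT, color TEXT);
CREATE TABLE region  (region_id  INTEGER PRIMARY KEY, nation TEXT, district TEXT, dealer TEXT);
CREATE TABLE facts (
    product_id INTEGER REFERENCES product(product_id),
    region_id  INTEGER REFERENCES region(region_id),
    week INTEGER, year INTEGER, channel TEXT,
    revenue REAL, expenses REAL, units INTEGER
);
-- Hypothetical sample rows, for illustration only.
INSERT INTO product VALUES (1, 'M1', 'sedan', 'red'), (2, 'M2', 'truck', 'blue');
INSERT INTO region  VALUES (1, 'UK', 'North', 'D1'), (2, 'UK', 'South', 'D2');
INSERT INTO facts VALUES
    (1, 1, 1, 2023, 'retail', 100.0, 60.0, 2),
    (1, 2, 1, 2023, 'retail', 150.0, 90.0, 3),
    (2, 1, 2, 2023, 'fleet',  300.0, 200.0, 1);
""")

# Join the fact table to one dimension and aggregate a measure.
revenue_by_model = dict(conn.execute("""
    SELECT p.model, SUM(f.revenue)
    FROM facts f JOIN product p ON p.product_id = f.product_id
    GROUP BY p.model
"""))
```

Queries against such a schema typically join the fact table to one or more dimensions and aggregate the measures, as the last statement does.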
Multidimensional Database
[Figure: SALES and FINANCE cubes; dimension labels: Customer, Store, Model, Time, Product]
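One way to picture a multidimensional database is as a set of cells addressed by one member per dimension. The sketch below assumes a hypothetical SALES cube with three dimensions (product, store, time) and made-up unit counts. It shows two common operations, roll-up and slice, and is not a description of any particular product.

```python
from collections import defaultdict

# Hypothetical SALES cube: each cell is keyed by (product, store, time).
DIMENSIONS = ("product", "store", "time")
sales = {
    ("tv", "s1", "2023-Q1"): 10,
    ("tv", "s2", "2023-Q1"): 4,
    ("radio", "s1", "2023-Q1"): 7,
    ("tv", "s1", "2023-Q2"): 6,
}

def roll_up(cube, keep):
    """Aggregate the cube down to the dimensions named in `keep`."""
    idx = [DIMENSIONS.index(d) for d in keep]
    out = defaultdict(int)
    for key, value in cube.items():
        out[tuple(key[i] for i in idx)] += value
    return dict(out)

def slice_cube(cube, dimension, member):
    """Fix one dimension to a single member and return the matching cells."""
    i = DIMENSIONS.index(dimension)
    return {k: v for k, v in cube.items() if k[i] == member}

by_product = roll_up(sales, ["product"])
q1_only = slice_cube(sales, "time", "2023-Q1")
```

Drilling down is the reverse of `roll_up`: keeping more dimensions in `keep` returns finer-grained cells.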