Data Warehouse
4 MAIN CHARACTERISTICS OF DATA WAREHOUSING
1. Subject oriented
• Organized around major subjects, such as sales
• Contains only information relevant for decision support
• Focuses on the modeling and analysis of data for decision
makers, not on daily operations or transaction processing
• For example, to learn more about your company's sales, you
can build a warehouse that concentrates on sales.
Using this warehouse, you can answer questions like "Who was
our best customer for this item last year?" This ability to define
a data warehouse by subject matter (sales, in this case) makes
the data warehouse subject oriented (http://docs.oracle.com/).
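The "best customer for this item last year" question can be sketched as a single aggregate query over a sales-subject table. This is a minimal illustration only: the `sales` table, its column names, and the sample rows are assumptions made up for the example, not part of any real schema.

```python
import sqlite3

# Hypothetical sales-subject table; table and column names are assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (customer TEXT, item TEXT, sale_year INTEGER, amount REAL)"
)
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        ("Acme", "widget", 2023, 500.0),
        ("Acme", "widget", 2023, 250.0),
        ("Bolt", "widget", 2023, 600.0),
        ("Bolt", "gadget", 2023, 900.0),
        ("Acme", "widget", 2022, 1000.0),
    ],
)


def best_customer(conn, item, year):
    """Return (customer, total) with the highest total sales of one item in one year."""
    return conn.execute(
        """
        SELECT customer, SUM(amount) AS total
        FROM sales
        WHERE item = ? AND sale_year = ?
        GROUP BY customer
        ORDER BY total DESC
        LIMIT 1
        """,
        (item, year),
    ).fetchone()


result = best_customer(conn, "widget", 2023)
```

Because the table holds only sales facts, the question reduces to one filter, one grouping, and one sort. The function returns `None` when no sales match the item and year.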
2. Integrated
• Constructed by integrating multiple, varied data sources
• Must place data from different sources into a consistent
format; to do so, it must resolve naming conflicts and
discrepancies
• Data cleaning and data integration techniques are applied
• Ensures consistency in naming conventions among different
data sources
• When data is moved to the warehouse, it is converted to
the consistent format
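The conversion step described above can be sketched as one small mapping function per source. The two sources, their field names, and their date and currency formats are all hypothetical, chosen only to show a naming conflict and a format discrepancy being resolved.

```python
from datetime import datetime

# Hypothetical records from two operational sources (names and formats are assumptions).
crm_rows = [{"cust_name": "Acme", "sale_dt": "2023-03-01", "amt_usd": "120.50"}]
billing_rows = [{"CustomerName": "Bolt", "date": "01/04/2023", "amount_cents": 9900}]


def from_crm(row):
    """Map a CRM record onto the warehouse's consistent field names and types."""
    return {
        "customer": row["cust_name"],
        "sale_date": datetime.strptime(row["sale_dt"], "%Y-%m-%d").date(),
        "amount": float(row["amt_usd"]),
    }


def from_billing(row):
    """Map a billing record: different names, day-first dates, amounts in cents."""
    return {
        "customer": row["CustomerName"],
        "sale_date": datetime.strptime(row["date"], "%d/%m/%Y").date(),
        "amount": row["amount_cents"] / 100,
    }


# After conversion, every row shares the same keys, date type, and currency unit.
warehouse_rows = [from_crm(r) for r in crm_rows] + [from_billing(r) for r in billing_rows]
```

Keeping one converter per source confines each source's quirks to a single place, so adding a new source does not touch the existing mappings.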
4. Non-volatile
• After data are entered into a data warehouse, users cannot
change or update them.
• Operational updates of data do not occur in the data
warehouse environment
• Does not require transaction processing, recovery, and
concurrency control mechanisms
• Requires only two operations in data accessing: initial
loading of data and access of data
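The two-operation model can be illustrated with a toy store that offers loading and read access and nothing else. The class name `NonVolatileStore` and its methods are hypothetical; a real warehouse enforces this through its architecture and permissions rather than a Python class.

```python
class NonVolatileStore:
    """Toy store exposing only the two operations: load data and read data."""

    def __init__(self):
        self._rows = []

    def load(self, rows):
        # Rows are stored as tuples, so a stored record cannot be modified in place.
        self._rows.extend(tuple(row) for row in rows)

    def query(self, predicate=lambda row: True):
        # Read access returns a new list; callers never receive the internal one.
        return [row for row in self._rows if predicate(row)]


store = NonVolatileStore()
store.load([("Acme", 2023, 750.0), ("Bolt", 2023, 600.0)])
large_sales = store.query(lambda row: row[2] > 700)
```

There is deliberately no update or delete method, which mirrors why the bullet above says transaction processing, recovery, and concurrency control are not needed.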
SUMMARY OF DATA WAREHOUSE
Data mart
• A data mart is a smaller data warehouse (DW) designed around one
problem, organizational function, topic, or other focus area, such as
a particular subject or department.
• It is a subset of a data warehouse, or a departmental data warehouse.
• Dependent data mart
  • A subset that is created directly from a data warehouse
  • Ensures that the end user is viewing the same version of the data
  that is accessed by all other data warehouse users
• Independent data mart
  • A small data warehouse designed for a strategic business unit or a
  department
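One way to picture a dependent data mart is as a view defined directly over the warehouse table. This is only a sketch under assumptions: the `warehouse_sales` table, the `marketing_mart` view, and the sample rows are invented for illustration, and real data marts are often materialized rather than defined as views.

```python
import sqlite3

# Hypothetical warehouse table; names and rows are assumptions for illustration.
dw = sqlite3.connect(":memory:")
dw.execute("CREATE TABLE warehouse_sales (department TEXT, item TEXT, amount REAL)")
dw.executemany(
    "INSERT INTO warehouse_sales VALUES (?, ?, ?)",
    [
        ("marketing", "banner", 300.0),
        ("marketing", "flyer", 120.0),
        ("finance", "audit", 900.0),
    ],
)

# Dependent data mart: a subset derived directly from the warehouse, so its
# users read the same version of the data as all other warehouse users.
dw.execute(
    """
    CREATE VIEW marketing_mart AS
    SELECT item, amount FROM warehouse_sales WHERE department = 'marketing'
    """
)

mart_rows = dw.execute(
    "SELECT item, amount FROM marketing_mart ORDER BY item"
).fetchall()
```

An independent data mart, by contrast, would be loaded from its own sources, so nothing guarantees that it agrees with other marts.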
DATA WAREHOUSING - CONCEPT
—or—
3b. Data are loaded into data marts
4b. The data marts are consolidated into the EDW
5. Analyses are performed as needed
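Steps 3b, 4b, and 5 above can be sketched as three small functions. Only these steps survive in this section, so the earlier steps are not modeled; the function names, the `department` and `amount` fields, and the sample rows are all assumptions made for illustration.

```python
def load_marts(extracted_rows):
    """Step 3b: load rows into one data mart per department."""
    marts = {}
    for row in extracted_rows:
        marts.setdefault(row["department"], []).append(row)
    return marts


def consolidate(marts):
    """Step 4b: consolidate the data marts into a single EDW row list."""
    edw = []
    for department in sorted(marts):
        edw.extend(marts[department])
    return edw


def analyze(edw):
    """Step 5: an example analysis, total amount per department."""
    totals = {}
    for row in edw:
        totals[row["department"]] = totals.get(row["department"], 0) + row["amount"]
    return totals


rows = [
    {"department": "sales", "amount": 100},
    {"department": "finance", "amount": 40},
    {"department": "sales", "amount": 60},
]
marts = load_marts(rows)
edw = consolidate(marts)
totals = analyze(edw)
```

In practice the consolidation step is where conflicts between marts surface, since each mart may have been built with its own naming and formats.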
DATA WAREHOUSING - PROCESS OVERVIEW
First approach
Pros:
• Easy to build organizationally
• Easy to build technologically
Cons:
• Enterprise-wide view unavailable
• Redundant data costs
• High ETL costs
• High DBA costs
Second approach
Pros:
• Business enterprise view
• Design consistency
• Data reusability
Cons:
• Requires corporate leadership and vision
EVENTUALLY IT CAN BE THIS …
DATA WAREHOUSE DEVELOPMENT
Describe the major similarities and differences between the Inmon and
Kimball data warehouse development approaches.
• Similarities: Both methods can produce an enterprise data warehouse and
subset data marts.
• Differences: