Data Warehouse Lec-6
Data Warehouse Lec-6
• Requirements for DW
– Information Package
• Dimensional Modeling
• Data Design ‐ ER vs DM
2
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
BITS Pilani, Pilani Campus
•The grain or level of granularity for each of our fact table (s).
•The number of conformed (cross‐subject area) dimensions.
•Overall size for each dimension and fact table.
•Initial number and type of aggregates.
•The types and number of the predictable queries to be run
against the model and their frequency.
•Con‐Currency and security regarding data for each dimension
and fact.
•Validation in that each attribute in our model is referenced or
used by a process object.
•A view into what will be required in terms of source system
data to populate our model.
•The types of analytic tools to provide the users access to the
information as contained in our model.
BITS Pilani, Pilani Campus
Conduct Business Requirements Analysis
1) Schedule end‐user requirements gathering sessions.
a) Conduct end‐user requirements sessions.
i) Determine all tactical reporting requirements.
ii) Determine all strategic analysis requirements.
28
BITS Pilani, Pilani Campus
Dimensional Modeling: Vocabulary
• Measures or facts
• Facts are “numeric” & “additive”
• For example; Sale Amount, Sale Units
• Factors or dimensions
• Star Schemas
• Snowflake & Starflake Schemas
FK FK
Product Location
Dimension Dimension
Sales Fact
Table
FK FK
Time Promotion
Dimension Dimension
30
BITS Pilani, Pilani Campus
Dimensional Modeling
• Fact tables
– Contain the metrics resulting from a business process or
measurement event, such as the sales ordering process or
service call event
– Dimensional models should be structured around business
processes and their associated data sources,
• This results in ability to design identical, consistent views of data
for all observers, regardless of which business unit they belong to,
which goes a long way toward eliminating misunderstandings at
business meetings
– Fact table’s granularity should be set at the lowest, most
atomic level captured by the business process
• This allows for maximum flexibility and extensibility.
– Business users will be able to ask constantly changing, free‐ranging,
and very precise questions.
• Dimensional table
– Contain the descriptive attributes and characteristics
associated with specific, tangible measurement events,
such as the customer, product, or sales representative
associated with an order being placed.
– Dimension attributes are used for constraining, grouping,
or labeling in a query.
– Hierarchical many‐to‐one relationships are denormalized
into single dimension tables.
• A fact table
• Multiple dimension tables
• Example: Assume this schema to be of a retail‐chain. Fact will be
revenue (money). How do you want to see data is called a dimension.