0% found this document useful (0 votes)
10 views15 pages

Data Warehouse Design Practices and Methodologies

Uploaded by

yangui.rania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views15 pages

Data Warehouse Design Practices and Methodologies

Uploaded by

yangui.rania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 15

Information Systems Program

Module 3
Data Warehouse Design Practices
and Methodologies

Lesson 7: Mini Case for Data Warehouse Design


Lesson Objectives
• Practice with data warehouse design problems
• Prepare for data warehouse design assignment
• Gain insights about analyzing data sources

Information Systems Program


Mini Case on Schema Integration
• Apply and integrate skills from module 3 lessons
• Acquire new skills
• Data source specifications, business needs, and
sample data

Information Systems Program


Design
Requirements
Identify
Create table Map data
Specify summarizabilit
Determine design using sources and
dimensions y problems
grain transformation populate
and measures and suggest
s tables
resolutions

Information Systems Program


Data Sources
Purchase Database
Supplier
Purchase
SuppNo
SuppName PurchNo
SuppPhone PurchFrom PurchDate
SuppEmail PurchPayMethod
SuppDisc ProdDelDate

PrefSupp
Contains

Product
ProdNo PurchLine
ProdName
ProdQOH BoughtOn PLQty
PLUnitCost
ProdPrice
ProdNextShipDate

Purchases Spreadsheet for Custom Products


ProdCode ProdDesc Supp Qty Stock Unit Price PurDate Amount
CPC1 Souvenir 1 Omart 20 1 $2.00 13-Feb-2021 $40.00
CPC2 Souvenir 2 Smart 10 2 $3.50 14-Feb-2021 $35.00
CPC3 Souvenir 3 Pmart 20 0 $1.50 11-Feb-2021 $30.00 5

Information Systems Program


Business Intelligence Needs
• Track inventory over time by product and supplier
• Calculate inventory measures over time using
quantity on hand and value
• Report on additions to inventory (purchases)
• No reporting on deletions to inventory (orders)

Information Systems Program


Important Design Decisions
• Grain determination and relative size calculations
• Simplification using design transformations
• Mappings from source data to populate data
warehouse tables

Information Systems Program


Grain Size Calculations
Fact table size
• Use sizes of dimensions and
estimate sparsity
• Fill Ratio: 1 - Sparsity
• Fact Table Size: Product of
dimension sizes times fill ratio

Sparsity
• Match fact table to source tables
• Use sizes of dimensions and
source table
• Fill Ratio: Source table size
divided by product of dimension
table sizes
• Sparsity: 1 – Fill Ratio

Information Systems Program


Transformations for Table Design
Flatten Merge
• Purchase and • Merge Supplier table
PurchLine tables and Supp column
• Products grouped by • Merge Product table
date columns and
(PurchDate and ProdCode/ProdDesc
DelDate) columns
• Merge flattened table
with spreadsheet rows

Information Systems Program


Purchasing Star Schema

Supplier
SuppNo
SuppName
SuppPhone
SuppEmail

Supplies

Calendar InvFact
PurchDate
CalId InvFactNo Product
CalDay InvFactQty
ProdNo
CalMonth DelDate InvFactAmt ProdOf ProdName
CalYear InvFactUnitCost
ProdType
InvFactQOH
InvFactProdPrice
InvFactSuppDisc

10

Information Systems Program


ISMG6480
Summarizability Problems
• Incomplete fact-dimension relationship (delivery
date)
• Potential problems for hierarchical product
dimension (brand, product type)
• Missing values for supplier and product
dimensions (product type, supplier phone, and
email)

11

Information Systems Program


ISMG6480
Mappings from Source Data

• Source column
Association matching
s • Conversions

• Generated PK
values
Additions • Default values
• Derived values

12

Information Systems Program


Schema Integration Assignment
• Like the practice case
• Artifacts
– Dimensional design with dimensions and members
– Grain analysis
– ERD integrating data sources with indication of design
transformations
– Summarizability problems and resolutions
– Mapping from data sources
– Population of DW tables using sample data from data
sources
13

Information Systems Program


Summary
• Mini case study to help apply and integrate
concepts and skills
• Case study requirements and data sources
• Concept extensions
– Grain size
– Mapping source data to data warehouse

14

Information Systems Program


Grain Size Determination
• Determine sparsity
– Given dimension cardinalities and source table
cardinality
– Associate fact table to tables of data source
– 1 minus source table cardinality divided by product of
dimension cardinalities
• Determine fact table size
– Given dimension cardinalities and sparsity estimate
– Product of dimension cardinalities
– Reduce by sparsity
15

Information Systems Program

You might also like