Data Warehouse Design Practices and Methodologies: Lesson 5: Schema Integration Process

Uploaded by

yangui.rania

Information Systems Program

Module 3
Data Warehouse Design Practices
and Methodologies

Lesson 5: Schema Integration Process


Lesson Objectives
• Explain steps in the schema integration process
• Apply merge transformation to simple data
sources
• Apply flatten transformation to simple data
sources


Overview of Schema Integration
• Combine a small number of data sources
• Business intelligence requirements
– Decisions
– Measures
– Wish lists
• Data sources as raw materials to satisfy business
intelligence requirements
• Apply schema integration process as part of a
methodology for many data sources


Schema Integration Steps
1. Specify dimensions and measures
2. Determine grains and estimate fact table sizes
3. Create initial table design
4. Analyze summarizability
5. Map data sources and populate tables
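The slides do not say how fact table sizes are estimated in the second step. The sketch below shows one common approach, offered here only as an assumption: multiply the dimension cardinalities at the chosen grain and scale by the fraction of dimension combinations that actually occur. The function names, the sparsity parameter, and all figures are hypothetical.

```python
from math import prod


def estimate_fact_rows(dimension_cardinalities, sparsity):
    """Estimate fact table rows for a given grain.

    Assumed formula (not from the slides):
        rows = product of dimension cardinalities * (1 - sparsity)
    where sparsity is the fraction of dimension combinations
    that never occur in the data.
    """
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be between 0 and 1")
    return prod(dimension_cardinalities) * (1.0 - sparsity)


def estimate_fact_bytes(rows, bytes_per_row):
    """Rough storage estimate: row count times an assumed average row size."""
    return rows * bytes_per_row


# Hypothetical grain: customer x product x day
rows = estimate_fact_rows([1000, 200, 365], sparsity=0.99)
size = estimate_fact_bytes(rows, bytes_per_row=40)
print(f"about {rows:,.0f} rows, {size / 1e6:,.1f} MB")
```

A finer grain adds dimensions or raises their cardinalities, so the estimate grows quickly. That is why grain and size are considered together in the same step.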


Design Transformations
• Used in the third and fourth steps of the schema
integration process
• Merge and flatten transformations to simplify fact
table design
• History transformations for historical integrity
• Summarizability patterns for dimension and
dimension-fact problems


Merge Transformation
• Design transformation to integrate source
schemas
• Applies to source tables representing dimensions
and measures
• One source table from each data source
combines into a single data warehouse table
• Sometimes combine multiple tables from the
same data source


Merge Transformation for a Dimension Table
• Data source i: Customer
– CustId, CustName, CustAddr, CustBal
• Data source j: Shopper
– ShopNo, ShopFirstName, ShopLastName, ShopStreet,
ShopCity, ShopState, ShopZip
• Data warehouse design: CustomerDim
– CustId, CustFirstName, CustLastName, CustCity,
CustState, CustZip, CustBal
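As a minimal sketch of the merge example above, the code below maps rows from Customer and Shopper into the CustomerDim layout. Several details are assumptions made for illustration and are not given in the slides: CustName holds "First Last", CustAddr holds "street, city, state, zip" separated by commas, the two sources use non-overlapping keys, and Shopper rows get no balance. All function names are hypothetical.

```python
def parse_address(addr):
    """Split an address assumed to be 'street, city, state, zip'."""
    street, city, state, zip_code = [part.strip() for part in addr.split(",")]
    return city, state, zip_code


def customer_to_dim(row):
    """Map a Customer row (data source i) to the CustomerDim layout."""
    # Assumed rule: the first space separates first name from last name.
    first, _, last = row["CustName"].partition(" ")
    city, state, zip_code = parse_address(row["CustAddr"])
    return {
        "CustId": row["CustId"],
        "CustFirstName": first,
        "CustLastName": last,
        "CustCity": city,
        "CustState": state,
        "CustZip": zip_code,
        "CustBal": row["CustBal"],
    }


def shopper_to_dim(row):
    """Map a Shopper row (data source j) to the CustomerDim layout."""
    return {
        "CustId": row["ShopNo"],  # assumes keys do not collide across sources
        "CustFirstName": row["ShopFirstName"],
        "CustLastName": row["ShopLastName"],
        "CustCity": row["ShopCity"],
        "CustState": row["ShopState"],
        "CustZip": row["ShopZip"],
        "CustBal": None,  # Shopper has no balance column
    }


def merge_customer_dim(customers, shoppers):
    """Combine one table from each source into a single dimension table."""
    return [customer_to_dim(r) for r in customers] + [
        shopper_to_dim(r) for r in shoppers
    ]


# Hypothetical sample rows
customers = [{"CustId": "C1", "CustName": "Ann Lee",
              "CustAddr": "1 Main St, Denver, CO, 80202", "CustBal": 25.0}]
shoppers = [{"ShopNo": "S9", "ShopFirstName": "Bo", "ShopLastName": "Diaz",
             "ShopStreet": "2 Oak Ave", "ShopCity": "Boulder",
             "ShopState": "CO", "ShopZip": "80301"}]
for dim_row in merge_customer_dim(customers, shoppers):
    print(dim_row)
```

The sketch does not detect the same person appearing in both sources. A real merge would also need a matching rule for that case.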


Flatten Transformation
• Design transformation to simplify fact table
design
• Two-level table design for business transactions
– Parent entity type with transaction heading
– Child entity type with item details
• Combine the two-level design into a single fact table


Flatten Transformation Example I
• Source database design
– Registration: RegNo, RegCompDate, RegTerm, RegYear,
StdNo
– Enrollment: RegNo, OfferNo, EnrGrade
• Data warehouse design
– EnrollFact: EnrollNo, EnrollCompDate, EnrollTerm,
EnrollYear, StdNo, OfferNo
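The flatten example above can be sketched as a join of each Enrollment row (child) with its Registration row (parent) on RegNo, producing one EnrollFact row per child. The column mapping is inferred from the names (for example, EnrollCompDate from RegCompDate), and EnrollNo is generated as a simple sequence number. Both choices are assumptions rather than rules stated in the slides, and the function name is hypothetical.

```python
def flatten_enrollment(registrations, enrollments):
    """Flatten Registration (heading) and Enrollment (detail) into EnrollFact rows."""
    # Index parent rows by key so each child finds its heading directly.
    reg_by_no = {reg["RegNo"]: reg for reg in registrations}
    facts = []
    for enroll_no, enr in enumerate(enrollments, start=1):
        reg = reg_by_no[enr["RegNo"]]
        facts.append({
            "EnrollNo": enroll_no,                 # assumed surrogate key
            "EnrollCompDate": reg["RegCompDate"],  # heading column copied down
            "EnrollTerm": reg["RegTerm"],
            "EnrollYear": reg["RegYear"],
            "StdNo": reg["StdNo"],
            "OfferNo": enr["OfferNo"],             # detail column
        })
    return facts


# Hypothetical sample rows: one registration with two enrollments
registrations = [{"RegNo": 1, "RegCompDate": "2024-01-10",
                  "RegTerm": "Spring", "RegYear": 2024, "StdNo": "S100"}]
enrollments = [{"RegNo": 1, "OfferNo": 501, "EnrGrade": None},
               {"RegNo": 1, "OfferNo": 502, "EnrGrade": None}]
for fact in flatten_enrollment(registrations, enrollments):
    print(fact)
```

The heading columns repeat on every fact row. That repetition is the price of collapsing the two-level design into one table at the grain of the detail rows.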


Flatten Transformation Example II
• Source database design
– ShoppingCart: ShopCartNo, SCDate, CustNo
– ShopCartItem: ShopCartNo, ProdNo, SCIQty
• Data warehouse design
– ShopCartItemFact: ShopCartItemNo, SCIDate, CustNo,
ProdNo


Flatten Applicability
• Close time proximity for heading and details of a
business transaction
• Mostly related items and measures tracked for
heading and details of a business transaction
• Ability to capture the heading and details of a
business transaction into a data warehouse
without undue wait for completion of details


Summary
• Process provides a sequence of steps applied to
a pair of data sources
• Emphasis on integration and simplification using
transformations
• Merge transformation to combine dimensions and
measures
• Flatten transformation to simplify fact table design
by combining heading and detail tables
• Apply schema integration process on practice case
and assignment