Teradata Data Modeling Reference PDF
Debbie Smith
Data Warehouse Consultant
Teradata Global Sales Support
Executive Summary
Introduction
So What is an EDW?
Data Infrastructure
Physical Models
Surrogate Keys
Changing Dimensions
Summary
Appendix B A Comparison
Appendix C Glossary
Appendix D Endnotes
Appendix E References
EB-2406
Executive Summary
The data model choice for the data warehouse is often a matter
of great controversy. The desire is to offer a self-service type of
environment that allows business users easy access with acceptable
response times. Response time also includes the time required
between the conception of a new application and the delivery of
that application. What data model do you employ to provide ease
of use for the business user while still being able to address current
and future needs of the data warehouse in terms of updating,
expansion, availability, and management? This paper will provide
an overview of popular data modeling approaches and the Teradata
Corporation position regarding data modeling.
a dimensional model.
12 Rules
So What is an EDW?
[Diagram: the entity Customer (attributes CustomerName, CustomerAddr) is joined by the relationship "Places" (a business rule) to the entity TravelRequest; the sample values shown are Boston to New York.]
Figure 1. Example ER Diagram
Dimensional Modeling
Dimensional modeling is another logical
design method used to organize data for
functional groups of users or business
[Figure: hierarchies for the Advertising, Fiscal, Calendar, Product, and Geography dimensions, shown with Sales. Levels shown include Ad Year, Ad Period, Ad Week; Year, Quarter, Period, Month, Week, Day; Dept, Minor Dept, Category, Sub Category, SKU, UPC; Region, District, Store.]
[Figure: model types placed on a scale running from Normalized to Denormalized: Snowflake, Star, Flattened.]
ness demands.
Physical Models
data subjects.
Denormalized Model
Item     Date       Sales
100012   01102001   10.00
300012   02122001    3.00
200012   01152001    2.50
100012   03042001   15.00
Item     Jan Sales   Feb Sales   Mar Sales   Apr Sales
100012   345.00      450.00      326.50      245.90
200012   456.60      376.50      210.00      390.00
300012   254.00      112.00      310.00      295.00
400012   510.00      610.00      590.00      545.00
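The relationship between the two layouts above can be sketched in a few lines of Python. This is a minimal illustration, not the paper's method: it assumes the Date values are in MMDDYYYY form, and the names detail_rows and denormalize_by_month are hypothetical. It rolls the item/date/sales rows up into one row per item with one column per month.

```python
from collections import defaultdict
from decimal import Decimal

# Rows in the item/date/sales layout: (item, date, sales).
# Reading the date strings as MMDDYYYY is an assumption about the sample data.
detail_rows = [
    ("100012", "01102001", Decimal("10.00")),
    ("300012", "02122001", Decimal("3.00")),
    ("200012", "01152001", Decimal("2.50")),
    ("100012", "03042001", Decimal("15.00")),
]

MONTHS = ["Jan", "Feb", "Mar", "Apr"]


def denormalize_by_month(rows):
    """Aggregate detail rows into one row per item with one column per month."""
    totals = defaultdict(lambda: {m: Decimal("0.00") for m in MONTHS})
    for item, date, sales in rows:
        month_index = int(date[:2]) - 1  # first two digits = month (assumed)
        if month_index < len(MONTHS):
            totals[item][MONTHS[month_index]] += sales
    return dict(totals)


monthly = denormalize_by_month(detail_rows)
for item in sorted(monthly):
    print(item, [str(monthly[item][m]) for m in MONTHS])
```

The monthly form answers "sales per item per month" without any further aggregation, but the individual dates are no longer available once the rows have been rolled up.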
dimensionally structured.
data model.
complex schema.
Impact of Information Delivery Tools
Impact of Extraction, Transformation, and Loading (ETL) Tools
SKU    Catg
1001   01
1002   02
1003   03

SKU    Catg   Updated
1001   03     20021029
1001   01     20010101
1002   02     20010101
1003   01     20020915
1003   03     20010101

SKU    Catg   Prev Catg
1001   03     01
1001   01     15
1001   15     20
1002   02     02
1003   01     03
1003   03     12
1003   12     05
1003   05     03
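The three layouts above appear to show three ways of recording a category change for a SKU: overwriting the value, adding a dated row, and storing the previous value next to the new one. The Python sketch below illustrates each under that reading. The function names and the in-memory dict and list structures are hypothetical, and the mapping of functions to layouts is an interpretation rather than something the surviving text states.

```python
# Three illustrative ways to record that SKU 1001 moves to category 03.
# All names and data structures here are hypothetical.


def overwrite(table, sku, new_catg):
    """One row per SKU: the old category is replaced and lost."""
    table[sku] = new_catg


def append_version(rows, sku, new_catg, updated):
    """One dated row per change: every earlier category is retained."""
    rows.append({"sku": sku, "catg": new_catg, "updated": updated})


def record_with_previous(rows, sku, new_catg):
    """Add a row that stores the new category next to the one it replaces.

    Assumes the SKU already has at least one row in `rows`.
    """
    latest = next(r for r in reversed(rows) if r["sku"] == sku)
    rows.append({"sku": sku, "catg": new_catg, "prev_catg": latest["catg"]})


current_only = {"1001": "01"}
overwrite(current_only, "1001", "03")

history = [{"sku": "1001", "catg": "01", "updated": "20010101"}]
append_version(history, "1001", "03", "20021029")

chained = [
    {"sku": "1001", "catg": "15", "prev_catg": "20"},
    {"sku": "1001", "catg": "01", "prev_catg": "15"},
]
record_with_previous(chained, "1001", "03")

print(current_only)
print(history)
print(chained)
```

The sketch appends new rows at the end of each list, whereas the tables list the most recent row first; only the ordering differs.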
been overlaid.
Changing Dimensions
Warehousing Evolution
Summary
ized models.
of denormalized models.
table form.
and Delete
Any Question
Complex Analysis
Data Duplication
Data Granularity
To maintain the ease of navigation, the denormalized model is typically aggregated along one of the
dimension lines.
Data Navigation
Flexibility
Maintenance
Updating the denormalized model requires preprocessing, aggregation and longer time frames.
Dimensional Model
Foreign Key
Attribute
entity.
Hierarchy
Drill (Across)
different dimensions.
Intelligent Key
Normalized Model
replicated.
ETL
OLTP
Data Warehouse
A reflection of integrated data that is
Fact
Flattened Model
Dimension
report.
Primary Key
Appendix D Endnotes
Appendix E References
Schema
Structured framework that, for purposes
of this paper, provides and represents
relationships between entities.
Service Level Agreement (SLA)
05-07-2001.
2002, pp 11-12.
addresses).
30, 2003.
Snowflake Schema
Star Schema
http://www.billinmon.com/library/articles/artdimmd.asp.
Surrogate Key
Artificial key used as a substitute for
natural data keys (e.g., customer ID).
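As a rough illustration of the surrogate key definition, the Python sketch below hands out an integer key for each natural key and returns the same integer whenever that natural key is seen again. The class name SurrogateKeyMap and the customer identifiers are hypothetical; a real warehouse would typically generate such keys in the database or during loading.

```python
from itertools import count


class SurrogateKeyMap:
    """Hand out a stable integer key for each natural key (hypothetical helper)."""

    def __init__(self, start=1):
        self._next = count(start)
        self._keys = {}

    def key_for(self, natural_key):
        # Reuse the existing surrogate if this natural key was seen before.
        if natural_key not in self._keys:
            self._keys[natural_key] = next(self._next)
        return self._keys[natural_key]


customers = SurrogateKeyMap()
print(customers.key_for("CUST-A17"))  # first natural key seen
print(customers.key_for("CUST-B02"))  # a different natural key
print(customers.key_for("CUST-A17"))  # same natural key, same surrogate
```

Because the surrogate carries no business meaning, the natural key's format can change without affecting the rows that reference the surrogate.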
AllFusion is a trademark and ERwin is a registered trademark of Computer Associates International, Inc. Teradata continually improves products as new technologies and components become available. Teradata, therefore, reserves the right to change specifications without prior notice. All features, functions, and operations
described herein may not be marketed in all parts of the world. Consult your Teradata representative or Teradata.com for more information.
Copyright 2004-2007 by Teradata Corporation
EB-2406
Produced in U.S.A.