0% found this document useful (0 votes)

37 views18 pages

COMP 430 Intro. To Database Systems: Denormalization & Dimensional Modeling

This document discusses dimensional modeling, an alternative approach to entity-relationship (ER) modeling for database design. Dimensional modeling emphasizes fast retrieval and aggregation of historical data. It models data with fact tables containing numeric, additive data and dimension tables in one-to-many relationships with facts. Dimensional modeling starts by identifying potential queries and facts, and aims to support each query with one fact table and related dimensions. It may begin with an ER model but then denormalizes into a star or starflake schema to optimize for queries. Facts represent the center of a multidimensional data cube, and dimensions define the axes.

Uploaded by

Raynaldy Mahdi Putra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views18 pages

COMP 430 Intro. To Database Systems: Denormalization & Dimensional Modeling

Uploaded by

Raynaldy Mahdi Putra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 18

COMP 430

Intro. to Database Systems

Denormalization & Dimensional Modeling
Some consequences of normalization
• Data redundancy is reduced or eliminated.
• Relations are broken into smaller, related tables.
• Using all the attributes from the original relation requires joining
these smaller tables.
Denormalization
Deliberately reintroducing some redundancy, so that we can access
data faster.

DENORMALIZED
DATA AHEAD
Example

Technique:
Add duplicate fields

Technique:
Add computed fields
Example

Technique:
Join tables
Less common denormalization techniques
• Duplicating a commonly-used subset of table fields
• Splitting some table rows into different tables
• Frequently- vs. rarely-used data
• Data for different regions
• Common subclasses of data
When to denormalize?

Typically used when some or all of the following apply:

• Many queries need to join the data
• Joining the data is expensive – uses scans, rather than indices
• Computing derived data is expensive – complex queries or complex functions
Dealing with redundant data
Still want data consistency, but now it requires work.
Is full consistency required at all times?

Techniques:
• Stored procedures act as API for updating DB. They add the redundant data.
• Triggers check (and fix?) consistency.
• Application code carefully maintains consistency during updates.
• Reconcile data as background process.
• Reconcile data during system maintenance.
Dimensional modeling
In a tiny, brief nutshell
An alternative to ER modeling

Only a brief overview.

So, we’ll view it through our lens of ER modeling + denormalization.

Emphasizes decision making & use of historical data

• Fast retrieval & aggregation of data
• Less concern with updating data & maintaining consistency while updating
DB design often resembles multiple
starflakes
Starflake = tree of junction table & child tables in 1-to-many relationships

College(collegeid, …)

Course(crn, …) Student(sid, …, collegeid, …, home_stateid)

Enrollment(crn, sid, …) State(stateid, …, country)

Teach(crn, instr_id, …)
Typical:
• Few cycles.
Instructor(instr_id, …) • Super/sub classes implemented
only with superclass table.
Starflakes can be compressed to stars
Star = junction table & one level of child tables in 1-to-many relationships

Course(crn, …) Student(sid, …, college, …, home_state)

Enrollment(crn, sid, …)

Teach(crn, instr_id, …)
Denormalize by joining each child’s tree.

Instructor(instr_id, …)
Fact & dimension tables
Facts: The junction tables are the most important data – the facts
• E.g.: store purchases, class enrollments, click data
• Generally the largest tables
• Key data often numeric & additive – e.g., quantity bought, cost per unit,
advertisement views

Dimensions: The child tables in 1-to-many relationship with facts

• E.g., stores, customers, sales people, sales period
Dimensional modeling process
• Centered on identifying the business model
• Identifying the potential queries
• Identifying the facts – the data used in such queries
• Each query should use only one fact table & its dimension tables.

• Possibly start with an ER model

• Identify which junction tables serve as fact tables
• Use only surrogate keys
• Often add time dimension
• Denormalize into starflake or star schema
Customer Product
customer_ID sku
customer_name
purchase_profile
ER model description
brand
credit_profile category
address

OrderLine
Store Order
order_id
store_id order_id
sku
store_name customer_id
promotion_id
address store_id
dollars_sold
district clerk_id
units_sold
floor_type date
dollars_cost

Clerk Promotion
clerk_id promotion_id
clerk_name promotion_name
clerk_grade price_type
ad_type
Customer Product
customer_key
customer_name
Dimensional product_key
sku
purchase_profile
credit_profile model description
brand
address category
Order
time_key
Store
store_key
store_key Promotion
clerk_key
store_id promotion_key
product_key
store_name promotion_name
customer_key
address price_type
promotion_key
district ad_type
dollars_sold
floor_type
units_sold
dollars_cost
Time
Clerk time_key
clerk_key SQL_date
clerk_id day_of_week
clerk_name month
clerk_grade
View fact table as -dimensional data cube

Facts are the data in Each dimension table

the cube. represents a dimension
of the cube.

Facts might be pre-

aggregated along each
dimension or combination
of dimensions.
Sometimes things are still messy

Not all data fit nicely into facts + 1-to-many dimensions.

Leads to exceptions from this simple presentation.

Case Study On SDLC
100% (1)
Case Study On SDLC
9 pages
Final DBA
No ratings yet
Final DBA
87 pages
Primers - RDBMS My SQL
No ratings yet
Primers - RDBMS My SQL
105 pages
All Document Reader 1716046023160
No ratings yet
All Document Reader 1716046023160
79 pages
VB Project On Banking System Visual Basic
No ratings yet
VB Project On Banking System Visual Basic
24 pages
4 CC104 DataMOdel-1
No ratings yet
4 CC104 DataMOdel-1
49 pages
Advanced Database Integration Group 52
No ratings yet
Advanced Database Integration Group 52
45 pages
Lec 10 - DS - Database Management System Normalization
No ratings yet
Lec 10 - DS - Database Management System Normalization
40 pages
Presentation by Rajashekar G.S
100% (1)
Presentation by Rajashekar G.S
79 pages
Final Unit-3 Dbms
No ratings yet
Final Unit-3 Dbms
55 pages
DataBase - Management DataBase - Administration
No ratings yet
DataBase - Management DataBase - Administration
848 pages
Accenture Interview Questions & Answers.
100% (2)
Accenture Interview Questions & Answers.
4 pages
Basis Data - Database Design and SQL
No ratings yet
Basis Data - Database Design and SQL
72 pages
DBMS - 263
No ratings yet
DBMS - 263
23 pages
Data Management For Analytics Notes
No ratings yet
Data Management For Analytics Notes
21 pages
CH09 PPT DesigningDatabase
No ratings yet
CH09 PPT DesigningDatabase
43 pages
EDM - E1 - Data Architecture and Modeling - Normalization v1.1
No ratings yet
EDM - E1 - Data Architecture and Modeling - Normalization v1.1
60 pages
ETL Concepts Data Mart V1
No ratings yet
ETL Concepts Data Mart V1
35 pages
Chapter 2
No ratings yet
Chapter 2
46 pages
Lecture 3 & 4 - 5610
No ratings yet
Lecture 3 & 4 - 5610
19 pages
Explore The Role of Normal Forms in Dimensional Modeling
No ratings yet
Explore The Role of Normal Forms in Dimensional Modeling
15 pages
SQL W3schools
No ratings yet
SQL W3schools
110 pages
Course Content For SAP EWM: The Extended Warehouse Management System
No ratings yet
Course Content For SAP EWM: The Extended Warehouse Management System
5 pages
Relational Databases - An Intro
No ratings yet
Relational Databases - An Intro
15 pages
DWM Unit2
No ratings yet
DWM Unit2
65 pages
Introduction To Database
No ratings yet
Introduction To Database
56 pages
RDBMS
No ratings yet
RDBMS
46 pages
DWH Unit 2
No ratings yet
DWH Unit 2
13 pages
Modeling
No ratings yet
Modeling
8 pages
Chapter V
No ratings yet
Chapter V
38 pages
Data Modeling and Design
No ratings yet
Data Modeling and Design
18 pages
Lecture 02
No ratings yet
Lecture 02
46 pages
DBMS Session 6 Notes
No ratings yet
DBMS Session 6 Notes
50 pages
Data Modeling
No ratings yet
Data Modeling
6 pages
Module 2
No ratings yet
Module 2
16 pages
Dimensional Data Modeling - Lecture3
100% (1)
Dimensional Data Modeling - Lecture3
87 pages
Data Modeling
No ratings yet
Data Modeling
3 pages
Databases Lec6
No ratings yet
Databases Lec6
9 pages
Unit 4
No ratings yet
Unit 4
6 pages
Schema
No ratings yet
Schema
17 pages
Normalization vs. Denormalization Striking The Right Balance in Database Design
No ratings yet
Normalization vs. Denormalization Striking The Right Balance in Database Design
7 pages
Normalization and Denormalization Balancing Performance and Storage Efficiency
No ratings yet
Normalization and Denormalization Balancing Performance and Storage Efficiency
6 pages
Database Normalization and ERD
No ratings yet
Database Normalization and ERD
10 pages
Dimensional Modeling: Prof. Sunita Sahu
No ratings yet
Dimensional Modeling: Prof. Sunita Sahu
50 pages
DB Midterm Important - by Enas
No ratings yet
DB Midterm Important - by Enas
8 pages
Data Governance: A Conceptual Framework, Structured Review, and Research Agenda
No ratings yet
Data Governance: A Conceptual Framework, Structured Review, and Research Agenda
37 pages
Designing Databases
No ratings yet
Designing Databases
3 pages
Denormalization
No ratings yet
Denormalization
9 pages
Data Mesh A Business Oriented Framework For Quicker Insights HCL Whitepapers
No ratings yet
Data Mesh A Business Oriented Framework For Quicker Insights HCL Whitepapers
16 pages
DBMS 2
No ratings yet
DBMS 2
33 pages
Data Modeling Advanced Concepts & Database Tables and Normalization
No ratings yet
Data Modeling Advanced Concepts & Database Tables and Normalization
7 pages
Busiess Analytics Data Modeling Lecture 2
No ratings yet
Busiess Analytics Data Modeling Lecture 2
24 pages
Normalized and Denormalized Data
No ratings yet
Normalized and Denormalized Data
2 pages
Week6 7 Cce104l
No ratings yet
Week6 7 Cce104l
10 pages
Mpeb DBMS
No ratings yet
Mpeb DBMS
40 pages
Back-End Development Bootcamp KnowledgeHut RBG
No ratings yet
Back-End Development Bootcamp KnowledgeHut RBG
16 pages
Introduction To Data Modeling For Power BI - Gray
No ratings yet
Introduction To Data Modeling For Power BI - Gray
9 pages
Prelims - Dcit55
No ratings yet
Prelims - Dcit55
4 pages
Data Warehouse and Data Modelling
No ratings yet
Data Warehouse and Data Modelling
11 pages
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
No ratings yet
Unit Iv Data Normalization: Semantics of Attributes Should Be Easy To Interpret
14 pages
Advance Database Management System: Unit - 1 .The Extended Entity Relationship Model and Object Model
No ratings yet
Advance Database Management System: Unit - 1 .The Extended Entity Relationship Model and Object Model
31 pages
Dimension Modeling
No ratings yet
Dimension Modeling
37 pages
Database Management System: Introduction To DBMS Ms. Deepikkaa.S
No ratings yet
Database Management System: Introduction To DBMS Ms. Deepikkaa.S
45 pages
Big Data 2021 - 6,7,8 Big Data Technologies
No ratings yet
Big Data 2021 - 6,7,8 Big Data Technologies
55 pages
02 Database Design ER Model
No ratings yet
02 Database Design ER Model
22 pages
Dimensional Data Modeling - Lecture 1
No ratings yet
Dimensional Data Modeling - Lecture 1
21 pages
Democracia y Redentorismo - JLP
No ratings yet
Democracia y Redentorismo - JLP
131 pages
C THR12 66
No ratings yet
C THR12 66
1 page
FSD Template ENG
No ratings yet
FSD Template ENG
7 pages
41 Data Reconciliation
No ratings yet
41 Data Reconciliation
3 pages
Entity Relational Modeling Vs
No ratings yet
Entity Relational Modeling Vs
9 pages
NOSQL Data Management
No ratings yet
NOSQL Data Management
21 pages
Mhfdsbsvslnsafvjqjaoaodldanan
No ratings yet
Mhfdsbsvslnsafvjqjaoaodldanan
160 pages
Introduction To Database Systems
No ratings yet
Introduction To Database Systems
24 pages
Preface: Computer and Network Security: March 2020
No ratings yet
Preface: Computer and Network Security: March 2020
4 pages
Sheldon J. Plankton - Appearances - Encyclopedia SpongeBobia - Fandom
No ratings yet
Sheldon J. Plankton - Appearances - Encyclopedia SpongeBobia - Fandom
19 pages
Denormalization: 1 See Also
No ratings yet
Denormalization: 1 See Also
2 pages
A Review Paper On Cryptography: June 2019
No ratings yet
A Review Paper On Cryptography: June 2019
7 pages
Computer Engineering: Database Systems (DS) Assignment
No ratings yet
Computer Engineering: Database Systems (DS) Assignment
15 pages
Data Penting2
No ratings yet
Data Penting2
30 pages
My XML
No ratings yet
My XML
57 pages
Security Risks and Protection in Online Learning A
No ratings yet
Security Risks and Protection in Online Learning A
20 pages
Securing Network-on-Chip Using Incremental Cryptography
No ratings yet
Securing Network-on-Chip Using Incremental Cryptography
8 pages
Network Security With Cryptography: International Journal of Scientific Research January 2018
No ratings yet
Network Security With Cryptography: International Journal of Scientific Research January 2018
3 pages
Big Data Overview
No ratings yet
Big Data Overview
18 pages
Sample Format of Baseline Data Presentation of SBDPO
No ratings yet
Sample Format of Baseline Data Presentation of SBDPO
2 pages
Creating Natively Compiled Stored Procedures
No ratings yet
Creating Natively Compiled Stored Procedures
2 pages
Business Analytics Case Study: Likhamor L. Quezon Group - 1 (Hamilton)
No ratings yet
Business Analytics Case Study: Likhamor L. Quezon Group - 1 (Hamilton)
2 pages
L2. Introduction To Datawarehouse PDF
No ratings yet
L2. Introduction To Datawarehouse PDF
16 pages
IMQAV
No ratings yet
IMQAV
3 pages
Jurnal Perencanaan Arsitektur Enterprise Menggunakan Togaf Adm Versi 9 Studi Kasus (Bimbel Salemba Group)
No ratings yet
Jurnal Perencanaan Arsitektur Enterprise Menggunakan Togaf Adm Versi 9 Studi Kasus (Bimbel Salemba Group)
11 pages
Summer Holiday Homework Subject-Ip GRADE-11 Q1. Fill in The Blanks
No ratings yet
Summer Holiday Homework Subject-Ip GRADE-11 Q1. Fill in The Blanks
2 pages
Scrum Master Resume Sample - Windsor Original
No ratings yet
Scrum Master Resume Sample - Windsor Original
1 page

COMP 430 Intro. To Database Systems: Denormalization & Dimensional Modeling

Uploaded by

COMP 430 Intro. To Database Systems: Denormalization & Dimensional Modeling

Uploaded by

COMP 430

Intro. to Database Systems

Typically used when some or all of the following apply:

Only a brief overview.

Emphasizes decision making & use of historical data

Course(crn, …) Student(sid, …, collegeid, …, home_stateid)

Course(crn, …) Student(sid, …, college, …, home_state)

Dimensions: The child tables in 1-to-many relationship with facts

• Possibly start with an ER model

Facts are the data in Each dimension table

Facts might be pre-

Not all data fit nicely into facts + 1-to-many dimensions.

Leads to exceptions from this simple presentation.

You might also like