Data Warehouse Design

The basic concepts of dimensional modeling are facts, dimensions, and measures. Facts represent business transactions and contain measures and context data. Dimensions describe one business dimension and determine the context for facts. Measures are numeric attributes of facts representing business performance relative to dimensions.

Uploaded by

Eri Zuliarso

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

333 views29 pages

Data Warehouse Design

Uploaded by

Eri Zuliarso

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 29

The basic concepts of dimensional modeling

are: facts, dimensions and measures.

A fact is a collection of related data items,
consisting of measures and context data.
represents business items or business transactions.
A dimension is a collection of data that
describe one business dimension.
determine the contextual background for the facts;
A measure is a numeric attribute of a fact,
representing the performance or behavior of
the business relative to the dimensions
1. Fact Tables
the large tables in the warehouse schema that store
business measurements.
typically contain facts and foreign keys to the dimension
tables.
Represents data, usually numeric and additive, that can be
analyzed and examined.
Examples include sales, cost, and profit.
2. Dimension Tables
also known as lookup or reference tables,
contain the relatively static data in the warehouse.
store the information normally use to contain queries.
usually textual and descriptive and use them as the row
headers of the result set.
Examples are customers, Location, Time, Suppliers or
products.
A fact table has two types of columns:
1. measurements : those that contain numeric facts
2. those that are foreign keys to dimension tables.
A fact table contains :
detail-level facts
facts that have been aggregated.
Fact tables that contain aggregated facts are
called SUMMARY TABLES.
A fact table usually contains facts with the
same level of aggregation.
A fact table is the primary table in a dimensional
model where the numerical performance
measurements of the business are stored.
We can imagine standing in the marketplace watching
products being sold and writing down the quantity sold and
dollar sales amount each day for each product in each store.
A measurement is taken at the intersection of all the
dimensions (day, product, and store).
This list of dimensions defines the grain of the fact table and
tells us what the scope of the measurement is.
A row in a fact table corresponds to a
measurement.
A measurement is a row in a fact table.
All the measurements in a fact table must be at
the same grain.
The most useful facts are numeric and additive
fact tables have two or more foreign keys, as designated
by the FK notation
When all the keys in the fact table match their respective
primary keys correctly in the corresponding dimension
tables, we say that the tables satisfy referential integrity.
The fact table itself generally has its own
primary key made up of a subset of the foreign
keys.
This key is often called a composite or
concatenated key.
Every fact table in a dimensional model has a
composite key, and conversely, every table that
has a composite key is a fact table.
Fact tables express the many-to-many
relationships between dimensions in
dimensional models.
A dimension is a structure, often composed of
one or more hierarchies, that categorizes data.
Dimensional attributes help to describe the
dimensional value.
normally descriptive, textual values.
Dimension data is typically collected at the
lowest level of detail and then aggregated into
higher-level totals that are more useful for
analysis.
These natural rollups or aggregations within a
dimension table are called hierarchies
dimension tables have many columns or
attributes.
These attributes describe the rows in the
dimension table.
Each dimension is defined by its single primary
key, designated by the PK notation.
Dimension attributes serve as the primary
source of query constraints, groupings, and
report labels.
In a query or report request, attributes are
identified as the by words.
For example, when a user state that he or she
wants to see dollar sales by week by brand,
week and brand must be available as dimension
attributes.
A dimension table may be used in multiple
places if the data warehouse contains multiple
fact tables or contributes data to data marts.
For example:
a product dimension may be used with a sales fact
table and an inventory fact table in the data
warehouse, and also in one or more departmental
data marts.
A dimension such as customer, time, or product
that is used in multiple schemas is called a
conforming dimension
The records in a dimension table establish one-
to-many relationships with the fact table.
Examples : a number of sales to a single customer,
or a number of sales of a single product.
A schema is a collection of database objects,
including tables, views, indexes, and synonyms.
Most data warehouses use a dimensional model
schema
The principal characteristic of a dimensional
model is a set of detailed business facts
surrounded by multiple dimensions that
describe those facts.
When realized in a database, the schema for a
dimensional model contains a central fact table
and multiple dimension tables.
A schema is called a star schema if all dimension
tables can be joined directly to the fact table.
In the star schema design, a single object (the
fact table) sits in the middle and is radically
connected to other surrounding objects
(dimension lookup tables) like a star.
A star schema can be simple or complex.
A simple star consists of one fact table; a
complex star can have more than one fact table.
A schema is called a snowflake schema if one or
more dimension tables do not join directly to
the fact table but must join through other
dimension tables.
For example, a dimension that describes
products may be separated into three tables
(snowflaked).
In a star schema every dimension will have a
primary key.
In a star schema, a dimension table will not
have any parent table.
In a snowflake schema, a dimension table will
have one or more parent tables.
In star schema Hierarchies for the dimensions
are stored in the dimensional table itself.
Hierarchies are broken into separate tables in
snowflake schema.

EB2406 - Teradata PDF
No ratings yet
EB2406 - Teradata PDF
18 pages
Object Oriented Methods A Foundation PDF
No ratings yet
Object Oriented Methods A Foundation PDF
2 pages
A Dimension Table Consists of The Attributes About The Facts
No ratings yet
A Dimension Table Consists of The Attributes About The Facts
3 pages
Dimensional Modeling PDF
No ratings yet
Dimensional Modeling PDF
14 pages
Insurance DataWare House Design Vechiles
No ratings yet
Insurance DataWare House Design Vechiles
2 pages
Enabling Scalable OLAP Directly On A Data Lakehouse Architecture
No ratings yet
Enabling Scalable OLAP Directly On A Data Lakehouse Architecture
39 pages
Dimensional Model Data Warehouse Overview
No ratings yet
Dimensional Model Data Warehouse Overview
2 pages
The Benefits of Delta Lake and Lakehouse Architecture
No ratings yet
The Benefits of Delta Lake and Lakehouse Architecture
3 pages
CH 2 Introduction To Data Warehousing
No ratings yet
CH 2 Introduction To Data Warehousing
31 pages
Modern Data Architecture: Bywhinmon
No ratings yet
Modern Data Architecture: Bywhinmon
10 pages
Govindarajan Data Vault PDF
100% (1)
Govindarajan Data Vault PDF
29 pages
What Are The Dimensions in Data Warehouse
100% (1)
What Are The Dimensions in Data Warehouse
6 pages
A Dimensional Modeling Manifesto
No ratings yet
A Dimensional Modeling Manifesto
8 pages
MDX Introduction and Overview
No ratings yet
MDX Introduction and Overview
3 pages
Need of Two Types of Data: Information
No ratings yet
Need of Two Types of Data: Information
7 pages
Introduction To Data Warehouse
No ratings yet
Introduction To Data Warehouse
34 pages
Data Warehousing FAQ
No ratings yet
Data Warehousing FAQ
5 pages
L3 - Data Models
No ratings yet
L3 - Data Models
13 pages
The Definitive Guide To The SQL Data Lakehouse Eckerson Report
No ratings yet
The Definitive Guide To The SQL Data Lakehouse Eckerson Report
19 pages
Data Vault and HQDM Principles PDF
No ratings yet
Data Vault and HQDM Principles PDF
8 pages
ERModel PDF
100% (1)
ERModel PDF
82 pages
Data Prep Ebook Snowflake 1
No ratings yet
Data Prep Ebook Snowflake 1
8 pages
Data Modelling Training 21st Century +917386622889
No ratings yet
Data Modelling Training 21st Century +917386622889
8 pages
DWH
No ratings yet
DWH
48 pages
Data Mining Unit - 1 Notes
No ratings yet
Data Mining Unit - 1 Notes
16 pages
Data Modeling Interviews
No ratings yet
Data Modeling Interviews
16 pages
What Is DW2.0
No ratings yet
What Is DW2.0
13 pages
Apache Iceberg - Additional Real World Use Cases
No ratings yet
Apache Iceberg - Additional Real World Use Cases
25 pages
Mesh Modelling
No ratings yet
Mesh Modelling
4 pages
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
No ratings yet
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
28 pages
Denodo8 - Metadata Management Overview
No ratings yet
Denodo8 - Metadata Management Overview
28 pages
Federated vs. Centeralized vs. De-Centeralized Data Warehouse
No ratings yet
Federated vs. Centeralized vs. De-Centeralized Data Warehouse
5 pages
Logical Modeling SDLC
0% (1)
Logical Modeling SDLC
6 pages
Data Mining Concept Description: Characterization and Comparison
No ratings yet
Data Mining Concept Description: Characterization and Comparison
14 pages
What Is The Level of Granularity of A Fact Table
No ratings yet
What Is The Level of Granularity of A Fact Table
15 pages
Dimensional Data Modeling - Lecture 1
No ratings yet
Dimensional Data Modeling - Lecture 1
21 pages
CS54-Data Modeling Using The Entity-Relationship Data B
No ratings yet
CS54-Data Modeling Using The Entity-Relationship Data B
35 pages
CSE 530 - Database Management Systems: Data Warehousing Presentation by Ali Gardezi Prashanth Janardanan Aaron Sheffield
No ratings yet
CSE 530 - Database Management Systems: Data Warehousing Presentation by Ali Gardezi Prashanth Janardanan Aaron Sheffield
69 pages
A Framework For ETL Systems Development
No ratings yet
A Framework For ETL Systems Development
16 pages
Data Warehouse
No ratings yet
Data Warehouse
74 pages
How To Sell A Data Warehouse To Upper Management Checklist
No ratings yet
How To Sell A Data Warehouse To Upper Management Checklist
6 pages
Dimensional Models Intro
No ratings yet
Dimensional Models Intro
18 pages
An Investigation of NoSQL Database Performance From A MYSQL Perspective
No ratings yet
An Investigation of NoSQL Database Performance From A MYSQL Perspective
3 pages
Unit 1
No ratings yet
Unit 1
14 pages
Gartner Reprint
No ratings yet
Gartner Reprint
42 pages
Dimensional Modeling and Schemas: Data Modeling Research Paper
No ratings yet
Dimensional Modeling and Schemas: Data Modeling Research Paper
11 pages
ERwin API
No ratings yet
ERwin API
72 pages
OBIEE Semantic Layer
No ratings yet
OBIEE Semantic Layer
3 pages
Data Mining-Data Warehouse
No ratings yet
Data Mining-Data Warehouse
7 pages
Data Warehousing Logical Design
100% (1)
Data Warehousing Logical Design
23 pages
Association Rules (Ardytha Luthfiarta)
No ratings yet
Association Rules (Ardytha Luthfiarta)
69 pages
Data Warehouse Design For E-Commerce Environment
No ratings yet
Data Warehouse Design For E-Commerce Environment
26 pages
Batch B DWM Experiments
No ratings yet
Batch B DWM Experiments
90 pages
Why Is The Snowflake Schema A Good Data Warehouse Design
No ratings yet
Why Is The Snowflake Schema A Good Data Warehouse Design
19 pages
Database Questions and Answers
No ratings yet
Database Questions and Answers
4 pages
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
From Everand
The Definitive Guide to Data Integration: Unlock the power of data integration to efficiently manage, transform, and analyze data
Pierre-yves Bonnefoy
No ratings yet
Enterprise architecture planning Complete Self-Assessment Guide
From Everand
Enterprise architecture planning Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Databricks Essentials: A Guide to Unified Data Analytics
From Everand
Databricks Essentials: A Guide to Unified Data Analytics
Robert Johnson
No ratings yet
Enterprise Architecture EA Standard Requirements
From Everand
Enterprise Architecture EA Standard Requirements
Gerardus Blokdyk
No ratings yet
Mastering Data Engineering and Analytics with Databricks: A Hands-on Guide to Build Scalable Pipelines Using Databricks, Delta Lake, and MLflow (English Edition)
From Everand
Mastering Data Engineering and Analytics with Databricks: A Hands-on Guide to Build Scalable Pipelines Using Databricks, Delta Lake, and MLflow (English Edition)
Manoj Kumar
No ratings yet
HDInsight Essentials - Second Edition
From Everand
HDInsight Essentials - Second Edition
Rajesh Nadipalli
No ratings yet
Pattern Based Indonesian Question Answering System
No ratings yet
Pattern Based Indonesian Question Answering System
6 pages
1.1 Networks-Everywhere PDF
No ratings yet
1.1 Networks-Everywhere PDF
10 pages
Visualizing The Intellectual Structure of Information Science (2006-2015) : Introducing Author Keyword Coupling Analysis
No ratings yet
Visualizing The Intellectual Structure of Information Science (2006-2015) : Introducing Author Keyword Coupling Analysis
20 pages
Study of Stemming Algorithms PDF
No ratings yet
Study of Stemming Algorithms PDF
49 pages
Becchetti 2008 Link Spam Techniques
No ratings yet
Becchetti 2008 Link Spam Techniques
15 pages
Naïve Bayes Classifier: Ke Chen
No ratings yet
Naïve Bayes Classifier: Ke Chen
20 pages
Graph Visualization
No ratings yet
Graph Visualization
9 pages
Ontology-Based Question Answering in A Federation of University Sites: The MOSES Case Study
No ratings yet
Ontology-Based Question Answering in A Federation of University Sites: The MOSES Case Study
11 pages
Detailed University Schema: Appendix
No ratings yet
Detailed University Schema: Appendix
2 pages
Chapter 05
100% (1)
Chapter 05
11 pages
Color 1 Color 2 Color 3 Color 4 Color 5
No ratings yet
Color 1 Color 2 Color 3 Color 4 Color 5
4 pages
Sysmod Sysml 1.3 Reference Card Weilkiens PDF
No ratings yet
Sysmod Sysml 1.3 Reference Card Weilkiens PDF
4 pages
Dbms Hindi
89% (9)
Dbms Hindi
33 pages
Question 1 of 20
100% (2)
Question 1 of 20
54 pages
Relational Algebra: Prepared By: Raquel Ofreneo, MIT
No ratings yet
Relational Algebra: Prepared By: Raquel Ofreneo, MIT
19 pages
Principles of Information: Systems, Tenth Edition
No ratings yet
Principles of Information: Systems, Tenth Edition
61 pages
CS/SS G514 Object Oriented Analysis and Design: 9-Aug-17 Ooad 2
No ratings yet
CS/SS G514 Object Oriented Analysis and Design: 9-Aug-17 Ooad 2
16 pages
Programming Paradigms: (Lectures On High-Performance Computing For Economists VII)
No ratings yet
Programming Paradigms: (Lectures On High-Performance Computing For Economists VII)
21 pages
OOSE Week 07-UML Sequence Diagram
No ratings yet
OOSE Week 07-UML Sequence Diagram
20 pages
CS 103 Computer Programming: Week 02
No ratings yet
CS 103 Computer Programming: Week 02
15 pages
The Renormalization Group - Lecture Notes (Condensed) : Jan Tuzlić Offermann
No ratings yet
The Renormalization Group - Lecture Notes (Condensed) : Jan Tuzlić Offermann
6 pages
ER Model: Example, Suppose We Design A School Database. in This Database, The Student
No ratings yet
ER Model: Example, Suppose We Design A School Database. in This Database, The Student
10 pages
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
No ratings yet
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
79 pages
Quality Management in Healthcare
No ratings yet
Quality Management in Healthcare
7 pages
Modelling Concepts
No ratings yet
Modelling Concepts
15 pages
3dsmax FBX 2012 Map
No ratings yet
3dsmax FBX 2012 Map
5 pages
Best Subset Methods
No ratings yet
Best Subset Methods
3 pages
CSE II-II SEM (DBMS Lab Manual) PDF
100% (1)
CSE II-II SEM (DBMS Lab Manual) PDF
124 pages
1.input Design
No ratings yet
1.input Design
30 pages
Physical Modeling
No ratings yet
Physical Modeling
18 pages
Oracle 1Z0 071
No ratings yet
Oracle 1Z0 071
2 pages
Geometric Modelling Projection Systems: Engineering Communications GL2
No ratings yet
Geometric Modelling Projection Systems: Engineering Communications GL2
20 pages
Otsu Thresholding Explained: Hide Menu
No ratings yet
Otsu Thresholding Explained: Hide Menu
4 pages
4th and 5th Norm Forms
No ratings yet
4th and 5th Norm Forms
4 pages
55-7963 Control of Linear Systems Assignment - 1415 PDF
No ratings yet
55-7963 Control of Linear Systems Assignment - 1415 PDF
4 pages
CS1402 Ooad
No ratings yet
CS1402 Ooad
9 pages
Lucrarea de Laborator Nr. 3: Raport La
No ratings yet
Lucrarea de Laborator Nr. 3: Raport La
12 pages

Data Warehouse Design

Uploaded by

Data Warehouse Design

Uploaded by

The basic concepts of dimensional modeling

are: facts, dimensions and measures.

You might also like