Bi Unit 2

This document discusses key concepts in dimensional data modeling including fact tables, dimensions, grain, additive and non-additive facts, slowly changing dimensions, clickstream data, multivalued dimensions, and dimension attributes. Fact tables contain numeric measurements while dimension tables contain descriptive attributes. Together they define the structure for analyzing business processes and events.

Uploaded by

Jagdish Kapadnis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views14 pages

Bi Unit 2

Uploaded by

Jagdish Kapadnis

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 14

Star Schema

Snow flake schema

Fact constellation schema
What is a Fact Table?

• Fact tables contain the data corresponding to

a particular business process.
• Each row represents a single event associated
with that process and contains the
measurement data associated with that event.
• The information contained within a fact table
is typically numeric data.
What are Dimensions?

• Dimensions describe the objects involved in a

business intelligence effort. While facts
correspond to events, dimensions correspond
to people, items, or other objects.
• Dimension tables contain details about each
instance of an object.
Fact table Vs dimension table
• Fact tables and dimension tables are related
to each other.
• If we take example of retail model, the fact
table for a customer transaction would likely
contain a foreign key reference to the item
dimension table, where the entry corresponds
to a primary key in that table for a record
describing the item purchased.
Dimensional Modeling - Grain

• Grain: The grain describe the level of detail.

• It can be also seen as:
• the unique key of a table of a SQL Statement of the unique
identifier of a hierarchy level in a dimension.

At the top level, there are two main options in choosing the level
of granularity:
• Unsummarized/Atomic (transaction level granularity): this is
the highest level of granularity where each fact table row
corresponds to a single transaction or line item
• Summarized: transactions may be summarized by a subset of
dimensions or dimensional attributes. In this case, each row in
the fact table corresponds to multiple transactions
Dimensional Modeling - Grain
• The most granular or atomic data (atomic as an indivisible unit of work)
has the most dimensionality. Atomic data is highly dimensional.
Preferably, you should develop dimensional models for the most atomic
information captured by a Event. Atomic data is the most detailled
information collected: such data cannot be subdivided further.
Example: If a high grain is the month whereas a low or detail grain can
be the day
• A data warehouse almost always demands data to the lowest possible
grain of each dimension not because queries want to see individual low
level rows but because queries need to cut through the details in very
precise ways.
• The lower the level of granularity (or conversely, the higher the level of
summarization), the less storage space required and the faster queries
will be executed.
Additive, Semi-Additive, and Non-Additive Facts

• The numeric measures in a fact table fall into three categories.

The most flexible and useful facts are fully additive; additive
measures can be summed across any of the dimensions
associated with the fact table. Semi-additive measures can be
summed across some dimensions, but not all; balance amounts
are common semi-additive facts because they are additive across
all dimensions except time. Finally, some measures are
completely non-additive, such as ratios. A good approach for
non-additive facts is, where possible, to store the fully additive
components of the non-additive measure and sum these
components into the final answer set before calculating the final
non-additive fact. This final calculation is often done in the BI
layer or OLAP cube.
Slowly Changing Dimensions(SCD)
• dimensions have been assumed to be independent
of time. Unfortunately, this is not the case in the
real world.
• dimension table attributes are relatively static, they
are not fixed forever.
• Dimension attributes change, though rather slowly,
over time. Dimensional designers must engage
business representatives proactively to help
determine the appropriate change-handling
strategy.
Clickstream Data
• Clickstream data is an information trail a user
leaves behind while visiting a website.
• It is typically captured in semi-structured
website log files.
• These website log files contain data elements
such as a date and time stamp, the visitor's IP
address, the destination URLs of the pages
visited, and a user ID that uniquely identifies
the website visitor.
Clickstream Data logs
Multivalued Dimensions
• An open-ended many-valued attribute can be associated with a
dimension row by using a bridge table to associate the many-
valued attributes with the dimension.
Example:
In some financial services companies, the individual customer is
identified and associated with each transaction. For example,
credit card companies often issue unique card numbers to each
cardholder. John and Mary Smith may have a joint credit card
account, but the numbers on their respective pieces of plastic are
unique. In this case there is no need for an account-to-customer
bridge table because the atomic transaction facts are at the
discrete customer grain. Account and customer would both be
foreign keys in this fact table.
Dimension attributes.
• Dimension tables are integral companions to a fact table. The dimension
tables contain the textual descriptors of the business.
• In a well-designed dimensional model, dimension tables have many
columns or attributes. These attributes describe the rows in the dimension
table.
• Dimension attributes serve as the primary source of query constraints,
groupings, and report labels. In a query or report request, attributes are
identified as the by words.
• For example, when a user states that he or she wants to see dollar sales by
week by brand, week and brand must be available as dimension attributes.
• Dimension table attributes play a vital role in the data warehouse. Since
they are the source of virtually all interesting constraints and report labels,
they are key to making the data warehouse usable and understandable.
• The data warehouse is only as good as the dimension attributes. The power
of the data warehouse is directly proportional to the quality and depth of
the dimension attributes.

Chapter 2 Kimball Dimensional Modelling Techniques Overview
No ratings yet
Chapter 2 Kimball Dimensional Modelling Techniques Overview
14 pages
Data Warehousing Schemas
No ratings yet
Data Warehousing Schemas
18 pages
What Are The Dimensions in Data Warehouse
100% (1)
What Are The Dimensions in Data Warehouse
6 pages
Data Warehouse Design
No ratings yet
Data Warehouse Design
29 pages
First Part 27 Pages
No ratings yet
First Part 27 Pages
27 pages
COMP8047 - S05 Dimensional Modelling 2
No ratings yet
COMP8047 - S05 Dimensional Modelling 2
34 pages
Chapter Eight
No ratings yet
Chapter Eight
33 pages
DWT Chapter 2 Part 1
No ratings yet
DWT Chapter 2 Part 1
18 pages
CSIS 3300 W3 Denormalization StarSchema
No ratings yet
CSIS 3300 W3 Denormalization StarSchema
27 pages
BW4 New
100% (1)
BW4 New
28 pages
Ais Prof 1 Chapter 5
No ratings yet
Ais Prof 1 Chapter 5
39 pages
Lecture 3 & 4 - 5610
No ratings yet
Lecture 3 & 4 - 5610
19 pages
Design Handbooks Table of Contents
No ratings yet
Design Handbooks Table of Contents
1 page
2 Logical Applications m2 Slides
No ratings yet
2 Logical Applications m2 Slides
11 pages
Lecture 4
No ratings yet
Lecture 4
24 pages
Dimensional Modeling
No ratings yet
Dimensional Modeling
59 pages
Unit II DWDM
No ratings yet
Unit II DWDM
97 pages
DWM Unit-Ii Notes
No ratings yet
DWM Unit-Ii Notes
27 pages
CH 3
No ratings yet
CH 3
60 pages
DW Lec7
No ratings yet
DW Lec7
15 pages
Unit 4
No ratings yet
Unit 4
41 pages
Week 3
No ratings yet
Week 3
39 pages
DataWarehouse Interview Question
No ratings yet
DataWarehouse Interview Question
7 pages
Different Types of Dimensions and Facts in Data
No ratings yet
Different Types of Dimensions and Facts in Data
5 pages
Unit 2
No ratings yet
Unit 2
33 pages
Dimensional Modeling: E-BIZ Practice Tata Consultancy Services, India
No ratings yet
Dimensional Modeling: E-BIZ Practice Tata Consultancy Services, India
35 pages
Dimensional Modelling: CS2.1.1 CS2.1.2
No ratings yet
Dimensional Modelling: CS2.1.1 CS2.1.2
22 pages
Data Warehouse Concepts
No ratings yet
Data Warehouse Concepts
11 pages
Data Warehousing Concepts
No ratings yet
Data Warehousing Concepts
14 pages
Dimensional Modeling: Prof. Sunita Sahu
No ratings yet
Dimensional Modeling: Prof. Sunita Sahu
50 pages
Dimensional Modelling
No ratings yet
Dimensional Modelling
36 pages
Facts & Dims
No ratings yet
Facts & Dims
14 pages
Week 5
No ratings yet
Week 5
19 pages
Bi Lecture4 - 2023
No ratings yet
Bi Lecture4 - 2023
49 pages
A Dimension Table Consists of The Attributes About The Facts
No ratings yet
A Dimension Table Consists of The Attributes About The Facts
3 pages
Dimensional Modeling
No ratings yet
Dimensional Modeling
59 pages
Lecture 1 Notes: Dimension Tables
No ratings yet
Lecture 1 Notes: Dimension Tables
2 pages
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
No ratings yet
What Is Fact?: A Fact Is A Collection of Related Data Items, Each Fact Typically Represents A Business Item, A
28 pages
Basics of Dimensional Modeling
100% (1)
Basics of Dimensional Modeling
14 pages
What Is The Difference Between OLTP and OLAP?
No ratings yet
What Is The Difference Between OLTP and OLAP?
33 pages
Citer
No ratings yet
Citer
4 pages
DATAWAREHOUSE PPT NEWW
No ratings yet
DATAWAREHOUSE PPT NEWW
27 pages
Oh 3
No ratings yet
Oh 3
30 pages
ETL Testing
No ratings yet
ETL Testing
3 pages
Dimensional Modeling
100% (1)
Dimensional Modeling
12 pages
C 01 Dimensional Modeling
No ratings yet
C 01 Dimensional Modeling
30 pages
Chapter 7 Data Marts and Star Schema Design
No ratings yet
Chapter 7 Data Marts and Star Schema Design
7 pages
Dimensional Modelling
No ratings yet
Dimensional Modelling
26 pages
Advanced Dimensional Modeling
No ratings yet
Advanced Dimensional Modeling
19 pages
Entity-Relationship Model: Data Warehouse Data Models
No ratings yet
Entity-Relationship Model: Data Warehouse Data Models
4 pages
Data Stage
No ratings yet
Data Stage
10 pages
Postgrre
No ratings yet
Postgrre
14 pages
What Is Dimensional Model
No ratings yet
What Is Dimensional Model
7 pages
Dimensional Modeling PDF
No ratings yet
Dimensional Modeling PDF
14 pages
Types of Dimensions - Data Warehouse
No ratings yet
Types of Dimensions - Data Warehouse
8 pages
Disadvantages of File Processing System
No ratings yet
Disadvantages of File Processing System
17 pages
Data Warehouse Schema
No ratings yet
Data Warehouse Schema
10 pages
Learn Excel, Finance and More With These Free Resources
No ratings yet
Learn Excel, Finance and More With These Free Resources
18 pages
Oracle Apps Tables
100% (1)
Oracle Apps Tables
4 pages
Data Mning
No ratings yet
Data Mning
10 pages
Dimensions DW
No ratings yet
Dimensions DW
6 pages
Top 50 Oracle Interview Questions and Answers
No ratings yet
Top 50 Oracle Interview Questions and Answers
51 pages
Dimensional Modeling (DM)
No ratings yet
Dimensional Modeling (DM)
9 pages
Fact Tables
No ratings yet
Fact Tables
3 pages
Fact and Dimension Tables
No ratings yet
Fact and Dimension Tables
11 pages
Unit-III-SQL RDBMS: A Lalitha Associate Professor Avinash Degree College
No ratings yet
Unit-III-SQL RDBMS: A Lalitha Associate Professor Avinash Degree College
32 pages
Module-1: Data Warehousing & Modelling
No ratings yet
Module-1: Data Warehousing & Modelling
13 pages
DWH Int Questions
100% (1)
DWH Int Questions
9 pages
Guide To Google Cloud Databases
100% (1)
Guide To Google Cloud Databases
15 pages
Advanced SQL
No ratings yet
Advanced SQL
31 pages
Linux MCQ Fot Interview
No ratings yet
Linux MCQ Fot Interview
2 pages
Data Warehousing and Mining Complete Notes
No ratings yet
Data Warehousing and Mining Complete Notes
495 pages
CSE544: SQL: Monday 3/27 and Wednesday 3/29, 2006
No ratings yet
CSE544: SQL: Monday 3/27 and Wednesday 3/29, 2006
78 pages
CH 7
0% (2)
CH 7
11 pages
WinCC Unified V19 - Connecting Unified Comfort Panel With SQL Database
No ratings yet
WinCC Unified V19 - Connecting Unified Comfort Panel With SQL Database
3 pages
Cara Membuka STATA
No ratings yet
Cara Membuka STATA
2 pages
Homogeneous Databases
No ratings yet
Homogeneous Databases
1 page
Modern Systems Analysis and Design: Ninth Edition
No ratings yet
Modern Systems Analysis and Design: Ninth Edition
62 pages
Stud MGMT System
No ratings yet
Stud MGMT System
16 pages
TCCS3023
No ratings yet
TCCS3023
20 pages
Vdocument - in - Appsync With Dell Emc Unity Table of Contents Executive Summary 3 Audience
No ratings yet
Vdocument - in - Appsync With Dell Emc Unity Table of Contents Executive Summary 3 Audience
22 pages
GWP-Technical SEO-251022-111228
No ratings yet
GWP-Technical SEO-251022-111228
6 pages
C355 1 Team File Submission Team 3 001
No ratings yet
C355 1 Team File Submission Team 3 001
5 pages
Tableau Important Question
No ratings yet
Tableau Important Question
2 pages
OpenSAP Bwhana1 Week 1 Transcript
No ratings yet
OpenSAP Bwhana1 Week 1 Transcript
35 pages
CIS Reviewer
No ratings yet
CIS Reviewer
3 pages
The Sample Database: Appendix
No ratings yet
The Sample Database: Appendix
6 pages
The Athena Service Management System: Mark A. Rosenstein Daniel E. Geer, Jr. Peter J. Levine
No ratings yet
The Athena Service Management System: Mark A. Rosenstein Daniel E. Geer, Jr. Peter J. Levine
11 pages
Bankproject Py
No ratings yet
Bankproject Py
2 pages
Microsoft Excel Statistical and Advanced Functions for Decision Making
From Everand
Microsoft Excel Statistical and Advanced Functions for Decision Making
Palani Murugappan
5/5 (2)

Bi Unit 2

Uploaded by

Bi Unit 2

Uploaded by

Star Schema

Snow flake schema

• Fact tables contain the data corresponding to

• Dimensions describe the objects involved in a

• Grain: The grain describe the level of detail.

• The numeric measures in a fact table fall into three categories.

You might also like