0% found this document useful (0 votes)
47 views

Star and Snowflake Schema in Data Warehouse With Model Examples

The document discusses different types of multidimensional schemas used in data warehousing. It describes the star schema as the simplest type containing one fact table and dimension tables. The snowflake schema extends the star schema by normalizing dimensions into multiple tables. A galaxy schema contains multiple fact tables that share dimension tables, while a star cluster schema balances the complexity of snowflake and simplicity of star schemas.

Uploaded by

Ranztyn Tortor
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
47 views

Star and Snowflake Schema in Data Warehouse With Model Examples

The document discusses different types of multidimensional schemas used in data warehousing. It describes the star schema as the simplest type containing one fact table and dimension tables. The snowflake schema extends the star schema by normalizing dimensions into multiple tables. A galaxy schema contains multiple fact tables that share dimension tables, while a star cluster schema balances the complexity of snowflake and simplicity of star schemas.

Uploaded by

Ranztyn Tortor
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Star and Snowflake Schema in Data Warehouse with Model Examples

What is Multidimensional schema?


Multidimensional Schema is especially designed to model data warehouse systems. The schemas are
designed to address the unique needs of very large databases designed for the analytical purpose (OLAP).

Types of Data Warehouse Schema:


Following are 3 chief types of multidimensional schemas each having its unique advantages.
 Star Schema
 Snowflake Schema
 Galaxy Schema

Topic
 What is a Star Schema?
 What is a Snowflake Schema?
 Star Schema Vs Snowflake Schema: Key Differences
 What is a Galaxy schema?
 What is Star Cluster Schema?

What is a Star Schema?


Star Schema in data warehouse, in which the center of the star can have one fact table and a number of
associated dimension tables. It is known as star schema as its structure resembles a star. The Star Schema data
model is the simplest type of Data Warehouse schema. It is also known as Star Join Schema and is optimized for
querying large data sets.
In the following Star Schema example, the fact table is at the center which contains keys to every dimension
table like Dealer_ID, Model ID, Date_ID, Product_ID, Branch_ID & other attributes like Units sold and revenue.

Example of Star Schema Diagram


Characteristics of Star Schema:
 Every dimension in a star schema is represented with the only one-dimension table.
 The dimension table should contain the set of attributes.
 The dimension table is joined to the fact table using a foreign key
 The dimension table are not joined to each other
 Fact table would contain key and measure
 The Star schema is easy to understand and provides optimal disk usage.
 The dimension tables are not normalized. For instance, in the above figure, Country_ID does not have
Country lookup table as an OLTP design would have.
 The schema is widely supported by BI Tools

What is a Snowflake Schema?


Snowflake Schema in data warehouse is a logical arrangement of tables in a multidimensional database
such that the ER diagram resembles a snowflake shape. A Snowflake Schema is an extension of a Star Schema,
and it adds additional dimensions. The dimension tables are normalized which splits data into additional tables.
In the following Snowflake Schema example, Country is further normalized into an individual table.

Example of Snowflake Schema

Characteristics of Snowflake Schema:


 The main benefit of the snowflake schema it uses smaller disk space.
 Easier to implement a dimension is added to the Schema
 Due to multiple tables query performance is reduced
 The primary challenge that you will face while using the snowflake Schema is that you need to perform more
maintenance efforts because of the more lookup tables.
Star Schema Vs Snowflake Schema: Key Differences
Following is a key difference between Snowflake schema vs Star schema:
Star Schema Snowflake Schema
Hierarchies for the dimensions are stored in the Hierarchies are divided into separate tables.
dimensional table.
It contains a fact table surrounded by dimension tables. One fact table surrounded by dimension table which are
in turn surrounded by dimension table
In a star schema, only single join creates the A snowflake schema requires many joins to fetch the
relationship between the fact table and any dimension data.
tables.
Simple DB Design. Very Complex DB Design.
Denormalized Data structure and query also run faster. Normalized Data Structure.
High level of Data redundancy Very low-level data redundancy
Single Dimension table contains aggregated data. Data Split into different Dimension Tables.
Cube processing is faster. Cube processing might be slow because of the complex
join.
Offers higher performing queries using Star Join Query The Snowflake schema is represented by centralized
Optimization. fact table which unlikely connected with multiple
Tables may be connected with multiple dimensions. dimensions.

What is a Galaxy Schema?


A Galaxy Schema contains two fact table that share dimension tables between them. It is also called Fact
Constellation Schema. The schema is viewed as a collection of stars hence the name Galaxy Schema.

Example of Galaxy Schema


As you can see in above example, there are two facts table
1. Revenue
2. Product.
In Galaxy schema shares dimensions are called Conformed Dimensions.
Characteristics of Galaxy Schema:
 The dimensions in this schema are separated into separate dimensions based on the various levels of
hierarchy.
 For example, if geography has four levels of hierarchy like region, country, state, and city then Galaxy
schema should have four dimensions.
 Moreover, it is possible to build this type of schema by splitting the one-star schema into more Star
schemes.
 The dimensions are large in this schema which is needed to build based on the levels of hierarchy.
 This schema is helpful for aggregating fact tables for better understanding.

What is Star Cluster Schema?


Snowflake schema contains fully expanded hierarchies. However, this can add complexity to the Schema
and requires extra joins. On the other hand, star schema contains fully collapsed hierarchies, which may lead to
redundancy. So, the best solution may be a balance between these two schemas which is Star Cluster Schema
design.

Example of Star Cluster Schema

Overlapping dimensions can be found as forks in hierarchies. A fork happens when an entity acts as a parent in two
different dimensional hierarchies. Fork entities then identified as classification with one-to-many relationships.

Summary:
 Multidimensional schema is especially designed to model data warehouse systems
 The star schema is the simplest type of Data Warehouse schema. It is known as star schema as its
structure resembles a star.
 Comparing Snowflake vs Star schema, a Snowflake Schema is an extension of a Star Schema, and it adds
additional dimensions. It is called snowflake because its diagram resembles a Snowflake.
 In a star schema, only single join defines the relationship between the fact table and any dimension tables.
 Star schema contains a fact table surrounded by dimension tables.
 Snowflake schema is surrounded by dimension table which are in turn surrounded by dimension table
 A snowflake schema requires many joins to fetch the data.
 Comparing Star vs Snowflake schema, Start schema has simple DB design, while Snowflake schema has
very complex DB design.
 A Galaxy Schema contains two fact table that shares dimension tables. It is also called Fact Constellation
Schema.
 Star cluster schema contains attributes of Star and Snowflake Schema.

You might also like