Unit 2
Unit 2
4. Identify Facts
5. Build Star
The model should describe the Why, How much, When/Where/Who and What of your
business process
attribute that helps describe a dimensional value. Fact tables are usually larger than dimension
tables.
Characteristics of Star Schema
Every dimension in a star schema is represented with the only one-dimension table.
The dimension table should contain the set of attributes.
The dimension table is joined to the fact table using a foreign key
The dimension table are not joined to each other
Fact table would contain key and measure
The Star schema is easy to understand and provides optimal disk usage.
The dimension tables are not normalized. For instance, in the above figure, Country_ID
does not have Country lookup table as an OLTP design would have.
The schema is widely supported by BI Tools
Advantages of Star Schema
Star schemas have a more straightforward join logic compared to other schemas for
fetching data from highly normalized transactional schemas.
As opposed to highly normalized transactional schemas, the star schema simplifies
common business reporting logic, such as reporting and period-over-period.
Star schemas are widely used by OLAP systems to design cubes efficiently. A star schema
can be used as a source without designing a cube structure in most major OLAP systems.
By enabling specific performance schemes that can be applied to queries, the query
processor software in Star Schema can offer better execution plans.
Disadvantage of Star Schema
Since the schema is highly de-normalized, data integrity is not enforced well.
Not flexible in terms of analytical needs.
Star schemas do not reinforce many-to-many relationships within business entities.
What is a Snowflake Schema?
Snowflake Schema in data warehouse is a logical arrangement of tables in a multidimensional
database such that the ER diagram resembles a snowflake shape. A Snowflake Schema is an
extension of a Star Schema, and it adds additional dimensions. The dimension tables are
normalized which splits data into additional tables.
Snowflake Schema Example
In the following Snowflake Schema example, Country is further normalized into an individual table.
data.
Cube processing might be slow because of
Cube processing is faster.
the complex join.
Offers higher performing queries using Star
The Snowflake schema is represented by
Join Query Optimization.
centralized fact table which unlikely
Tables may be connected with multiple
connected with multiple dimensions.
dimensions.
Implementation of independent data marts is antithetical to the motivation for building a data
warehouse. First of all, you need a consistent, centralized store of enterprise data which can be
analyzed by multiple users with different interests who want widely varying information.