Schemas For Multidimensional Databases

Star schema, snowflake schema, and fact constellation schema are logical data warehouse designs for multidimensional databases. A star schema centers around a large fact table linked to smaller dimension tables. A snowflake schema further normalizes the dimension tables. A fact constellation contains multiple fact tables that share dimension tables.

Uploaded by

ddhakal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

1K views5 pages

Schemas For Multidimensional Databases

Uploaded by

ddhakal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 5

SCHEMAS FOR MULTIDIMENSIONAL DATABASES

STAR SCHEMA:
A star schema classifies the attributes of an event into facts(measured numeric/time data), and descriptive dimension attributes (product id, customer name, sale date) that give the facts a context. A fact record is the nexus between the specific dimension values and the recorded facts. The Facts are grouped together by grain (level of detail) and stored in the fact table. Dimension attributes are organized into affinity groups and stored a minimal number of dimension tables. A weather star schema that records weather data may have facts of temp, barometric pressure, wind speed, precipitation, cloud cover,etc and dimensions of location, date/time, reporter, etc. Star schemas are designed to optimize user ease-of-use and retrieval performance by minimizing the number of tables to join to materialize a transaction. A star schema is called such as it resembles a constellation of stars, generally several bright stars (facts) surrounded by dimmer ones (dimensions).

The fact table holds the metric values recorded for a specific event. Because of the desire to hold atomic level data, there generally are a very large number of records(billions). Special care is taken to minimize the number and size of attributes in order to constrain the overall table size and maintain performance. Fact tables generally come in 3 flavors transaction (facts about a specific event eg Sale), snapshot (facts recorded at a point in time (eg Account details at month end ), and accumulating snapshot tables (eg month-to-date sales for a product). Dimension tables, usually have few records compared to fact tables, but may have a very large number of attributes that describe the fact data.

Star Schema DMQL define cube sales_star [dim_date, dim_product, dim_store]: define dimension dim_store as (id, store_number, state_province, country) define dimension dim_product as (id, EAN_code, Product_Name, Brand, Product_category) define dimension Dim_Date as (id, Date, Day, Day_of_Week, Month, Month_Name, Quarter, Quarter_Name, Year)

Benefits
The primary benefit of star schema is its simplicity for users to write, and databases to process: queries are written with simple inner joins between the facts and a small number of dimensions. Star joins are simpler than possible in snowflake schema. Where conditions need only to filter on the attributes desired, and aggregations are fast.

The star schema is a way to implement multidimensional database (MDDB) functionality using a mainstream relational database: given most organizations' commitment to relational databases, a specialized multidimensional DBMS is likely to be both expensive and inconvenient.

SNOWFLAKES
In computing, a snowflake schema is a logical arrangement of tables in a multidimensional database such that the entity relationship diagram resembles a snowflake in shape. The snowflake schema is represented by centralized fact tables which are connected to multiple dimensions. The snowflake schema is similar to the star schema. However, in the snowflake schema, dimensions are normalized into multiple related tables, whereas the star schema's dimensions are normalized with each dimension represented by a single table. A complex snowflake shape emerges when the dimensions of a snowflake schema are elaborate, having multiple levels of relationships, and the child tables have multiple parent tables ("forks in the road"). The "snowflaking" effect only affects the dimension tables and NOT the fact tables. Star and snowflake schemas are most commonly found in dimensional data warehouses and data marts where speed of data retrieval is more important than the efficiency of data manipulations. As such, the tables in these schemas are not normalized much, and are frequently designed at a level of normalization short of third normal form. Deciding whether to employ a star schema or a snowflake schema should involve considering the relative strengths of the database platform in question and the query tool to be employed. Star schemas should be favored with query tools that largely expose users to the underlying table structures, and in environments where most queries are simpler in nature. Snowflake schemas are often better with more sophisticated query tools that create a layer of abstraction between the users and raw table structures for environments having numerous queries with complex criteria. Normalization splits up data to avoid redundancy (duplication) by moving commonly repeating groups of data into new tables. Normalization therefore tends to increase the number of tables that need to be joined in order to perform a given query, but reduces the space required to hold the data and the number of places where it needs to be updated if the data changes. From a space storage point of view, the dimensional tables are typically small compared to the fact tables. This often removes the storage space benefit of snowflaking the dimension tables, as compared with a star schema.

Snowflake DMQL define cube sales_snowflake [dim_date, dim_product, dim_store]: define dimension dim_store as (id, store_number, Geography_Id(id, state_province,country)) define dimension dim_product as (id, EAN_code, Product_Name, Brand_id(id, Brand), Product_category(id, product_category))

FACT Constellation
FACT Constellation Schema is describes a logical database structure of Data Warehouse or Data Mart. FACT Constellation Schema can design with collection of de-normalized FACT, Shared and Conformed Dimension tables. FACT Constellation Schema is an extended and decomposed STAR schema .FACT Constellation Schema is complicated database design that is difficult to summarize data. FACT Constellation Schema can implement between Aggregate FACT tables or elsewhere to decompose a complex FACT table into independent simplex FACT tables. Sophisticated applications may require multiple fact tables to share dimension tables. This kind of

schema can be viewed as a collection of stars, and hence is called a galaxy schema or a fact constellation.

Fact constellation DMQL de_ne cube sales [Dim_date, Dim_product, Dim_Store]: dollars sold = sum(sales in dollars), units sold = count(*) define dimension Dim_Date as (id, Date, Day, Day_of_Week, Month, Month_Name, Quarter, Quarter_Name, Year) define dimension dim_store as (id, store_number, Geography_Id(id, state_province,country)) define dimension dim_product as (id, EAN_code, Product_Name, Brand_id(id, Brand), Product_category(id, product_category)) define cube Transport [time_key, item_key, transport_key, from location, to location]: dollars cost = sum(cost in dollars), units shipped = count(*) define dimension Dim_Tport as (transport_key, location_key, Transport_name, Store_number)

PLMCE2012 Starring Sakila
100% (2)
PLMCE2012 Starring Sakila
79 pages
Android Penetration Testing Report
No ratings yet
Android Penetration Testing Report
38 pages
Prune Days and Change Capture in Data Warehouse Application Console (DAC)
100% (2)
Prune Days and Change Capture in Data Warehouse Application Console (DAC)
3 pages
How To Download Exported Data File Using API - Cloud Customer Connect
No ratings yet
How To Download Exported Data File Using API - Cloud Customer Connect
8 pages
Database and Data Warehouse Assignment 1
No ratings yet
Database and Data Warehouse Assignment 1
15 pages
Vsphere Esxi Vcenter Server 55 Storage Guide
No ratings yet
Vsphere Esxi Vcenter Server 55 Storage Guide
278 pages
DWH by Concepts - v1
No ratings yet
DWH by Concepts - v1
56 pages
Why Is The Snowflake Schema A Good Data Warehouse Design
No ratings yet
Why Is The Snowflake Schema A Good Data Warehouse Design
19 pages
A Trio of Interesting Snowflakes - Kimball Group
No ratings yet
A Trio of Interesting Snowflakes - Kimball Group
9 pages
DWM Assignment
No ratings yet
DWM Assignment
9 pages
Data Cubemod2
100% (1)
Data Cubemod2
21 pages
Certified List of Candidates: Region X Misamis Oriental Provincial Governor
No ratings yet
Certified List of Candidates: Region X Misamis Oriental Provincial Governor
37 pages
ETL Introduction
No ratings yet
ETL Introduction
44 pages
Data Driven Framework For Degraded Pogo Pin Detection in
No ratings yet
Data Driven Framework For Degraded Pogo Pin Detection in
6 pages
Unit 2 - Data Warehouse Logical Designm
No ratings yet
Unit 2 - Data Warehouse Logical Designm
73 pages
Data Warehousing Concepts 2
No ratings yet
Data Warehousing Concepts 2
26 pages
Cubes Poster - PyCon 2014
100% (1)
Cubes Poster - PyCon 2014
2 pages
What Is The Purpose of Factless Fact Table
No ratings yet
What Is The Purpose of Factless Fact Table
11 pages
Wa de Troubleshooting Guide
No ratings yet
Wa de Troubleshooting Guide
36 pages
03 Etl 081028 2055
No ratings yet
03 Etl 081028 2055
46 pages
File System Questions
No ratings yet
File System Questions
34 pages
BCS-15 Relational Model I
No ratings yet
BCS-15 Relational Model I
22 pages
Logical Modeling SDLC
0% (1)
Logical Modeling SDLC
6 pages
Best Practices For Multi-Dimensional Design Using Cognos 8 Framework Manager
No ratings yet
Best Practices For Multi-Dimensional Design Using Cognos 8 Framework Manager
24 pages
DataCaptureMethodsC3 18mar06
No ratings yet
DataCaptureMethodsC3 18mar06
32 pages
Test Case Review Checklist-Project - Version
No ratings yet
Test Case Review Checklist-Project - Version
12 pages
Best Practices and Solutions For GENESIS64 G64 104
No ratings yet
Best Practices and Solutions For GENESIS64 G64 104
51 pages
Data Warehousing Concepts JSR
No ratings yet
Data Warehousing Concepts JSR
24 pages
Lecture 2 Data Models
No ratings yet
Lecture 2 Data Models
32 pages
DWDM Lecturenotes PDF
No ratings yet
DWDM Lecturenotes PDF
133 pages
Conncetivity To Change Data Capture
No ratings yet
Conncetivity To Change Data Capture
74 pages
Clover ETL - 1
No ratings yet
Clover ETL - 1
29 pages
IBatis Introduction
No ratings yet
IBatis Introduction
9 pages
CS 606 Advanced Database Technology
No ratings yet
CS 606 Advanced Database Technology
46 pages
CDC With HDFS Apply
No ratings yet
CDC With HDFS Apply
10 pages
DW Example
No ratings yet
DW Example
24 pages
CDM Best Practice
No ratings yet
CDM Best Practice
34 pages
What Is The Level of Granularity of A Fact Table
No ratings yet
What Is The Level of Granularity of A Fact Table
15 pages
Schema
No ratings yet
Schema
17 pages
Database Management Systems
No ratings yet
Database Management Systems
5 pages
DW Concepts Shiva
No ratings yet
DW Concepts Shiva
32 pages
Oracle Test To Production Notes
No ratings yet
Oracle Test To Production Notes
24 pages
Building Your ETL Framework With BIML
No ratings yet
Building Your ETL Framework With BIML
19 pages
SQL Commands For Final
No ratings yet
SQL Commands For Final
7 pages
Comparative Performance Analysis of Mysql and SQL Server Relational Database Management Systems in Windows Environment
No ratings yet
Comparative Performance Analysis of Mysql and SQL Server Relational Database Management Systems in Windows Environment
6 pages
Directory Structures and Implementations
No ratings yet
Directory Structures and Implementations
18 pages
COGNOS Guidelines and Best Practices
No ratings yet
COGNOS Guidelines and Best Practices
21 pages
Business Intelligence & Business Performance Mgt.: อภิชาต ชมภูนุช Sunday, June 27, 2010
No ratings yet
Business Intelligence & Business Performance Mgt.: อภิชาต ชมภูนุช Sunday, June 27, 2010
50 pages
Data Warehouse Schemas
No ratings yet
Data Warehouse Schemas
17 pages
How To Remove VSCSI Disk
No ratings yet
How To Remove VSCSI Disk
6 pages
Access Control Snowflake
No ratings yet
Access Control Snowflake
6 pages
How To Invoke Web Services From Odi
No ratings yet
How To Invoke Web Services From Odi
6 pages
Software Testing FAQ: Explain The Software Development Lifecycle
No ratings yet
Software Testing FAQ: Explain The Software Development Lifecycle
30 pages
Web Mining
No ratings yet
Web Mining
73 pages
DWH
No ratings yet
DWH
48 pages
Data Warehouse Conceptual Data Model
No ratings yet
Data Warehouse Conceptual Data Model
6 pages
Ram Manohar Bheemana: Contact About Me
No ratings yet
Ram Manohar Bheemana: Contact About Me
7 pages
Dimensional Modeling PDF
No ratings yet
Dimensional Modeling PDF
14 pages
Data Flow Diagrams and User Stories-IOT
No ratings yet
Data Flow Diagrams and User Stories-IOT
4 pages
Incident Management LVL 100: Introduction To The Incident Management Application in Servicenow
No ratings yet
Incident Management LVL 100: Introduction To The Incident Management Application in Servicenow
11 pages
Upgrade
No ratings yet
Upgrade
12 pages
Definitions of Database Terms
No ratings yet
Definitions of Database Terms
7 pages
Best Practices - ETL
No ratings yet
Best Practices - ETL
3 pages
Budget For Proposed Structure For The Archival System
No ratings yet
Budget For Proposed Structure For The Archival System
5 pages
Factless Fact Table
No ratings yet
Factless Fact Table
5 pages
Data Analyst Roles and Job Descriptions
100% (1)
Data Analyst Roles and Job Descriptions
6 pages
DWH Architecture
No ratings yet
DWH Architecture
3 pages
Data Mining-Data Warehouse
No ratings yet
Data Mining-Data Warehouse
7 pages
Change Data Capture Error 14234
No ratings yet
Change Data Capture Error 14234
2 pages
How To Use MySQL With Erlang
No ratings yet
How To Use MySQL With Erlang
2 pages
Toad 9.0
No ratings yet
Toad 9.0
49 pages
CDCSetup
No ratings yet
CDCSetup
4 pages
Dwques
No ratings yet
Dwques
5 pages
SCD 2
No ratings yet
SCD 2
9 pages
Programmer Date Ad
No ratings yet
Programmer Date Ad
2 pages
Data Mining Unit - 1 Notes
No ratings yet
Data Mining Unit - 1 Notes
16 pages
Muhammad Abdullah - Canva
No ratings yet
Muhammad Abdullah - Canva
3 pages
Employee Queries
No ratings yet
Employee Queries
18 pages
Chapter Nine
No ratings yet
Chapter Nine
36 pages
Abstract
No ratings yet
Abstract
1 page
Business Anaytics Schedule
No ratings yet
Business Anaytics Schedule
1 page
DVD ISO Ripping Instructions: Step 1
No ratings yet
DVD ISO Ripping Instructions: Step 1
1 page
Other Relevant Roadmaps: Postgresql Roadmap Backend Developer Roadmap
No ratings yet
Other Relevant Roadmaps: Postgresql Roadmap Backend Developer Roadmap
1 page
A Framework For ETL Systems Development
No ratings yet
A Framework For ETL Systems Development
16 pages
UNIT-I Notes BBA III Sem
No ratings yet
UNIT-I Notes BBA III Sem
11 pages
Dataware House Strcture
No ratings yet
Dataware House Strcture
13 pages
SDD Template
No ratings yet
SDD Template
7 pages
Data Warehousing FAQ
No ratings yet
Data Warehousing FAQ
5 pages
Lecture Six-Schemas
No ratings yet
Lecture Six-Schemas
5 pages
ALL Problems and Solutions in SAP
No ratings yet
ALL Problems and Solutions in SAP
5 pages
Access Control Policy: Technical
50% (2)
Access Control Policy: Technical
6 pages
Srs For Library Management System
No ratings yet
Srs For Library Management System
1 page
UML Tutorial
100% (1)
UML Tutorial
32 pages
1.4 Storage Notes
No ratings yet
1.4 Storage Notes
2 pages

Schemas For Multidimensional Databases

Uploaded by

Schemas For Multidimensional Databases

Uploaded by

SCHEMAS FOR MULTIDIMENSIONAL DATABASES

You might also like