Data Warehousing
A Technology Marvel
• Introduction
• Business Need Beyond Reporting
• Traditional Approaches
• Definition
• Data Classification
• Components of Data Warehouse
• Benefits
• Tools for Data Warehousing
• Data Modeling Terminologies
• Schemas
– Star Schema
– Snowflake Schema
Scenario 1
• Why did your company make less profit than in the previous year?
Business Need Beyond Reporting
Scenario 2
• You are a frequent traveler.
• You have a savings account with ABC Bank Pvt. Ltd.
• You use your bank's ATM card to buy your air tickets.
• One day you receive an exciting offer from the bank: a 15 percent discount on
all air tickets booked using the bank's ATM card.
• Sounds fascinating, doesn't it?
• How did your bank get to know the nature of your transactions?
Traditional Approaches
• Programs were written to analyze the data stored on
tapes or on mainframes.
Data warehousing holds the key to all these questions.
Defining Data Warehouse
According to Bill Inmon, known as the father of data warehousing, a data
warehouse is a subject-oriented, integrated, time-variant, nonvolatile
collection of data in support of management decisions.
Data classification
Data falls into two classes:
• Operational Data: used for operational processing
• Informational Data: used for analytical processing
Informational & Operational Data
Data warehouse compared with OLTP DB:
• Typical operation
– Data warehouse: A query scans thousands or millions of rows. For example: "Find the total sales of last month."
– OLTP DB: Accesses only a handful of records. For example: "Retrieve the current order for this customer."
• Data modification
– Data warehouse: Updated on a regular basis. The end users of a data warehouse do not directly update the data.
– OLTP DB: Always up to date, and reflects the current state of each business transaction.
• Historical data
– Data warehouse: Usually stores many months or years of data.
– OLTP DB: Usually stores data from only a few weeks or months.
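The two "typical operation" examples above can be sketched side by side. This is a minimal illustration using Python's built-in sqlite3 module. The orders table, its columns, and the sample rows are assumptions made up for this sketch, and a real warehouse would not normally share a table with the OLTP system.

```python
import sqlite3

# Hypothetical orders table, used only to contrast the two access patterns.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, "
    "order_date TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, 101, "2024-01-05", 250.0),
        (2, 102, "2024-01-17", 100.0),
        (3, 101, "2024-02-02", 75.0),
    ],
)


def current_order(customer_id):
    """OLTP-style lookup: fetch the latest order of a single customer."""
    return conn.execute(
        "SELECT order_id, amount FROM orders WHERE customer_id = ? "
        "ORDER BY order_date DESC LIMIT 1",
        (customer_id,),
    ).fetchone()


def total_sales(month):
    """Warehouse-style query: aggregate over every order in a month (YYYY-MM)."""
    return conn.execute(
        "SELECT COALESCE(SUM(amount), 0) FROM orders "
        "WHERE substr(order_date, 1, 7) = ?",
        (month,),
    ).fetchone()[0]
```

The lookup returns one row for one customer, while the aggregate has to read every order in the period. That difference in access pattern is what separates the two kinds of system.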
Components of Data Warehouse
A data warehouse typically comprises the following
components:
Source Data Layer & Data Transformation Layer
ETL is the process of extracting, transforming, and loading data
into the data warehouse.
• EXTRACTION: Data is extracted from the source. There can be
more than one source.
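The three ETL steps can be sketched as three small functions. This is a minimal illustration, not a production pipeline. The two sources (a CSV export and a list of web records), their field names, and the sales table are all hypothetical.

```python
import csv
import io
import sqlite3

# Two hypothetical sources with slightly different layouts.
STORE_CSV = "city,amount\njalandhar,200\nDelhi,150\n"
WEB_ROWS = [{"City": " Jalandhar ", "Amount": "300"}]


def extract():
    """Extract: read raw rows from more than one source."""
    rows = [(r["city"], r["amount"]) for r in csv.DictReader(io.StringIO(STORE_CSV))]
    rows += [(r["City"], r["Amount"]) for r in WEB_ROWS]
    return rows


def transform(rows):
    """Transform: bring every row to one consistent format."""
    return [(city.strip().title(), float(amount)) for city, amount in rows]


def load(rows, conn):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (city TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(transform(extract()), warehouse)
```

After loading, both spellings of the city name from the two sources end up as the same value, which is the kind of integration the transform step is meant to provide.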
Data Flow (Data Warehousing Layer)
A data mart:
• Is a scaled-down version of the DWH, designed for a particular line of business.
• Focuses on one subject area or on a single group of users.
[Figure: data flows from the OLTP sources (Orders, Billing, Product, Customer) into the DWH, and from the DWH into the data marts (Finance, Marketing, Customer Service).]
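The idea of a data mart as a subject-specific slice of the warehouse can be sketched with a database view. The dwh_facts table, its columns, and the finance_mart view are hypothetical names for illustration. In practice a data mart is often a separate physical store rather than a view, so this is a simplification.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical warehouse table that mixes several subject areas.
conn.execute("CREATE TABLE dwh_facts (subject TEXT, item TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO dwh_facts VALUES (?, ?, ?)",
    [
        ("finance", "invoice-1", 1200.0),
        ("marketing", "campaign-1", 300.0),
        ("finance", "invoice-2", 800.0),
    ],
)

# The "data mart" here is a view restricted to one subject area.
conn.execute(
    "CREATE VIEW finance_mart AS "
    "SELECT item, amount FROM dwh_facts WHERE subject = 'finance'"
)

finance_rows = conn.execute(
    "SELECT item, amount FROM finance_mart ORDER BY item"
).fetchall()
```

Users of the finance mart see only finance rows and never need to know about the other subject areas in the warehouse.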
Reporting Layer
• Reporting is the process of developing and producing
business reports from data warehouse data.
Data Warehousing – End to End
Benefits
Data Warehouse –
Tools Available for Data Warehousing
[Figure: a fact (the value 500) annotated with one of its dimensions, "3. Sold in Jalandhar" (City).]
Data Modeling Terminologies
Data Warehouse Schemas
– Star Schema
• A star schema is a relational database schema for representing
multidimensional data. At the center of the star is a large
fact table, which references the surrounding dimension tables.
– Snowflake Schema
• A snowflake schema is a variation on the star schema in
which very large dimension tables are normalized into
multiple tables. Dimensions with hierarchies can be
decomposed into a snowflake structure when the dimension
tables need to be normalized to save space.
The snowflake approach increases the number of joins,
which can slow down data retrieval.
Example of a Star Schema
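A minimal sketch of a star schema, built with Python's sqlite3 module. The table names (fact_sales, dim_city, dim_product), their columns, and the sample rows are assumptions chosen for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_city    (city_id INTEGER PRIMARY KEY, city_name TEXT);
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_name TEXT);
-- The fact table holds the measure plus one foreign key per dimension.
CREATE TABLE fact_sales (
    city_id    INTEGER REFERENCES dim_city(city_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    units_sold INTEGER
);
INSERT INTO dim_city VALUES (1, 'Jalandhar'), (2, 'Delhi');
INSERT INTO dim_product VALUES (1, 'Widget');
INSERT INTO fact_sales VALUES (1, 1, 500), (2, 1, 120);
""")


def units_by_city():
    """Total units per city: a single join from the fact table to one dimension."""
    return conn.execute(
        "SELECT c.city_name, SUM(f.units_sold) "
        "FROM fact_sales f JOIN dim_city c ON c.city_id = f.city_id "
        "GROUP BY c.city_name ORDER BY c.city_name"
    ).fetchall()
```

Each dimension sits exactly one join away from the fact table, which is what gives the schema its star shape.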
Example of a Snowflake Schema
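A minimal sketch of a snowflake schema, again with Python's sqlite3 module. Here a hypothetical city dimension is normalized so that state names live in their own table. All table names, columns, and rows are assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- The city dimension is normalized: state names live in their own table.
CREATE TABLE dim_state (state_id INTEGER PRIMARY KEY, state_name TEXT);
CREATE TABLE dim_city (
    city_id   INTEGER PRIMARY KEY,
    city_name TEXT,
    state_id  INTEGER REFERENCES dim_state(state_id)
);
CREATE TABLE fact_sales (
    city_id    INTEGER REFERENCES dim_city(city_id),
    units_sold INTEGER
);
INSERT INTO dim_state VALUES (1, 'Punjab');
INSERT INTO dim_city VALUES (1, 'Jalandhar', 1), (2, 'Amritsar', 1);
INSERT INTO fact_sales VALUES (1, 500), (2, 200);
""")


def units_by_state():
    """Total units per state: needs two joins, fact -> city -> state."""
    return conn.execute(
        "SELECT s.state_name, SUM(f.units_sold) "
        "FROM fact_sales f "
        "JOIN dim_city c ON c.city_id = f.city_id "
        "JOIN dim_state s ON s.state_id = c.state_id "
        "GROUP BY s.state_name"
    ).fetchall()
```

The state name is stored once instead of being repeated on every city row, at the cost of an extra join whenever a query needs the state level.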
Thank You