ETL Basics Lesson 01

Uploaded by

singh.abhi.abhi7

ETL Basics Basic Concepts

Data marts are treated as subsets of the data warehouse. Each data
mart is built for an individual department and is optimized for the
analysis needs of that department. The data flow in the top-down OLAP
environment begins with data extraction from the operational data
sources. This data is loaded into the staging area, where it is
validated and consolidated. The data from the staging area is then
loaded into the data warehouse.
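
The extract, validate, consolidate and load steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the source tables, the field names (order_id, dept, amount) and the validation rule are hypothetical and do not come from the lesson.

```python
# Hypothetical operational sources; names and fields are illustrative only.
SALES_SOURCE = [
    {"order_id": 1, "dept": "sales", "amount": 120.0},
    {"order_id": 2, "dept": "sales", "amount": None},   # fails validation
]
SUPPORT_SOURCE = [
    {"order_id": 1, "dept": "sales", "amount": 120.0},  # duplicate of a sales row
    {"order_id": 3, "dept": "support", "amount": 40.0},
]

def extract(*sources):
    """Copy rows from every operational source into a staging list."""
    return [dict(row) for source in sources for row in source]

def validate(staged):
    """Keep only rows whose amount is present and non-negative (assumed rule)."""
    return [r for r in staged if r["amount"] is not None and r["amount"] >= 0]

def consolidate(staged):
    """Collapse rows that share an order_id, keeping the first one seen."""
    seen = {}
    for row in staged:
        seen.setdefault(row["order_id"], row)
    return list(seen.values())

def load(warehouse, rows):
    """Append the cleaned rows to the warehouse table."""
    warehouse.extend(rows)
    return warehouse

staging = extract(SALES_SOURCE, SUPPORT_SOURCE)
warehouse = load([], consolidate(validate(staging)))
```

Here the staging list holds all four extracted rows, while only the validated and de-duplicated rows reach the warehouse list.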

The data flow in the top-down OLAP environment begins with data
extraction from the operational data sources. This data is loaded into
the staging area, where it is validated and consolidated to ensure a
level of accuracy, and then transferred to the Operational Data Store
(ODS). The ODS stage is sometimes skipped if it is a replication of the
operational databases. Data is also loaded into the data warehouse in
a parallel process to avoid extracting it from the ODS.
Detailed data is regularly extracted from the ODS and temporarily
hosted in the staging area for aggregation and summarization, and is
then extracted and loaded into the data warehouse. The need for an
ODS is determined by the needs of the business. If detailed data is
needed in the data warehouse, the existence of an ODS is considered
justified. Otherwise, organizations may do away with the ODS
altogether.
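
As a rough illustration of the aggregation and summarization step, the sketch below rolls detailed ODS rows up to one total per day and product before they would be loaded into the data warehouse. The row layout (day, product, qty) and the choice of grouping keys are assumptions made for the example, not part of the lesson.

```python
from collections import defaultdict

# Hypothetical detailed rows as they might sit in an ODS.
ods_rows = [
    {"day": "2024-01-01", "product": "A", "qty": 2},
    {"day": "2024-01-01", "product": "A", "qty": 3},
    {"day": "2024-01-01", "product": "B", "qty": 1},
    {"day": "2024-01-02", "product": "A", "qty": 4},
]

def summarize(rows):
    """Aggregate detailed rows to one total quantity per (day, product)."""
    totals = defaultdict(int)
    for row in rows:
        totals[(row["day"], row["product"])] += row["qty"]
    return [
        {"day": day, "product": product, "total_qty": qty}
        for (day, product), qty in sorted(totals.items())
    ]

staging = list(ods_rows)                 # temporary copy held in the staging area
warehouse_summary = summarize(staging)   # rows ready to load into the warehouse
```

The four detailed rows collapse into three summary rows, one per distinct day and product pair.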
Once the data warehouse aggregation and summarization processes
are complete, the data mart refresh cycles extract the data from the
data warehouse into the staging area and perform a new set of
transformations on it. This organizes the data into the particular
structures required by the data marts. The data marts can then be
loaded with the data, and the OLAP environment becomes available to
the users.
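
A data mart refresh of this kind might look like the following sketch, which pulls one department's rows out of the warehouse and reshapes them into a region-to-revenue structure. The warehouse rows, the refresh_mart helper and the target structure are hypothetical choices for illustration only.

```python
# Hypothetical warehouse rows; departments and fields are illustrative.
warehouse = [
    {"dept": "sales", "region": "north", "revenue": 100},
    {"dept": "sales", "region": "south", "revenue": 150},
    {"dept": "sales", "region": "north", "revenue": 25},
    {"dept": "finance", "region": "north", "revenue": 80},
]

def refresh_mart(warehouse_rows, dept):
    """Stage one department's rows and reshape them as revenue per region."""
    staged = [r for r in warehouse_rows if r["dept"] == dept]
    mart = {}
    for row in staged:
        mart[row["region"]] = mart.get(row["region"], 0) + row["revenue"]
    return mart

sales_mart = refresh_mart(warehouse, "sales")
finance_mart = refresh_mart(warehouse, "finance")
```

Each mart ends up holding only its own department's data, in the shape that department is assumed to query.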
The data in a data warehouse is time-variant in nature, as it contains
historical data. Inmon proposes a top-down approach to create a
centralized Enterprise Data Warehouse using traditional database
modeling techniques (ER model), where the data is stored in third
normal form (3NF). The data warehouse acts as the data source for the
new data marts.

The bottom-up approach reverses the positions of the data warehouse
and the data marts. Data marts are loaded directly with the data from
the operational systems through the staging area. The ODS may or
may not exist, depending on the business requirements.
The data flow in the bottom-up approach starts with extraction of data
from the operational databases into the staging area, where it is
processed and consolidated and then loaded into the ODS. The data in
the ODS is appended to or replaced by the fresh data being loaded.
After the ODS is refreshed, the current data is once again extracted
into the staging area and processed to fit the data mart structure. The
data from the data marts is then extracted to the staging area,
aggregated, summarized and otherwise processed, and loaded into the
data warehouse, where it is made available to end users for analysis.
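
The bottom-up sequence described above (refresh the ODS, build the data marts, then derive the warehouse from the marts) can be sketched as follows. The function names, the "append"/"replace" mode flag and the per-department totals are assumptions introduced for the example.

```python
def refresh_ods(ods, fresh_rows, mode="append"):
    """Return the refreshed ODS: fresh rows are appended to or replace the old ones."""
    if mode == "append":
        return ods + fresh_rows
    if mode == "replace":
        return list(fresh_rows)
    raise ValueError(f"unknown mode: {mode}")

def build_marts(ods):
    """Split the current ODS rows into one data mart per department."""
    marts = {}
    for row in ods:
        marts.setdefault(row["dept"], []).append(row)
    return marts

def build_warehouse(marts):
    """Aggregate every mart into a summarized warehouse total per department."""
    return {dept: sum(r["amount"] for r in rows) for dept, rows in marts.items()}

# Hypothetical data: one existing ODS row plus a fresh load.
ods = [{"dept": "sales", "amount": 10}]
fresh = [{"dept": "sales", "amount": 5}, {"dept": "hr", "amount": 7}]

ods = refresh_ods(ods, fresh, mode="append")
marts = build_marts(ods)
warehouse = build_warehouse(marts)
```

Note the ordering: the marts are populated first and the warehouse is derived from them, which is the reverse of the top-down flow.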
