Why We Need Data Warehouse?
Why We Need Data Warehouse?
Data Warehouse
What is Data Warehouse?
A data warehouse is a type of data management system that is designed to enable and support
business intelligence (BI) activities, especially analytics. Data warehouses are solely intended to
perform queries and analysis and often contain large amounts of historical data.
A data warehouse centralizes and consolidates large amounts of data from multiple sources. Its
analytical capabilities allow organizations to derive valuable business insights from their data to
improve decision-making. Over time, it builds a historical record that can be invaluable to data
scientists and business analysts. Because of these capabilities, a data warehouse can be considered
an organization’s “single source of truth.”
HDFC Bank
Application Data Warehouse
1-1-2000 ………………………………………data
Application
(Front end) 2-1-2000…………………………………………data
……………………………………………………………….
……………………………………………………………….
……………………………………………………………….
DB ETL ……………………………………………………………….
1-1-2000 …….
……………………………………………………………….
2-2-2000…….
……………………………………………………………….
…..
……………………………………………………………….
……
……………………………………………………………….
…..
……………………………………………………………….
1-1-2021
1-1-2021……………………………………….data…
Data….
Created by HemantGautam
2
Data Warehouse
Data Warehouse Concepts-
Subject oriented- They can analyse data about a particular subject or functional area (such
as sales).
Integrated- Data warehouses create consistency among different data types from disparate
sources.
Time-variants- Data warehouse analysis looks at change over time.
Non-volatile- Once data is in a data warehouse, it’s stable and doesn’t change.
Integrated:-
A
Source-1
Source-2
Source-3
Server
Source-1
Source-2
ETL Report
Source-3
Data Warehouse
Server
Source-1
Source-2
Source-3
Server
Integration of data warehouse
BI
Created by HemantGautam
3
Data Warehouse
OLTP vs. OLAP
Transaction System (OLTP) Target System (OLAP)
1. Transaction database is used to maintain 1. Data warehouse Database is to maintain
daily transaction activity of particular historical data to generate the report business
business. analysis purpose.
2. OLTP database is not a subject oriented, is 2. Data warehouse a subject oriented, time
not integrated, and is not time variant and it’s variant and non-volatile collection of data.
a volatile database.
Staging:- In the staging, first we can generate the data from source code after that exact the data
in stating (temporary database) to cleaning data as per business requirement after cleaning in
temporary database we will load the data in data warehouse and then generate the report.
Transform
Source
Table A
Table A Table B
Table C
Table B Cleaning
Table Report
Table C
Table
Created by HemantGautam
4
Data Warehouse
Database vs. Data Warehouse
A database is like a filing cabinet that is built primarily for fast queries and transaction processing,
but not analytics. A database typically serves as the focused data store for a specific application,
whereas a data warehouse stores data from any number of applications in an organization.
A database focuses on capturing real-time data while a data warehouse has a broader scope,
capturing historical data—but sometimes current data—for predictive analytics, machine learning
and other advanced analysis.
A data warehouse can centralize data from various data sources, such as transactional systems,
operational databases and flat files. It then cleanses this operational data, eliminates duplicates and
standardizes it to create a single source of truth that gives an organization a comprehensive, reliable
view of enterprise data.
Deeper insights
When data is locked in disparate sources, it might limit the ability of decision makers to derive
insights and set business strategies with confidence. A data warehouse with one central repository
enables business users to draw all of an organization’s pertinent data into business decision-
making.
By running reports on historical data, a data warehouse can help determine which systems and
processes are working and what needs improvement.
Data warehouses make it possible to discover and report on themes, trends and aggregations. Data
professionals and business leaders can use these insights to make better-informed decisions based
on hard evidence in virtually every area of the organization, from business processes to financial
management and inventory management.
Created by HemantGautam