Chapter 1
Introduction to Data Warehousing
Subject Oriented.
Integrated.
Time Variant.
Non-Volatile.
Financial Services.
Banking Services.
Consumer Goods.
Retail Sector.
Controlled Manufacturing.
Information Processing.
Analytical Processing.
Data Mining.
Data warehouse technology is a fast, relatively new tool for meeting our
future information needs.
Data warehousing combines information collected from multiple sources into
one comprehensive database.
A data warehouse is subject oriented because it provides information around
a subject rather than around an organization's ongoing operations.
A data warehouse is essentially a database that is kept separate from an
organization's operational database.
The warehouse can be used for extensive analytical processing, and the
analytics are performed on the data stored in the warehouse.
It supports OLAP operations such as drill down and drill up, which enhance
the results of the analysis.
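Drill up and drill down can be illustrated with a small sketch. The sales
records, the field names, and the aggregate() helper below are hypothetical
and exist only for illustration; a real OLAP tool would run the same kind of
grouping inside the warehouse.

```python
from collections import defaultdict

# Hypothetical sales facts: (year, quarter, region, amount).
SALES = [
    (2023, "Q1", "North", 100),
    (2023, "Q1", "South", 150),
    (2023, "Q2", "North", 120),
    (2024, "Q1", "North", 130),
    (2024, "Q1", "South", 170),
]

# Names of the dimension columns, in the same order as in each record.
FIELDS = ("year", "quarter", "region")


def aggregate(rows, levels):
    """Sum the amount column, grouped by the given dimension levels."""
    positions = [FIELDS.index(name) for name in levels]
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[i] for i in positions)
        totals[key] += row[3]
    return dict(totals)


# Drill up: fewer levels, coarser summary.
by_year = aggregate(SALES, ["year"])

# Drill down: add a level to see more detail under each year.
by_year_quarter = aggregate(SALES, ["year", "quarter"])

print(by_year)
print(by_year_quarter)
```

Moving from by_year to by_year_quarter is a drill down; going back the other
way is a drill up. The underlying data does not change, only the level of
detail at which it is summarized.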
Data Extraction.
Data Cleaning.
Data Transformation.
Data Loading.
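The four steps listed above can be sketched as a small pipeline. This is a
minimal illustration, not a production design: the two source lists, the
sales table, and the cleaning rule (drop rows with a non-numeric amount) are
all assumptions, and an in-memory SQLite database stands in for the
warehouse.

```python
import sqlite3

# Hypothetical raw records from two source systems.
SOURCE_A = [
    {"id": "1", "name": " alice ", "amount": "10.50"},
    {"id": "2", "name": "Bob", "amount": "n/a"},
]
SOURCE_B = [
    {"id": "3", "name": "carol", "amount": "7"},
]


def extract():
    """Data extraction: gather rows from every source."""
    return SOURCE_A + SOURCE_B


def clean(rows):
    """Data cleaning: drop rows whose amount is not numeric."""
    cleaned = []
    for row in rows:
        try:
            float(row["amount"])
        except ValueError:
            continue
        cleaned.append(row)
    return cleaned


def transform(rows):
    """Data transformation: convert types and normalize names."""
    return [
        (int(r["id"]), r["name"].strip().title(), float(r["amount"]))
        for r in rows
    ]


def load(conn, rows):
    """Data loading: write the prepared rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(id INTEGER PRIMARY KEY, name TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(clean(extract())))
loaded = conn.execute("SELECT id, name, amount FROM sales ORDER BY id").fetchall()
print(loaded)
```

Keeping each step in its own function mirrors the list above and makes it
easy to replace one step, for example a stricter cleaning rule, without
touching the others.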
The data warehouse has staying power because of its central concept: one
store of data collected from dozens or hundreds of databases, applications,
and other source systems.
It is the most efficient way for companies to get an enterprise-wide view of
their customers, supply chain, sales, and operations.
In today's world of instant access by many different users and customers,
data is no longer tucked away neatly in a big warehouse.
The trend is toward always-on, accessible, and very open storage that is
fast and friendly for customers yet complex and deep enough for the most
important data.
The data storage layer is where data that was cleaned in the staging area is
stored in a single central place.
Depending on your business and your data warehouse architecture
requirements, your data storage may be a data warehouse, a data mart, or an
Operational Data Store (ODS).
The presentation layer is where users interact with the cleaned and
organized data.
This layer of the data warehouse architecture gives users the ability to
query the data for product or service insights, analyze the data against
business scenarios, and develop automated or ad hoc reports.
You may use an OLAP or reporting tool with a user-friendly Graphical User
Interface (GUI) to help users build their queries, perform analysis, or
design their reports.
The data staging layer resides between the data sources and the data
warehouse.
In this layer, data is extracted from different internal and external data
sources.
The data extraction layer utilizes multiple technologies and tools to
extract the required data.
Once the extract has been loaded, it is subjected to high-level data
quality checks.
The staging layer contains the following components:
1) Landing Database & Staging Area.
2) Data Integration Tool.
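The high-level data quality checks applied to a loaded extract might look
like the sketch below. The two rules shown (required fields must be present
and the key must be unique) and the quality_check() helper are assumptions
chosen for illustration; the actual checks depend on the data integration
tool and the business rules in use.

```python
def quality_check(rows, required, key):
    """Return (row_index, problem) pairs found in a staged extract."""
    problems = []
    seen = set()
    for index, row in enumerate(rows):
        # Rule 1: every required field must be present and non-empty.
        for field in required:
            if row.get(field) in (None, ""):
                problems.append((index, f"missing {field}"))
        # Rule 2: the key value must not repeat within the extract.
        value = row.get(key)
        if value in seen:
            problems.append((index, f"duplicate {key}"))
        seen.add(value)
    return problems


# Hypothetical rows sitting in the staging area.
STAGED = [
    {"order_id": 1, "customer": "A", "total": 20.0},
    {"order_id": 2, "customer": "", "total": 5.0},
    {"order_id": 1, "customer": "B", "total": 9.0},
]

issues = quality_check(
    STAGED, required=("order_id", "customer", "total"), key="order_id"
)
print(issues)
```

Rows that fail a check can then be rejected or corrected in the staging area
before they reach the data warehouse.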
A data warehouse is a relational database that is designed for query and
analysis rather than for transaction processing.
It usually contains historical data derived from transaction data, but it
can include data from other sources.
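The difference between analysis and transaction processing can be seen in
the kind of query a warehouse answers. The sketch below uses an in-memory
SQLite database with a hypothetical sales_history table; the table name,
columns, and rows are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical historical data derived from past sales transactions.
conn.execute(
    "CREATE TABLE sales_history (sale_date TEXT, product TEXT, amount REAL)"
)
conn.executemany(
    "INSERT INTO sales_history VALUES (?, ?, ?)",
    [
        ("2023-01-15", "widget", 100.0),
        ("2023-06-02", "widget", 50.0),
        ("2024-02-10", "widget", 80.0),
        ("2024-03-01", "gadget", 40.0),
    ],
)

# An analytical query summarizes many historical rows at once,
# instead of reading or updating a single transaction.
yearly = conn.execute(
    "SELECT substr(sale_date, 1, 4) AS year, product, SUM(amount) "
    "FROM sales_history "
    "GROUP BY year, product "
    "ORDER BY year, product"
).fetchall()

for row in yearly:
    print(row)
```

A transaction system would typically insert or update one sale at a time;
the query here instead reads across years of history to produce totals per
year and product.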
1) Top Tier
2) Middle Tier
3) Bottom Tier
The bottom tier of the architecture is the data warehouse database server.
We use back-end tools and utilities to feed data into the bottom tier.
These back-end tools and utilities perform the extract, clean, load, and
refresh functions for the data warehouse (DWH).
The data warehouse server fetches relevant information based on data mining
and user requests.
A data flow architecture reduces development time and makes it easy to move
between design and implementation.
Data flow architecture is a computer architecture that directly contrasts
with the traditional architecture, or control flow architecture.
In a data flow architecture, data can enter a graph topology that contains
cycles.
A data flow architecture does not have a program counter; the execution of
instructions is determined by the availability of their input data.
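The idea of execution driven by data availability, with no program counter,
can be sketched in software. The run_dataflow() function and the three-node
graph below are hypothetical and only model the concept; for simplicity the
sketch assumes a graph without cycles and reports an error if no node can
fire.

```python
def run_dataflow(nodes, inputs):
    """Fire each node as soon as all of its inputs are available.

    nodes maps a node name to (function, list of input names).
    inputs maps the initial value names to their values.
    """
    values = dict(inputs)
    pending = dict(nodes)
    order = []
    while pending:
        # A node is ready when every value it depends on exists.
        ready = [
            name
            for name, (_, deps) in pending.items()
            if all(dep in values for dep in deps)
        ]
        if not ready:
            raise ValueError("no node can fire: missing input or cycle")
        for name in ready:
            func, deps = pending.pop(name)
            values[name] = func(*(values[dep] for dep in deps))
            order.append(name)
    return values, order


# The nodes are declared out of order on purpose: the firing order
# comes from data availability, not from position in the listing.
NODES = {
    "total": (lambda s, p: s + p, ["sum", "prod"]),
    "sum": (lambda a, b: a + b, ["a", "b"]),
    "prod": (lambda a, b: a * b, ["a", "b"]),
}

values, order = run_dataflow(NODES, {"a": 2, "b": 3})
print(order)
print(values["total"])
```

Although "total" is listed first, it fires last because its inputs only
become available after "sum" and "prod" have produced their results.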
There are benefits and drawbacks to each type of data.
For example, the data that you can access in the data warehouse is more
complex and can represent a greater number and variety of relationships, but
it can take longer to collect and access than live data.
Data flow architectures that are deterministic in nature enable programs to
manage complex tasks such as processor load balancing, synchronization, and
access to common resources.
Several processes enable reports to access application data in order to
produce report output. Live reporting data is accessed directly from the
application using the API.