Data Warehouse: Concepts, Architecture and Components

The document defines a data warehouse as a system that contains historical and cumulative data from single or multiple sources to simplify reporting and analysis for decision making. It describes the key characteristics of a data warehouse as being subject-oriented, integrated, time-variant, and non-volatile. The document also outlines the typical three-tier architecture for a data warehouse consisting of a bottom tier database, middle tier OLAP server, and top tier front-end tools. It notes that data warehouses contain current, historical, and summarized data as well as metadata.

Uploaded by

nandini swami


Data Warehouse

Concepts, Architecture and Components

Definition:

A data warehouse is an information system that contains historical and cumulative data from a single source or
multiple sources. It simplifies the organization's reporting and analysis processes.
It also serves as a single version of the truth for a company's decision making and forecasting.

Characteristics of Data warehouse

A data warehouse has the following characteristics:

• Subject-Oriented
• Integrated
• Time-variant
• Non-volatile

Subject-Oriented
A data warehouse is subject-oriented because it offers information about a particular theme rather than about a
company's ongoing operations. These subjects can be sales, marketing, distribution, etc.
A data warehouse does not focus on ongoing operations. Instead, it puts emphasis on modeling and
analysis of data for decision making. It also provides a simple and concise view of a specific subject
by excluding data that is not helpful in supporting the decision process.

Integrated
In a data warehouse, integration means establishing a common unit of measure for all similar data
from dissimilar databases. The data also needs to be stored in the data warehouse in a common and
universally acceptable manner.
A data warehouse is developed by integrating data from varied sources such as mainframes, relational
databases, and flat files. It must keep naming conventions, formats, coding, attribute measures, and
encoding structures consistent. This consistency is what makes effective analysis of the data possible.
Consider an example with three different applications labeled A, B, and C. The information stored in these
applications is Gender, Date, and Balance. However, each application stores its data in a different way:
• In Application A, the gender field stores logical values like M or F.
• In Application B, the gender field is a numerical value.
• In Application C, the gender field is stored in the form of a character value.
• The same is the case with Date and Balance.
After the transformation and cleaning process, however, all this data is stored in a common format in the data
warehouse.
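The transformation step for applications A, B, and C might be sketched as follows. This is only an illustration: the specific numeric and character codes, the date formats, and the function name to_warehouse_row are assumptions, since the notes say only that each application encodes the same fields differently.

```python
from datetime import date, datetime

# Assumed source encodings (hypothetical). The notes state only that A uses
# M/F, B uses a numeric value, and C uses a character value.
GENDER_MAPS = {
    "A": {"M": "M", "F": "F"},
    "B": {1: "M", 0: "F"},
    "C": {"male": "M", "female": "F"},
}

# Assumed per-application date formats (hypothetical).
DATE_FORMATS = {
    "A": "%Y-%m-%d",
    "B": "%d/%m/%Y",
    "C": "%m-%d-%Y",
}


def to_warehouse_row(app, record):
    """Convert one source record into a single common warehouse format."""
    gender = GENDER_MAPS[app][record["gender"]]
    day = datetime.strptime(record["date"], DATE_FORMATS[app]).date()
    balance = round(float(record["balance"]), 2)
    return {"source": app, "gender": gender, "date": day, "balance": balance}


rows = [
    to_warehouse_row("A", {"gender": "F", "date": "2020-01-31", "balance": "250"}),
    to_warehouse_row("B", {"gender": 1, "date": "31/01/2020", "balance": "100.5"}),
    to_warehouse_row("C", {"gender": "female", "date": "01-31-2020", "balance": 75}),
]
```

After this step every row uses the same gender code, a real date value, and a numeric balance, whichever application it came from.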

Time-Variant
The time horizon of a data warehouse is quite extensive compared with that of operational systems. The data
collected in a data warehouse is identified with a particular period and offers information from a
historical point of view. It contains an element of time, explicitly or implicitly.
One place where data warehouse data displays time variance is in the structure of the record key. Every
primary key contained in the data warehouse should have, either implicitly or explicitly, an element of time,
such as the day, week, or month.
Another aspect of time variance is that once data is inserted into the warehouse, it cannot be updated or changed.
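A record key with an explicit element of time could look like the sketch below. The names BalanceKey, load_snapshot, and warehouse are hypothetical, and an in-memory dictionary stands in for a warehouse table.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BalanceKey:
    """Record key that carries an explicit element of time."""
    account_id: int
    snapshot_date: date


# Hypothetical in-memory stand-in for a warehouse table.
warehouse = {}


def load_snapshot(account_id, snapshot_date, balance):
    """Insert a new snapshot; rows that already exist are never changed."""
    key = BalanceKey(account_id, snapshot_date)
    if key in warehouse:
        raise ValueError("snapshot already loaded; existing rows are not changed")
    warehouse[key] = balance


load_snapshot(42, date(2024, 1, 31), 1000.0)
load_snapshot(42, date(2024, 2, 29), 1250.0)

# The history of one account is simply every row that shares its account_id.
history = sorted(
    (key.snapshot_date, balance)
    for key, balance in warehouse.items()
    if key.account_id == 42
)
```

Because the date is part of the key, a new month adds a new row instead of overwriting the old one, which is what keeps the historical view available.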

Non-volatile
A data warehouse is also non-volatile, which means that previous data is not erased when new data is entered into it.
Data is read-only and periodically refreshed. This also helps to analyze historical data and understand what
happened and when. It does not require transaction processing, recovery, or concurrency control mechanisms.
Activities like delete, update, and insert, which are performed in an operational application environment, are
omitted in a data warehouse environment. Only two types of data operations are performed in data
warehousing:
1. Data loading
2. Data access
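The two permitted operations can be illustrated with a minimal append-only store. The class name NonVolatileStore and its methods are hypothetical, and a real warehouse would enforce this through its load process and access permissions rather than in application code.

```python
class NonVolatileStore:
    """Append-only store that exposes only data loading and data access."""

    def __init__(self):
        self._rows = []

    def load(self, batch):
        # Data loading: new rows are appended; earlier rows are never erased.
        self._rows.extend(dict(row) for row in batch)

    def access(self, predicate=lambda row: True):
        # Data access: return copies so callers cannot modify stored rows.
        return [dict(row) for row in self._rows if predicate(row)]


store = NonVolatileStore()
store.load([{"month": "2024-01", "sales": 100}])
store.load([{"month": "2024-02", "sales": 120}])  # periodic refresh adds data
january = store.access(lambda row: row["month"] == "2024-01")
```

There is deliberately no update or delete method, so the January figures remain available after the February load.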

Data Warehouse Architectures

There are mainly three types of data warehouse architectures:

Single-tier architecture
The objective of a single layer is to minimize the amount of data stored by removing data
redundancy. This architecture is not frequently used in practice.

Two-tier architecture
A two-layer architecture physically separates the available sources from the data warehouse. This architecture is
not expandable and does not support a large number of end users. It also has connectivity problems because
of network limitations.

Three-tier architecture
This is the most widely used architecture.
It consists of the top, middle, and bottom tiers.
1. Bottom Tier: The database of the data warehouse serves as the bottom tier. It is usually a relational
database system. Data is cleansed, transformed, and loaded into this layer using back-end tools.
2. Middle Tier: The middle tier in a data warehouse is an OLAP server, which is implemented using either
the ROLAP or the MOLAP model. For a user, this application tier presents an abstracted view of the
database. This layer also acts as a mediator between the end user and the database.
3. Top Tier: The top tier is a front-end client layer. It consists of the tools and APIs that you use to connect to
the data warehouse and get data out of it. These could be query tools, reporting tools, managed query tools,
analysis tools, and data mining tools.
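The way the three tiers divide the work can be sketched in a few lines, using Python's built-in sqlite3 module as a stand-in for the bottom-tier relational database. The sales table, its rows, and the functions total_by and report are hypothetical. The middle tier here is only a toy in the spirit of ROLAP, where an aggregate request is translated into SQL; it is not a real OLAP server.

```python
import sqlite3

# Bottom tier: a relational database holding cleansed, loaded data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, month TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("North", "2024-01", 100.0),
        ("North", "2024-02", 150.0),
        ("South", "2024-01", 80.0),
    ],
)


# Middle tier: presents an abstracted view and turns requests into SQL.
def total_by(dimension):
    if dimension not in {"region", "month"}:
        raise ValueError(f"unknown dimension: {dimension}")
    query = (
        f"SELECT {dimension}, SUM(amount) FROM sales "
        f"GROUP BY {dimension} ORDER BY {dimension}"
    )
    return conn.execute(query).fetchall()


# Top tier: a front-end reporting tool that only talks to the middle tier.
def report(dimension):
    return "\n".join(f"{name}: {total:.2f}" for name, total in total_by(dimension))


print(report("region"))
```

The front end never sees table names or SQL. It asks for totals by a dimension, and the middle tier mediates between that request and the database.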

Data Warehouse Components

The data is selected from various sources, then integrated and stored in a single, particular
format.

A data warehouse contains current detailed data, historical detailed data, lightly and highly summarized data,
and metadata.

Current and historical data: these are voluminous because they are stored at the highest level of detail.
Lightly and highly summarized data: these are precomputed so that they are readily accessible and save
processing time when users request them.
Metadata: this is "data about data". It is important for designing, constructing, retrieving, and controlling the
warehouse data.
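The relationship between detailed data, the two levels of summary, and metadata can be illustrated as below. The sample rows, the choice of month-by-store as the "light" level and year as the "high" level, and the metadata fields are all assumptions made for the sake of the example.

```python
from collections import defaultdict

# Detailed data: one row per transaction (hypothetical sample).
detail = [
    {"day": "2024-01-05", "store": "S1", "amount": 20.0},
    {"day": "2024-01-17", "store": "S1", "amount": 30.0},
    {"day": "2024-02-02", "store": "S2", "amount": 50.0},
]


def summarize(rows, key_fn):
    """Precompute totals per group so later requests need not scan the detail."""
    totals = defaultdict(float)
    for row in rows:
        totals[key_fn(row)] += row["amount"]
    return dict(totals)


# Lightly summarized: totals per month and store (assumed grouping level).
lightly = summarize(detail, lambda row: (row["day"][:7], row["store"]))

# Highly summarized: totals per year (assumed grouping level).
highly = summarize(detail, lambda row: row["day"][:4])

# Metadata: data about the data (hypothetical description of one column).
metadata = {"amount": {"unit": "currency", "source": "hypothetical sales feed"}}
```

A report that needs yearly totals can read the one-row highly summarized result instead of recomputing it from every transaction.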

Benefits of Data Warehousing

The successful implementation of a data warehouse can bring major benefits to an organization, including:
• Potential high returns on investment
Implementing a data warehouse requires a huge investment, typically from Rs 10
lakh to Rs 50 lakh. However, a study by the International Data Corporation (IDC) in 1996 reported that
average three-year returns on investment (ROI) in data warehousing reached 401%.
• Competitive advantage
The huge returns on investment for those companies that have successfully implemented a data warehouse are
evidence of the enormous competitive advantage that accompanies this technology. The competitive
advantage is gained by giving decision-makers access to data that can reveal previously unavailable,
unknown, and untapped information on, for example, customers, trends, and demands.
• Increased productivity of corporate decision-makers
Data warehousing improves the productivity of corporate decision-makers by creating an integrated database
of consistent, subject-oriented, historical data. It integrates data from multiple incompatible systems into a
form that provides one consistent view of the organization. By transforming data into meaningful
information, a data warehouse allows business managers to perform more substantive, accurate, and
consistent analysis.
• More cost-effective decision-making
Data warehousing helps to reduce the overall cost of the product by reducing the number of channels.
• Better enterprise intelligence
• Enhanced customer service

Problems of Data Warehousing

The problems associated with developing and managing a data warehouse are as follows:
Underestimation of resources for data loading
The time required to extract, clean, and load the data into the warehouse is sometimes underestimated. It
may take a significant proportion of the total development time, although some tools are available to
reduce the time and effort spent on this process.
Hidden problems with source systems
Hidden problems associated with the source systems feeding the data warehouse may be identified only after
years of being undetected. For example, when entering the details of a new property, certain fields may
allow nulls, which may result in staff entering incomplete property data even when the data is available and applicable.
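One way to surface this kind of hidden problem before loading is to profile how often each field is left empty in the source. The sketch below assumes source records arrive as dictionaries; the function name null_rates and the property fields are hypothetical.

```python
def null_rates(records, fields):
    """Return the fraction of records in which each field is missing or None."""
    total = len(records)
    if total == 0:
        return {field: 0.0 for field in fields}
    return {
        field: sum(1 for record in records if record.get(field) is None) / total
        for field in fields
    }


# Hypothetical property records from a source system that allows nulls.
properties = [
    {"property_id": 1, "rooms": 3, "postcode": "AB1"},
    {"property_id": 2, "rooms": None, "postcode": "AB2"},
    {"property_id": 3, "rooms": None, "postcode": None},
    {"property_id": 4, "rooms": 2},
]

rates = null_rates(properties, ["rooms", "postcode"])
```

A high rate for a field that should normally be known is a signal to investigate the source system rather than to load the data as it stands.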
Required data not captured
In some cases the source systems do not capture data that is very important for the purposes of the
data warehouse. For example, the date of registration for a property may not be used in the source
system, but it may be very important for analysis.
Increased end-user demands
After some end-user queries have been satisfied, requests for support from staff may increase rather than decrease.
This is caused by users' increasing awareness of the capabilities and value of the data warehouse.
Another reason for increasing demands is that once a data warehouse is online, the
number of users and queries often increases, together with requests for answers to more and more complex queries.
Data homogenization
Data warehousing emphasizes similarity of data formats across different data sources. This
homogenization can result in the loss of some important value of the data.
High demand for resources
A data warehouse holds large amounts of data and therefore places a high demand on resources.
Data ownership
Data warehousing may change the attitude of end users to the ownership of data. Sensitive data that is owned
by one department has to be loaded into the data warehouse for decision-making purposes. That department is
sometimes reluctant to do so because it may hesitate to share the data with others.
High maintenance
Data warehouses are high-maintenance systems. Any reorganization of the business processes and the
source systems may affect the data warehouse, which results in high maintenance costs.
Long-duration projects
Building a warehouse can take up to three years, which is why some organizations are reluctant to
invest in a data warehouse. Sometimes only the historical data of a particular department is captured in the
data warehouse, resulting in data marts. Data marts support only the requirements of a particular department
and limit their functionality to that department or area.
Complexity of integration
The most important area for the management of a data warehouse is its integration capabilities. An
organization must spend a significant amount of time determining how well the various data
warehousing tools can be integrated into the overall solution that is needed. This can be a very difficult task,
as there are a number of tools for every operation of the data warehouse.
