
Assignment 2

The document provides information about data warehouses, including:
- A data warehouse is a relational database designed to support querying and analysis of historical data from multiple sources to support decision making.
- A data warehouse environment contains tools for extracting, transforming and loading (ETL) data from source systems, an online analytical processing (OLAP) engine for analysis, and applications for users to access and analyze the data.
- The purpose of a data warehouse is to organize data from multiple sources to enable knowledge workers and executives to make better strategic decisions by systematically analyzing the integrated data.

Uploaded by Aditya Boss

1) Data Warehouse

A Data Warehouse is a relational database management system (RDBMS) construct built to meet
requirements that transaction processing systems do not address. It can be loosely described as any
centralized data repository which can be queried for business benefit. It is a database that stores
information oriented to satisfy decision-making requests. It is also a group of decision support
technologies that aim to enable the knowledge worker (executive, manager, and analyst) to make
better and faster decisions. Data warehousing thus supports architectures and tools for business
executives to systematically organize, understand, and use their information to make strategic
decisions.

A data warehouse environment contains an extraction, transformation, and loading (ETL) solution,
an online analytical processing (OLAP) engine, client analysis tools, and other applications
that handle the process of gathering information and delivering it to business users.
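The ETL part of such an environment can be illustrated with a minimal Python sketch. It assumes two hypothetical source systems (`SOURCE_A`, `SOURCE_B`) and uses an in-memory SQLite database as a stand-in for the warehouse; the table and column names are invented for illustration and do not come from the original text.

```python
import sqlite3

# Hypothetical rows as they might arrive from two source systems (assumed data).
SOURCE_A = [{"order_id": 1, "amount": "120.50", "region": "north "},
            {"order_id": 2, "amount": "80.00", "region": "South"}]
SOURCE_B = [{"order_id": 3, "amount": "42.25", "region": "NORTH"}]


def extract():
    """Extract: gather raw rows from every source."""
    return SOURCE_A + SOURCE_B


def transform(rows):
    """Transform: convert types and normalize inconsistent region names."""
    return [(r["order_id"], float(r["amount"]), r["region"].strip().lower())
            for r in rows]


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales "
                 "(order_id INTEGER PRIMARY KEY, amount REAL, region TEXT)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for the warehouse database
load(conn, transform(extract()))

# An analytical query over the integrated data.
totals = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(totals)
```

The final query plays the role of the analysis tools: it works on the integrated, cleaned data rather than on the individual source systems.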

What is a Data Warehouse?


A Data Warehouse (DW) is a relational database that is designed for query and analysis rather than
transaction processing. It includes historical data derived from transaction data from one or more
sources.

A Data Warehouse provides integrated, enterprise-wide, historical data and focuses on providing
support for decision-makers for data modeling and analysis.

Discuss data warehouse architecture in detail.

A data warehouse architecture is a method of defining the overall architecture of data
communication, processing, and presentation that exists for end-client computing within the
enterprise. Each data warehouse is different, but all are characterized by standard vital
components.

Data warehouses and their architectures vary depending upon the elements of an organization's
situation.

Three common architectures are:

o Data Warehouse Architecture: Basic
o Data Warehouse Architecture: With Staging Area
o Data Warehouse Architecture: With Staging Area and Data Marts
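As a rough illustration of the third architecture, the sketch below moves a few records through a staging area into a warehouse and then splits them into data marts. The records, the cleaning rule, and the choice of department as the data mart subject are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical operational records from the source systems (assumed data).
source_rows = [
    {"dept": "sales", "item": "laptop", "amount": 1200},
    {"dept": "sales", "item": "mouse", "amount": 25},
    {"dept": "finance", "item": "audit", "amount": 500},
    {"dept": "sales", "item": None, "amount": 10},  # incomplete record
]

# Staging area: raw data lands here and is cleaned before it is loaded.
staging = [row for row in source_rows if row["item"] is not None]

# Warehouse: the integrated, cleaned store shared by the whole enterprise.
warehouse = list(staging)

# Data marts: subject-specific subsets of the warehouse, here one per department.
data_marts = defaultdict(list)
for row in warehouse:
    data_marts[row["dept"]].append(row)

for dept, rows in sorted(data_marts.items()):
    print(dept, len(rows))
```

In the basic architecture the staging and data mart steps would be absent, and users would query the warehouse directly.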

Data Warehouse Architecture: Basic


Write down the difference between Data Warehouse and Data Mining.

| Data Mining | Data Warehousing |
|---|---|
| Data mining is the process of determining data patterns. | A data warehouse is a database system designed for analytics. |
| Data mining is generally considered as the process of extracting useful data from a large set of data. | Data warehousing is the process of combining all the relevant data. |
| Business entrepreneurs carry out data mining with the help of engineers. | Data warehousing is entirely carried out by the engineers. |
| In data mining, data is analyzed repeatedly. | In data warehousing, data is stored periodically. |
| Data mining uses pattern recognition techniques to identify patterns. | Data warehousing is the process of extracting and storing data in a way that allows easier reporting. |
| One of the most notable data mining techniques is the detection and identification of unwanted errors that occur in the system. | One of the advantages of the data warehouse is its ability to update frequently. That is why it is ideal for business entrepreneurs who want to stay up to date. |
| Data mining techniques are cost-efficient compared to other statistical data applications. | The responsibility of the data warehouse is to simplify every type of business data. |
| Data mining techniques are not 100 percent accurate. They may lead to serious consequences in certain conditions. | In the data warehouse, there is a high possibility that the data required for analysis by the company may not be integrated into the warehouse. This can lead to loss of data. |
| Companies can benefit from this analytical tool by equipping themselves with suitable and accessible knowledge-based data. | A data warehouse stores a huge amount of historical data that helps users analyze different periods and trends to make future predictions. |

Explain the Data Warehouse implementation.

Data Warehouse Implementation


Implementing a data warehouse involves the following steps:

1. Requirements analysis and capacity planning: The first process in data warehousing involves
defining enterprise needs, defining architectures, carrying out capacity planning, and selecting the
hardware and software tools. This step involves consulting senior management as well as
the different stakeholders.

2. Hardware integration: Once the hardware and software have been selected, they need to be
put together by integrating the servers, the storage methods, and the user software tools.

3. Modeling: Modeling is a significant stage that involves designing the warehouse schema and
views. This may involve using a modeling tool if the data warehouse is sophisticated.

4. Physical modeling: For the data warehouse to perform efficiently, physical modeling is
needed. This involves designing the physical data warehouse organization, data placement, data
partitioning, deciding on access techniques, and indexing.

5. Sources: The information for the data warehouse is likely to come from several data sources.
This step involves identifying and connecting the sources using gateways, ODBC drivers, or
other wrappers.

6. ETL: The data from the source systems will need to go through an ETL phase. The process of
designing and implementing the ETL phase may involve identifying suitable ETL tool vendors and
purchasing and implementing the tools. This may include customizing the tools to suit the needs of
the enterprise.

7. Populate the data warehouse: Once the ETL tools have been agreed upon, testing the tools
will be needed, perhaps using a staging area. Once everything is working adequately, the ETL
tools may be used to populate the warehouse given the schema and view definitions.

8. User applications: For the data warehouse to be useful, there must be end-user applications.
This step involves designing and implementing the applications required by the end users.

9. Roll-out the warehouse and applications: Once the data warehouse has been populated and
the end-user applications tested, the warehouse system and the applications may be rolled out for
the user community to use.
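Several of the steps above (modeling, physical modeling, populating, and a simple end-user query) can be sketched together in Python. This is only an illustration under stated assumptions: it uses an in-memory SQLite database, and the star schema (`fact_sales`, `dim_product`, `dim_date`), the view `sales_by_year`, and the sample rows are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the warehouse database

# Modeling: a small star schema (one fact table, two dimension tables) and a view.
# Physical modeling: an index on a column the fact table is commonly joined on.
conn.executescript("""
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER, quarter TEXT);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id INTEGER REFERENCES dim_date(date_id),
    amount REAL
);
CREATE VIEW sales_by_year AS
    SELECT d.year AS year, p.name AS product, SUM(f.amount) AS total
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_date d ON d.date_id = f.date_id
    GROUP BY d.year, p.name;
CREATE INDEX idx_fact_sales_date ON fact_sales(date_id);
""")

# Populate: in practice the ETL tools would do this; here the rows are hard-coded.
conn.executemany("INSERT INTO dim_product VALUES (?, ?)",
                 [(1, "laptop"), (2, "mouse")])
conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)",
                 [(1, 2019, "Q1"), (2, 2020, "Q1")])
conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                 [(1, 1, 1000.0), (1, 2, 1500.0), (2, 2, 20.0), (2, 2, 30.0)])
conn.commit()

# User application: a report that reads from the view, not from the raw tables.
report = conn.execute(
    "SELECT year, product, total FROM sales_by_year ORDER BY year, product"
).fetchall()
for row in report:
    print(row)
```

Keeping the report on top of a view means the end-user application does not need to know how the fact and dimension tables are joined.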

Discuss OLAP Technology in detail.

Online Analytical Processing (OLAP) servers are based on the multidimensional data model. They
allow managers and analysts to gain insight into the information through fast, consistent, and
interactive access to it. This chapter covers the types of OLAP, operations on OLAP, and
the differences between OLAP, statistical databases, and OLTP.

Types of OLAP Servers


We have four types of OLAP servers:

o Relational OLAP (ROLAP)
o Multidimensional OLAP (MOLAP)
o Hybrid OLAP (HOLAP)
o Specialized SQL Servers
Relational OLAP
ROLAP servers are placed between a relational back-end server and client front-end tools.
To store and manage warehouse data, ROLAP uses a relational or extended-relational
DBMS.
ROLAP includes the following:

o Implementation of aggregation navigation logic.
o Optimization for each DBMS back end.
o Additional tools and services.
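One way to picture aggregation navigation is as routing: a query is answered from a pre-computed summary table when one matches, and from the detail table otherwise. The sketch below assumes an in-memory SQLite database as the relational back end; the tables `sales` and `sales_by_region`, the `SUMMARY_TABLES` catalog, and the helper `total_by` are hypothetical names, not part of any ROLAP product.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for the relational back end
conn.executescript("""
CREATE TABLE sales (region TEXT, month TEXT, amount REAL);
INSERT INTO sales VALUES ('north', '2020-01', 100.0),
                         ('north', '2020-02', 50.0),
                         ('south', '2020-01', 70.0);
-- Pre-computed summary table kept alongside the detail data.
CREATE TABLE sales_by_region AS
    SELECT region, SUM(amount) AS amount FROM sales GROUP BY region;
""")

# Hypothetical catalog: which grouping columns each summary table can answer.
SUMMARY_TABLES = {frozenset(["region"]): "sales_by_region"}


def total_by(columns):
    """Use a matching summary table if one exists, else aggregate the detail table."""
    table = SUMMARY_TABLES.get(frozenset(columns), "sales")
    cols = ", ".join(columns)  # column names come from this script, not from user input
    sql = f"SELECT {cols}, SUM(amount) FROM {table} GROUP BY {cols} ORDER BY {cols}"
    return table, conn.execute(sql).fetchall()


print(total_by(["region"]))  # answered from the summary table
print(total_by(["month"]))   # no summary available, falls back to the detail table
```

Both calls return the same kind of result; only the table that is scanned differs, which is the point of navigating between aggregation levels.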
Multidimensional OLAP
MOLAP uses array-based multidimensional storage engines for multidimensional views
of data. With multidimensional data stores, the storage utilization may be low if the data
set is sparse. Therefore, many MOLAP servers use two levels of data storage
representation to handle dense and sparse data sets.
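The storage utilization problem can be seen in a small Python sketch that stores the same facts in a dense array and in a sparse structure. The dimensions and values are assumed for illustration, and plain lists and dictionaries stand in for a real MOLAP storage engine.

```python
# Dimensions of a hypothetical cube: product x region x month.
products = ["laptop", "mouse", "desk"]
regions = ["north", "south"]
months = ["jan", "feb"]

# Facts: only a few of the 3 * 2 * 2 = 12 cells actually hold a value.
facts = {("laptop", "north", "jan"): 1000.0, ("mouse", "south", "feb"): 25.0}

# Dense representation: one array slot per cell, addressed by position.
dense = [[[0.0 for _ in months] for _ in regions] for _ in products]
for (p, r, m), value in facts.items():
    dense[products.index(p)][regions.index(r)][months.index(m)] = value

# Sparse representation: store only the non-empty cells, keyed by coordinates.
sparse = {(products.index(p), regions.index(r), months.index(m)): value
          for (p, r, m), value in facts.items()}

total_cells = len(products) * len(regions) * len(months)
utilization = len(sparse) / total_cells
print(f"{len(sparse)} of {total_cells} cells used ({utilization:.0%})")
```

The dense array gives direct positional access but reserves space for every empty cell, while the sparse dictionary stores only what exists, which is why a server might use one representation for dense regions of a cube and another for sparse ones.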

Hybrid OLAP
Hybrid OLAP is a combination of both ROLAP and MOLAP. It offers the higher scalability of
ROLAP and the faster computation of MOLAP. HOLAP servers can store large
volumes of detailed information. The aggregations are stored separately in a MOLAP store.

Specialized SQL Servers


Specialized SQL servers provide advanced query language and query processing
support for SQL queries over star and snowflake schemas in a read-only environment.

OLAP Operations
Since OLAP servers are based on a multidimensional view of data, we will discuss OLAP
operations on multidimensional data.
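As a minimal sketch of what such operations look like, the Python code below applies two commonly described ones, roll-up and slice, to a tiny cube held in a dictionary. The cube contents and the helper names `roll_up` and `slice_cube` are assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical cube: (product, region, quarter) -> sales amount.
cube = {
    ("laptop", "north", "Q1"): 1000.0,
    ("laptop", "south", "Q1"): 400.0,
    ("mouse", "north", "Q1"): 30.0,
    ("mouse", "north", "Q2"): 20.0,
}


def roll_up(cube, keep):
    """Aggregate away every dimension except those at the positions in keep."""
    result = defaultdict(float)
    for coords, value in cube.items():
        result[tuple(coords[i] for i in keep)] += value
    return dict(result)


def slice_cube(cube, axis, member):
    """Fix one dimension to a single member and return the remaining sub-cube."""
    return {coords[:axis] + coords[axis + 1:]: value
            for coords, value in cube.items() if coords[axis] == member}


by_product = roll_up(cube, keep=[0])              # total sales per product
q1_only = slice_cube(cube, axis=2, member="Q1")   # only the Q1 plane of the cube
print(by_product)
print(q1_only)
```

Roll-up reduces the number of dimensions by summing over them, while slice keeps the detail but restricts the cube to one member of a chosen dimension.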
