What Is a Data Warehouse
What Is a Data Warehouse
What is ETL – ETL Vs ELT – Types of Data warehouses - Data warehouse Design and Modeling
-Delivery Process - Online Analytical Processing (OLAP) - Characteristics of OLAP -
Online Transaction Processing (OLTP) Vs OLAP - OLAP operations- Types of OLAP- ROLAP
Vs MOLAP Vs HOLAP.
A data warehouse is a centralized storage system that allows for the storing,
analyzing, and interpreting of data in order to facilitate better decision-making.
Transactional systems, relational databases, and other sources provide data into
data warehouses on a regular basis.
A data warehouse is a type of data management system that facilitates and
supports business intelligence (BI) activities, specifically analysis. Data
warehouses are primarily designed to facilitate searches and analyses and usually
contain large amounts of historical data.
A data warehouse can be defined as a collection of organizational data and
information extracted from operational sources and external data sources. The data
is periodically pulled from various internal applications like sales, marketing, and
finance; customer-interface applications; as well as external partner systems. This
data is then made available for decision-makers to access and analyze. So what is
data warehouse? For a start, it is a comprehensive repository of current and
historical information that is designed to enhance an organization’s performance.
Key Characteristics of Data Warehouse
Subject-Oriented
A data warehouse is subject-oriented since it provides topic-wise information
rather than the overall processes of a business. Such subjects may be sales,
promotion, inventory, etc. For example, if you want to analyze your company’s
sales data, you need to build a data warehouse that concentrates on sales. Such a
warehouse would provide valuable information like ‘who was your best customer
last year?’ or ‘who is likely to be your best customer in the coming year?’
Integrated
A data warehouse is developed by integrating data from varied sources into a
consistent format. The data must be stored in the warehouse in a consistent and
universally acceptable manner in terms of naming, format, and coding. This
facilitates effective data analysis.
Non-Volatile
Data once entered into a data warehouse must remain unchanged. All data is read-
only. Previous data is not erased when current data is entered. This helps you to
analyze what has happened and when.
Time-Variant
The data stored in a data warehouse is documented with an element of time, either
explicitly or implicitly. An example of time variance in Data Warehouse is
exhibited in the Primary Key, which must have an element of time like the day,
week, or month.
Bottom Tier
The bottom tier or data warehouse server usually represents a relational database
system. Back-end tools are used to cleanse, transform and feed data into this layer.
Middle Tier
The middle tier represents an OLAP server that can be implemented in two ways.
The ROLAP or Relational OLAP model is an extended relational database
management system that maps multidimensional data process to standard relational
process.
Top Tier
This is the front-end client interface that gets data out from the data warehouse. It
holds various tools like query tools, analysis tools, reporting tools, and data mining
tools.
Data Warehousing integrates data and information collected from various sources
into one comprehensive database. For example, a data warehouse might combine
customer information from an organization’s point-of-sale systems, its mailing
lists, website, and comment cards. It might also incorporate confidential
information about employees, salary information, etc. Businesses use such
components of data warehouse to analyze customers.
Data mining is one of the features of a data warehouse that involves looking for
meaningful data patterns in vast volumes of data and devising innovative strategies
for increased sales and profits.
This type of warehouse serves as a key or central database that facilitates decision-
support services throughout the enterprise. The advantage to this type of
warehouse is that it provides access to cross-organizational information, offers a
unified approach to data representation, and allows running complex queries.
Operational Data Store (ODS)
This type of data warehouse refreshes in real-time. It is often preferred for routine
activities like storing employee records. It is required when data warehouse
systems do not support reporting needs of the business.
Data Mart
Wondering what Data warehouse tools is? Well, these are software components
used to perform several operations on an extensive data set. These tools help to
collect, read, write and transfer data from various sources. What do data
warehouses support? They are designed to support operations like data sorting,
filtering, merging, etc.
Autonomous Database
An autonomous system is one that can achieve a given set of goals in a changing
environment—gathering information about the environment and working for an
extended period of time without human control or intervention. Driverless cars and
autonomous mobile robots (AMRs) used in warehouses are two common
examples.
With an autonomous database, developers can quickly build scalable and secure
enterprise applications from data housed in a preconfigured, fully managed, and
secure environment.