0% found this document useful (0 votes)
9 views7 pages

DWDM Unit 1 (Lecture 1)

Uploaded by

SAHINA FIRDOSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views7 pages

DWDM Unit 1 (Lecture 1)

Uploaded by

SAHINA FIRDOSH
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 7

Data Ware Housing

and Data Mining


BCA 302 (Lecture-1)
Data Warehouse
• A data warehouse is a subject-oriented, integrated, time-variant and non-
volatile collection of data in support of management's decision making
process.

• Subject-Oriented: A data warehouse can be used to analyze a particular


subject area. For example, "sales" can be a particular subject.

• Integrated: A data warehouse integrates data from multiple data sources.


For example, source A and source B may have different ways of identifying
a product, but in a data warehouse, there will be only a single way of
identifying a product.
• Time-Variant: Historical data is kept in a data warehouse. For example,
one can retrieve data from 3 months, 6 months, 12 months, or even older
data from a data warehouse. For example, a transaction system may hold
the most recent address of a customer, where a data warehouse can hold
all addresses associated with a customer.

• Non-Volatile: Once data is in the data warehouse, it will not change. So,
historical data in a data warehouse should never be altered.
Difference between Database System and
Data Warehouse
Database System Data Warehouse

Primarily designed to handle transactional Designed for analytical processing (OLAP -


data (OLTP - Online Transaction Processing). Online Analytical Processing). It is
It supports daily operations like inserting, optimized to handle large volumes of
updating, deleting, and querying small historical data and is used for reporting,
amounts of data. analysis, and decision-making.

Stores current, operational data that Stores historical data accumulated over
reflects day-to-day business operations. time. This data is typically denormalized to
This data is usually highly normalized (split improve query performance and is
into multiple tables). organized in a way that is optimized for
complex querying and analysis.
Database System Data Warehouse

Data is often organized in a highly Data is often denormalized, meaning it is


structured and normalized format to stored in fewer tables to optimize read
minimize redundancy and maintain data performance and simplify complex queries.
integrity. The data schema might be star, snowflake,
or galaxy.

Generally handles a smaller, more Typically stores massive amounts of data


manageable volume of data, focused on the over long periods (years or decades),
day-to-day operation of a business. capturing a wide historical view.

Primarily used by operational staff and Primarily used by data analysts, business
front-end applications like e-commerce intelligence professionals, and decision-
platforms, point-of-sale systems, and other makers who need to analyze trends,
business applications. perform reporting, and make strategic
decisions.
Database System Data Warehouse

Optimized for high-speed transactional Optimized for read-heavy operations like


processing and ensures data integrity, complex queries and aggregations over
consistency, and quick updates. large datasets. Indexing and partitioning are
commonly used to speed up analytical
queries.

Frequently updated in real-time as new Data is updated less frequently, often


transactions happen. through ETL (Extract, Transform, Load)
processes that occur in batches, typically
during off-peak hours.
Data is normalized to reduce redundancy Data is often denormalized to improve
and improve data integrity. query performance, as there is less concern
about redundancy.
Database System Data Warehouse

Data retrieval tends to be simple, focusing Data retrieval tends to be more complex
on quick access to specific records based and involves running large analytical
on operational needs. queries on aggregated data for business
analysis.
Example: An online store that tracks Example: A retail company analyzing sales
customer orders, inventory levels, and trends, customer behavior, and inventory
payments. management across multiple years to
improve future business strategies.

You might also like