0% found this document useful (0 votes)
8 views21 pages

Data Warehouse OLAP

The document provides an overview of data warehouses, contrasting them with traditional databases and introducing data lakes. It discusses key characteristics, architecture, and applications of data warehouses, as well as the differences between OLAP and OLTP systems. The conclusion emphasizes the importance of these components in creating a robust data ecosystem for modern enterprises.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views21 pages

Data Warehouse OLAP

The document provides an overview of data warehouses, contrasting them with traditional databases and introducing data lakes. It discusses key characteristics, architecture, and applications of data warehouses, as well as the differences between OLAP and OLTP systems. The conclusion emphasizes the importance of these components in creating a robust data ecosystem for modern enterprises.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 21

Understanding Data Warehouse

Data Warehouse vs Traditional Database


Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Unveiling the Data Ecosystem: Data


Warehouses, Databases, & Data Lakes
in Modern Enterprises

Khang Nguyen-Hoang

Hutech University

February 21, 2025

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Outline

1 Understanding Data Warehouse

2 Data Warehouse vs Traditional Database

3 Introducing Data Lake

4 OLAP vs OLTP: A Detailed Comparison

5 Components of a Data Warehouse

6 Applications in Enterprises

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

What is a Data Warehouse?

A centralized repository for storing and managing large


volumes of historical and current data from diverse sources.
Designed for analytical workloads (OLAP), supporting
business intelligence, analytics, and AI/ML.
Optimized for complex queries, aggregations, and reporting
at scale (terabytes to petabytes).
Source: Snowflake Data Warehouse Overview

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Key Characteristics of a Data Warehouse

Characteristic Description
Subject-Oriented Organized around business domains (e.g., sales,
inventory) for thematic analysis.
Integrated Unified data from heterogeneous sources,
cleansed and transformed.
Time-Variant Retains historical data for trend analysis,
append-only storage.
Non-Volatile Stable data, rarely updated, ensuring reliable
historical records.
Source: TechTarget Data Warehouse Definition

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Architecture of a Data Warehouse

Bottom Tier: Data ingestion from sources (e.g., ETL


with Apache Airflow).
Middle Tier: Storage and processing (e.g., Snowflake,
Redshift) with columnar storage and MPP.
Top Tier: Access via BI tools (e.g., Tableau) and SQL
clients.
Source: Snowflake Documentation Key Concepts

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Warehouse vs Traditional Database:


Purpose and Use

Aspect Traditional Data Warehouse


Database
Purpose Real-time transaction Historical data analysis
processing (OLTP) (OLAP)
Use Case Daily operations (e.g., Strategic insights (e.g.,
recording sales) sales trends over 5
years)
Source: Panoply Database vs Data Warehouse Guide

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Warehouse vs Traditional Database:


Technical Comparison

Aspect Traditional Data Warehouse


Database
Data Structure Normalized (reduces re- Denormalized (star/s-
dundancy) nowflake schema)
Storage Row-based Columnar
Scalability Transaction volume Data volume
(petabytes)
Performance Fast transactions Fast complex queries
Source: Oracle Data Warehouse Information

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

What is a Data Lake?

A centralized storage system for raw data in its native


format (structured, semi-structured, unstructured).
Designed for big data analytics, machine learning, and data
exploration.
Highly scalable, often cloud-based (e.g., Amazon S3, Azure
Blob Storage).
Source: AWS Big Data Lake

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Key Features of a Data Lake

Feature Description
Raw Data Storage Stores data as-is (e.g., JSON, images, logs).
Scalability Leverages distributed systems (e.g., HDFS, S3).
Flexibility Schema-on-read, no predefined structure
needed.
Cost-Effective Low-cost storage, processing deferred.
Source: Databricks Data Lakes Discover

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

OLAP vs OLTP: Overview

OLTP (Online Transaction Processing): Focuses on


real-time transaction management for operational
efficiency.
OLAP (Online Analytical Processing): Focuses on
complex analysis of historical data for strategic insights.
Foundation for understanding databases and data
warehouses.
Source: Oracle OLTP vs OLAP Explanation

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

OLAP vs OLTP: Functional Comparison

Aspect OLTP OLAP


Purpose Manage daily transac- Analyze historical data
tions
Data Type Current, operational Historical, aggregated
data data
Users Operational staff, appli- Analysts, data scientists
cations
Query Type Simple (e.g., updates, Complex (e.g., aggrega-
inserts) tions, joins)
Source: IBM DB2 OLTP vs OLAP Documentation

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

OLAP vs OLTP: Technical Comparison

Aspect OLTP OLAP


Data Structure Normalized (e.g., rela- Denormalized (e.g., star
tional tables) schema)
Performance Fast writes, low latency Fast reads, high
throughput
Storage Design Row-based, optimized Columnar, optimized
for transactions for analysis
Volume Smaller, transactional Larger, historical data
data
Source: GeeksforGeeks OLTP vs OLAP Difference

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Components of a Data Warehouse: Overview

A Data Warehouse comprises several key components


working together to enable efficient data ingestion, storage,
processing, and access.
Essential for supporting enterprise-scale analytics and
decision-making.
Source: Oracle Data Warehouse Information

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Components of a Data Warehouse: Core


Elements (1)

Component Description
Data Sources Heterogeneous systems providing raw data (e.g.
CRM, ERP, IoT devices).
ETL/ELT Pipelines Extract, transform, and load data into the ware
house (e.g., Apache Airflow, Informatica).
Storage Layer Centralized repository for structured data, often
columnar (e.g., Snowflake, Redshift).
Source: Snowflake Documentation Key Concepts

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Components of a Data Warehouse: Core


Elements (2)

Component Description
Metadata Repository Stores information about data (structure, li
eage, usage) for governance and querying.
Query Engine Processes complex analytical queries (e.g., SQ
based engines in BigQuery).
Access Layer Interfaces for end-users (e.g., BI tools li
Tableau, Power BI).
Source: IBM Data Warehouse Topics

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Warehouse Applications (1)

Industry Application Example


Retail Sales trend analysis, in- Walmart optimizes
ventory management product placement
across stores
Finance Fraud detection, risk as- American Express
sessment detects transaction
anomalies
Source: Analytics Vidhya Data Warehouse Examples

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Warehouse Applications (2)

Industry Application Example


Healthcare Patient care improve- Hospital networks track
ment, disease trend treatment outcomes
analysis
Source: IBM Data Warehouse Topics

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Lake Applications (1)

Industry Application Example


Manufacturing Predictive maintenance, Tesla predicts machine
supply chain optimiza- failures with IoT data
tion
Telecom Network optimization, Verizon enhances ser-
customer experience vice with network logs
Source: Databricks Data Lakes Discover

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Data Lake Applications (2)

Industry Application Example


Media & Entertainment Content recommenda- Netflix trains M
tions, ad strategies els for personaliz
tent
Source: Analytics Vidhya Data Lake Examples

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Conclusion

Data Warehouse: Structured, optimized for analytics and


reporting.
Traditional Database: Real-time transactions,
operational efficiency.
Data Lake: Raw, flexible, ideal for big data and ML.
Together, they form a robust data ecosystem for modern
enterprises.

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab


Understanding Data Warehouse
Data Warehouse vs Traditional Database
Introducing Data Lake
OLAP vs OLTP: A Detailed Comparison
Components of a Data Warehouse
Applications in Enterprises

Thank You

Questions?
Contact: [Your Email]

Khang Nguyen-Hoang Unveiling the Data Ecosystem: Data Warehouses, Datab

You might also like