Data Warehouse OLAP
Khang Nguyen-Hoang
Hutech University
Outline
6 Applications in Enterprises
Characteristic    Description
Subject-Oriented  Organized around business domains (e.g., sales, inventory) for thematic analysis.
Integrated        Unified data from heterogeneous sources, cleansed and transformed.
Time-Variant      Retains historical data for trend analysis; append-only storage.
Non-Volatile      Stable data, rarely updated, ensuring reliable historical records.
Source: TechTarget Data Warehouse Definition
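The time-variant and non-volatile characteristics can be illustrated with a small sketch. It uses Python's built-in sqlite3 module as a stand-in for a warehouse; the table product_price_history, its columns, and the helper functions are hypothetical names chosen for this example only. A price change is appended as a new row rather than overwriting the old one, so earlier states remain queryable.

```python
import sqlite3

# In-memory SQLite database standing in for a warehouse (assumption).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE product_price_history ("
    " product_id INTEGER, price REAL, valid_from TEXT)"
)


def record_price(product_id, price, valid_from):
    """Append a new version of the price; existing rows are never updated."""
    conn.execute(
        "INSERT INTO product_price_history VALUES (?, ?, ?)",
        (product_id, price, valid_from),
    )


def price_as_of(product_id, date):
    """Return the price in effect on an ISO date, or None if none existed yet."""
    row = conn.execute(
        "SELECT price FROM product_price_history"
        " WHERE product_id = ? AND valid_from <= ?"
        " ORDER BY valid_from DESC LIMIT 1",
        (product_id, date),
    ).fetchone()
    return row[0] if row else None


record_price(1, 9.99, "2024-01-01")
record_price(1, 11.49, "2024-06-01")  # price change: appended, not overwritten

print(price_as_of(1, "2024-03-15"))
print(price_as_of(1, "2024-07-01"))
```

Because no row is ever modified, a report for March still sees the January price after the June change has been recorded.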
Feature           Description
Raw Data Storage  Stores data as-is (e.g., JSON, images, logs).
Scalability       Leverages distributed systems (e.g., HDFS, S3).
Flexibility       Schema-on-read, no predefined structure needed.
Cost-Effective    Low-cost storage, processing deferred.
Source: Databricks Data Lakes Discover
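Schema-on-read can be sketched in a few lines of Python. The raw records below are made-up sample data, and read_with_schema and sales_schema are hypothetical names for this illustration. The records are stored untouched, and a structure is imposed only when a consumer reads them.

```python
import json

# Raw events kept as-is, one JSON document per line (made-up sample data).
raw_lines = [
    '{"user": "a1", "amount": "19.90", "ts": "2024-05-01"}',
    '{"user": "b2", "amount": 5, "extra": {"coupon": "X"}}',
    '{"user": "c3"}',
]


def read_with_schema(lines, schema):
    """Apply a schema at read time: select fields, cast them, default to None."""
    for line in lines:
        record = json.loads(line)
        row = {}
        for field, cast in schema.items():
            value = record.get(field)
            row[field] = cast(value) if value is not None else None
        yield row


# The consumer decides which fields matter and which types they should have.
sales_schema = {"user": str, "amount": float}
rows = list(read_with_schema(raw_lines, sales_schema))

for row in rows:
    print(row)
```

A different consumer could read the same raw lines with another schema, for example one that keeps only ts, without rewriting the stored data.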
Component          Description
Data Sources       Heterogeneous systems providing raw data (e.g., CRM, ERP, IoT devices).
ETL/ELT Pipelines  Extract, transform, and load data into the warehouse (e.g., Apache Airflow, Informatica).
Storage Layer      Centralized repository for structured data, often columnar (e.g., Snowflake, Redshift).
Source: Snowflake Documentation Key Concepts
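The path from data source through ETL to the storage layer can be sketched as three small functions. This is a minimal illustration using only the Python standard library, not how Airflow or Informatica are configured; the CSV content, the table fact_orders, and the function names are assumptions made for the example, and SQLite is row-oriented rather than columnar.

```python
import csv
import io
import sqlite3

# Made-up CSV export standing in for a source system such as a CRM.
SOURCE_CSV = """order_id,customer,amount,order_date
1, alice ,120.50,2024-01-05
2,BOB,80,2024-01-06
3,carol,,2024-01-07
"""


def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows without an amount, normalize names, cast types."""
    clean = []
    for r in rows:
        if not r["amount"].strip():
            continue  # skip rows with a missing amount
        clean.append((
            int(r["order_id"]),
            r["customer"].strip().title(),
            float(r["amount"]),
            r["order_date"],
        ))
    return clean


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS fact_orders ("
        " order_id INTEGER PRIMARY KEY, customer TEXT,"
        " amount REAL, order_date TEXT)"
    )
    conn.executemany("INSERT INTO fact_orders VALUES (?, ?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract(SOURCE_CSV)))
loaded = conn.execute(
    "SELECT customer, amount FROM fact_orders ORDER BY order_id"
).fetchall()
print(loaded)
```

In an ELT variant, the raw rows would be loaded first and the cleaning step would run inside the warehouse afterwards.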
Component            Description
Metadata Repository  Stores information about data (structure, lineage, usage) for governance and querying.
Query Engine         Processes complex analytical queries (e.g., SQL-based engines in BigQuery).
Access Layer         Interfaces for end-users (e.g., BI tools like Tableau, Power BI).
Source: IBM Data Warehouse Topics
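The kind of analytical query a query engine handles can be sketched as an OLAP-style roll-up: the same measure is aggregated at a finer and then a coarser level of detail. The example uses sqlite3 as a stand-in engine; the sales table, its sample rows, and the helper revenue_by are hypothetical and exist only for this illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sales (region TEXT, product TEXT, year INTEGER, revenue REAL)"
)
# Made-up sample facts.
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?, ?)",
    [
        ("North", "Widget", 2023, 100.0),
        ("North", "Gadget", 2023, 150.0),
        ("South", "Widget", 2023, 200.0),
        ("North", "Widget", 2024, 120.0),
        ("South", "Gadget", 2024, 80.0),
    ],
)

ALLOWED_DIMENSIONS = {"region", "product", "year"}


def revenue_by(*dimensions):
    """Sum revenue grouped by the given dimensions (checked against a whitelist)."""
    if not dimensions or not set(dimensions) <= ALLOWED_DIMENSIONS:
        raise ValueError("unknown or missing dimension")
    cols = ", ".join(dimensions)
    query = (
        f"SELECT {cols}, SUM(revenue) FROM sales"
        f" GROUP BY {cols} ORDER BY {cols}"
    )
    return conn.execute(query).fetchall()


detailed = revenue_by("region", "year")  # finer level: region and year
rolled_up = revenue_by("region")         # roll-up: year is aggregated away

print(detailed)
print(rolled_up)
```

A BI tool in the access layer would typically issue queries of this shape on the user's behalf when a report is drilled down or rolled up.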
Conclusion
Thank You
Questions?
Contact: [Your Email]