0% found this document useful (0 votes)
68 views30 pages

Defining Data Warehouse Concepts and Terminology

This document defines key concepts and terminology related to data warehousing. It defines a data warehouse as an enterprise repository used for information retrieval and decision support that stores integrated, subject-oriented, and time-variant historical data. The document outlines key properties of data warehouses including being subject-oriented, integrated, and time-variant. It also distinguishes between data warehouses and operational databases and describes Oracle's data warehousing components, tools, and services.

Uploaded by

avijust4u
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
68 views30 pages

Defining Data Warehouse Concepts and Terminology

This document defines key concepts and terminology related to data warehousing. It defines a data warehouse as an enterprise repository used for information retrieval and decision support that stores integrated, subject-oriented, and time-variant historical data. The document outlines key properties of data warehouses including being subject-oriented, integrated, and time-variant. It also distinguishes between data warehouses and operational databases and describes Oracle's data warehousing components, tools, and services.

Uploaded by

avijust4u
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPT, PDF, TXT or read online on Scribd
You are on page 1/ 30

Defining Data Warehouse Concepts and Terminology

Chapter 3

Definition of a Data Warehouse


An enterprise structured repository of subject-oriented, time-variant, historical data used for information retrieval and decision support. The data warehouse stores atomic and summary data.
Oracle Data Warehouse Method

Data Warehouse Properties


Subject Oriented Integrated

Data Warehouse
Non Volatile

Time Variant

Subject-Oriented
Data is categorized and stored by business subject rather than by application
OLTP Applications Equity Plans Data Warehouse Subject

Shares

Insurance
Savings Loans

Customer financial information

Integrated
Data on a given subject is defined and stored once.
Savings

Current accounts

Loans

Customer

OLTP Applications

Data Warehouse

Time-Variant
Data is stored as a series of snapshots, each representing a period of time

Time Jan-97 Feb-97 Mar-97

Data January February March

Nonvolatile
Typically data in the data warehouse is not updated or delelted. Operational Warehouse

Load

Insert Update Delete

Read

Read

Changing Data
First time load Warehouse Database

Operational Database

Refresh

Refresh

Refresh

Data Warehouse Versus OLTP


Property Response Time
Operations

Operational Sub seconds to seconds


DML

Data Warehouse
Seconds to hours Primarily read only

Nature of Data

30-60 days

Data Organization Applications Size Small to large Data Source Activities

Snapshots over time Subject, time


Large to very large

Operational, Internal, Operational, Internal External


Processes Analysis

Usage Curves
Operational system is predictable Data warehouse - Variable - Random

User Expectations
Control expectations Set achievable targets for query response Set SLAs Educate Growth and use is exponential

Enterprisewide Warehouse
Large scale implementation Scope the entire business Data from all subject areas Developed incrementally Single source of enterprisewide data Single distribution point to dependent data marts

Data Warehouses Versus Data Marts


Data Warehouse
Data Mart

Property Scope Subject Data Source Size(typical) Implementation time

Data Warehouse Enterprise Multiple Many 100 GB to>1 TB Months to years

Data Mart Department Single-subject, LOB Few <100 GB Months

Dependent Data Mart


Flat Files Operational Systems Marketing Sales Finance Human Resources Data Warehouse External Data Marketing

Marketing

Marketing Data Marts

Independent Data Mart


Flat Files
Operational Systems

Sale or Marketing External Data

Data Warehouse Terminology


Operational data store (ODS) Stores tactical data from production systems that are subject-oriented and integrated to address operational needs Metadata
Metadata

Data Warehouse Terminology


Architecture
Enterprise data warehouse Business area warehouse

Data Integration
Source data

Methodolgy
Ensures a successful data warehouse Encourages incremental development Provides a staged approach to an enterprisewide warehouse - Safe - Manageable - Proven - Recommended

Modeling
Warehouses differ from operational structures: - Analytical requirements - Subject orientation Data must map to subject oriented information: - Identify business subjects - Define relationships between subjects - Name the attributes of each subject Modeling is iterative Modeling tools are available

Extraction, Transformation, and Transportation

OLTP Databases

Staging File

Warehouse Database

Purchase specialist tools, or develop programs Extraction-- select data using different methods Transformation--validate, clean, integrate, and time stamp data Transportation--move data into the

Data Management
Efficient database server and management tools for all aspects of data management Imperatives - Productive - Flexible - Robust - Efficient Hardware, operating system and

Data Access and Reporting


Simple Queries Forecasting
Warehouse Database

Drill-down

Tools that retrieve data for business analysis Imperatives - Ease of use - Intuitive - Metadata - Training More than one tool may be required

Oracle Warehouse Components


Any Source Any Data Any Access

Operational data

Relational / Multidimensional

Relational tools

Text, image External data

Spatial

OLAP tools

Web

Audio video

Applications/Web

Oracle Data Mart Suite


Data Modeling
Oracle Data Mart Designer OLTP Databases Data Mart Database

OLTP Engines Data Extraction Oracle Data Mart Builder

Warehousing Engines

SQL*Plus

Data Management Oracle Enterprise Manager

Data Access & Analysis Discoverer & Oracle Reports

Data Mart Implementation with the Oracle Data Mart Suite


Oracle Oracle Oracle Oracle Oracle Oracle Oracle Enterprise Server Enterprise Manager Data Mart Builder Data Mart Designer Discoverer Web Application Server Reports

Oracle Warehouse Builder Architecture


Extraction Facilities Loader Remotes SQL Gateways - OLE-DB/ODBC - Mainframe - Specialized ERP Data - SAP - Peoplesoft - Oracle

Sources
Filter Transform

PL/SQL, Java Transforms Transform Driver PL/SQL, Java Wrapper External Functions Target Tables Oracle 8i

Oracle Business Intelligence Tools

IS develops users Views

Business users

Analysis

Current
Oracle Reports

Tactical
Oracle Discover

Strategic
Oracle Express

The Tool for Each Task


Tool
Oracle Reports Oracle Discover Oracle Express

Task
Production reporting Ad hoc query and analysis Advanced analysis

Question
What were sales by region last quarter? What is driving the increase in North American sales?

Given the rapid increase in Web sales, what will total sales be for the rest of the year?

Oracle Warehouse Services


Oracle Education Oracle Consulting

Customers

Oracle Support Services

Summary
This lesson covered the following topics: Identifying a common, broadly accepted definition of the data warehouse Distinguishing the differences between OLTP systems and analytical systems Defining some of the common data warehouse terminology Identifying some of the elements and processes in a data warehouse Identifying and positioning the Oracle Warehouse vision, products, and services

You might also like