Lect 4
Lect 4
Business Intelligence
&
Analytics
Lecture 4
Lecture 04
Dataware housing
Contents
• Business Intelligence and Data warehousing
• History of Data warehousing
• Characteristics of Data ware housing
• Data warehousing architectures
• Extraction, transformation, and load (E T L) processes
Learning Objectives (1 of 2)
Understand the basic definitions and concepts of data
warehousing
Understand data warehousing architectures
Describe the processes used in developing and managing data
warehouses
Explain data warehousing operations
Explain the role of data warehouses in decision support
Business Intelligence and Data Warehousing
• BI used to be
Business Analytics
everything related to
use of data for
managerial decision
support Descriptive Predictive Prescriptive
• Now, it is a part of
Questions
What happened? What will happen? What should I do?
What is happening? Why will it happen? Why should I do it?
Business Analytics
Business reporting Data mining Optimization
– BI = Descriptive
ü ü ü
Enablers
Data Warehouse
One management and analytics platform
for product configuration, warranty, and
diagnostic readout data
Data Applications
Sources No data marts option (Visualization)
Data
Marts Routine
ERP Business
ETL
Reporting
Process
Data mart
Select (Marketing)
Legacy Metadata Data/text
/ Middleware
Extract mining
Data mart
Transform Enterprise (Operations)
POS Data warehouse
OLAP,
Integrate
Dashboard,
API
Data mart
(Finance) Web
Other Load
OLTP/Web
Replication Data mart
(...) Custom built
External
applications
Data
DW Architecture
• Three-tier architecture
1. Data acquisition software (back-end)
2. The data warehouse that contains the data & software
3. Client (front-end) software that allows users to access and
analyze data from the warehouse
• Two-tier architecture
– First two tiers in three-tier architecture are combined into one
DW Architectures
3-tier
architecture
Tier 1: Tier 2: Tier 3:
Client workstation Application server Database server
2-tier
architecture
Tier 1: Tier 2:
Client workstation Application & database server
Data Warehousing Architectures
• Issues to consider when deciding which architecture to use:
– Which database management system (DBMS) should be
used?
– Will parallel processing and/or partitioning be used?
– Will data migration tools be used to load the data
warehouse?
– What tools will be used to support data retrieval and
analysis?
A Web-based DW Architecture
Web pages
Application
Server
Client Web
(Web browser) Internet/ Server
Intranet/
Extranet
Data
warehouse
Alternative DW Architectures (1 of 2)
(a) Independent Data Marts Architecture
ETL
End user
Source Staging Independent data marts
access and
Systems Area (atomic/summarized data)
applications
ETL
Dimensionalized data marts End user
Source Staging
linked by conformed dimentions access and
Systems Area
(atomic/summarized data) applications
ETL
End user
Source Staging Normalized relational
access and
Systems Area warehouse (atomic data)
applications
ETL
Normalized relational End user
Source Staging
warehouse (atomic/some access and
Systems Area
summarized data) applications
5. Constraints on resources
9. Technical issues
Packaged Transient
application data source
Data
warehouse
Data
marts
Other internal
applications
ETL (Extract, Transform, Load)
• Issues affecting the purchase of an ETL tool
– Data transformation tools are expensive
– Data transformation tools may have a long learning curve
• Important criteria in selecting an ETL tool
– Ability to read from and write to an unlimited number of
data sources/architectures
– Automatic capturing and delivery of metadata
– A history of conforming to open standards
– An easy-to-use interface for the developer and the
functional user
Data Warehouse Development
• Data warehouse development approaches
– Inmon Model: EDW approach (top-down)
– Kimball Model: Data mart approach (bottom-up)
– Which model is best?
• Table 3.3 provides a comparative analysis between EDW and
Data Mart approach
• Another alternative is the hosted data warehouses
Comparing EDW and Data Mart (1 of 2)
Table 3.3 Contrasts between the DM and EDW Development Approaches