0% found this document useful (0 votes)
151 views17 pages

Lecture 1

The document discusses an introduction to data warehousing and data mining course. It provides an overview of topics to be covered in the course including introduction and background, data normalization, online analytical processing, dimensional modeling, extract-transform-load, data quality management, speed techniques, data mining, data warehouse implementation steps, a case study, lab work, and other topics. It also outlines the semester project which involves developing an application for an organization and documenting the process. The course aims to develop an understanding of database concepts in very large databases and data warehouses.

Uploaded by

Sana Mehmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
151 views17 pages

Lecture 1

The document discusses an introduction to data warehousing and data mining course. It provides an overview of topics to be covered in the course including introduction and background, data normalization, online analytical processing, dimensional modeling, extract-transform-load, data quality management, speed techniques, data mining, data warehouse implementation steps, a case study, lab work, and other topics. It also outlines the semester project which involves developing an application for an organization and documenting the process. The course aims to develop an understanding of database concepts in very large databases and data warehouses.

Uploaded by

Sana Mehmood
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

COMSATS University 1

Data Warehousing & Data Mining

LECTURE-1
INTRODUCTION AND BACKGROUND

Zahoor Tanoli (PhD)


2

Introduction and
Background
Reference Books
 W. H. Inmon, Building the Data Warehouse
3
(Second Edition), John Wiley & Sons Inc., NY.

 A. Abdullah, “Data Warehousing for beginners: Concepts & Issues” (First


Edition).

 Paulraj Ponniah, Data Warehousing Fundamentals,


John Wiley & Sons Inc., NY.
Additional Material
4
 Research Papers

 Magazine Articles
Summary of course
Topics (Total Lectures = 45)
1. Introduction & Background
2. De-normalization
3. On Line Analytical Processing (OLAP)
4. Dimensional modeling
5. Extract – Transform – Load (ETL)
6. Data Quality Management (DQM)
7. Need for speed (Parallelism, Join and Indexing techniques)
8. Data Mining
9. DWH Implementation steps
10. Complete implementation case study
11. Lab and tool usage
12. Others 5
Summary of course

Topics
1. Introduction & Background
2. De-normalization
3. On Line Analytical Processing (OLAP)
4. Dimensional modeling

6
Summary of course

Topics
5. Extract – Transform – Load (ETL)
6. Data Quality Management (DQM)
7. Need for speed (Parallelism, Join and
Indexing techniques)
8. Data Mining
9. DWH Implementation steps

7
Summary of course

Topics
10. Complete implementation case study
11. Lab and tool usage
12. Others

8
Semester Project
9
Develop an application for an organization of your choice.

A case study and coding based approach to be followed.

Use 4GL or a high level programming language.

You MUST collect the necessary data and should have a first
draft of the project description approved by the instructor
BEFORE initiating on detailed work.
Semester Project (Cont…)
10
The project report to include, but is not limited to, the following
as documentation:
 Narrative description of business and tables of appropriate data.
 Descriptions of decisions to be supported by information produced by
system.
 Summary narrative of results produced.
 Structure charts, dataflow diagrams and/or other diagrams to
document the structure of the system.
 Listings of computer models/programs utilized.
 Reports displaying results.
 Recommended decision from results.
 User instructions.
Approach of the course
11
 Developan understanding of underlying RDBMS
concepts.

 Applythese concepts to VLDB DSS environments


and understand where and why they break down?

 Exposethe differences between RDBMS and Data


Warehouse in the context of VLDB.

 Provide the basics of DSS tools such as OLAP, Data


Mining and demonstrate their application.

 Demonstrate the application of DSS concepts and


limitations of the OLTP concepts through lab
exercises.
Why this course?
12
 The world is changing (actually changed), either change or be
left behind.

 Missing the opportunities or going in the wrong direction has


prevented us from growing.

 What is the right direction?


 Harnessing the data, in a knowledge driven economy.
The need

“Drowning in data and starving


for information”
Knowledge is power, Intelligence
is absolute power!

13
The need
$
POWER

INTELLIGENCE

KNOWLEDGE

INFORMATION

DATA

14
Historical overview

1960
Master Files & Reports

1965
Lots of Master files!

1970
Direct Access Memory & DBMS

1975
Online high performance transaction processing 

15
Historical overview

1980
PCs and 4GL Technology (MIS/DSS) 
1985 & 1990 
Extract programs, extract processing,
The legacy system’s web

16
Historical overview: Crisis of Credibility
What is the financial health of our company?

??

 

-10%

+10%



17

You might also like