Data Mining
Data Mining
CO1 :: understand the overall architecture of a data warehouse, techniques, and methods for
data gathering and data pre-processing
CO2 :: categorize the high dimensional data for better organization of the data
CO3 :: outline the importance of data preprocessing in the data mining process
CO4 :: analyze the major issues in the data mining process and the need for association rule
mining
CO5 :: employ the various classification algorithms in real life data mining problems
Unit I
Data warehouse introduction : concepts of data warehouse, OLAP and OLTP systems, multitier
architecture of data warehouse, OLAP server, ETL process, data warehouse models
Unit II
Data warehouse modeling : data cube, schemas for multidimensional data models, OLAP
operations
General data features : data objects and attributes, measuring the central tendency, measuring the
dispersion of data
Unit III
Techniques for data preprocessing. : data cleaning, data integration, data reduction, data
transformation, data discretization
Unit IV
Introduction to data mining and association mining : Scope of data mining, knowledge
discovery process, data mining applications, major issues in data mining, market basket analysis,
association rule mining, apriori algorithm
Unit V
Methods for data classification : basic of classification, decision tree induction, bayes classification,
model evaluation and selection, techniques to improve classification accuracy
Unit VI
Cluster analysis : concept of clustering, partitioning methods, hierarchical methods, density-based
methods
Text Books:
1. DATA MINING: INTRODUCTORY AND ADVANCED TOPICS by MARGARET H.DUNHAM,
PEARSON
References:
1. DATA MINING: CONCEPTS AND TECHNIQUES by JIAWEI HAN, MICHELINE KAMBER, JIAN
PEI, MORGAN KAUFMANN
2. INTRODUCTION TO DATA MINING WITH CASE STUDIES by GUPTA, G. K., PHI Learning