CS402 Data Mining and Warehousing PDF
Course code: CS402
Course Name: DATA MINING AND WAREHOUSING
L-T-P-Credits: 3-0-0-3
Year of Introduction: 2016
Course Objectives:
To introduce the concepts of data mining and its applications.
To understand the investigation of data using practical data mining tools.
To introduce Association Rules Mining.
To introduce advanced Data Mining techniques.
Syllabus:
Data Mining, Applications, Data Mining Models, Data Warehousing and OLAP, Challenges,
Tools, Data Mining Principles, Data Preprocessing: Data Preprocessing Concepts, Data
Visualization, Data Sets and Their Significance, Classification Models, Multi Resolution Spatial
Data Mining, Classifiers, Association Rules Mining, Cluster Analysis, Practical Data Mining
Tools, Advanced Data Mining Techniques, Web Mining, Text Mining, CRM Applications and
Data Mining, Data warehousing.
Expected Outcome:
The student will be able to:
i. identify the key processes of data mining and warehousing
ii. apply appropriate techniques to convert raw data into a suitable format for practical data
mining tasks
iii. analyze and compare various classification algorithms and apply them in the appropriate domain
iv. evaluate the performance of various classification methods using performance metrics
v. make use of the concept of association rule mining in real-world scenarios
vi. select appropriate clustering algorithms for various applications
vii. extend data mining methods to new domains of data
Text Books:
1. Dunham M H, “Data Mining: Introductory and Advanced Topics”, Pearson Education,
New Delhi, 2003.
2. Jiawei Han and Micheline Kamber, “Data Mining: Concepts and Techniques”, Elsevier,
2006.
References:
1. M Sudeep Elayidom, “Data Mining and Warehousing”, 1st Edition, 2015, Cengage
Learning India Pvt. Ltd.
2. Mehmed Kantardzic, “Data Mining Concepts, Methods and Algorithms”, John Wiley
and Sons, USA, 2003.
3. Pang-Ning Tan and Michael Steinbach, “Introduction to Data Mining”, Addison Wesley,
2006.
Regression.
SECOND INTERNAL EXAM
(Hours: 8; Sem. exam marks: 20%)
Association Rules Mining: Concepts, Apriori and FP-Growth
Algorithm. Cluster Analysis: Introduction, Concepts, Types of
data in cluster analysis, Categorization of clustering methods.
Partitioning method: K-Means and K-Medoid Clustering.
Hierarchical Clustering method: BIRCH. Density-Based
Clustering: DBSCAN and OPTICS.
Module VI (Hours: 8; Sem. exam marks: 20%)
Advanced Data Mining Techniques: Introduction, Web
Mining: Web Content Mining, Web Structure Mining, Web
Usage Mining. Text Mining.
Graph mining: Apriori-based approach for mining frequent
subgraphs. Social Network Analysis: characteristics of social
networks. Link mining: Tasks and challenges.
END SEMESTER EXAMINATION
5. Part D
a. Total marks: 24
d. Each question can have maximum THREE subparts.