0% found this document useful (0 votes)
310 views6 pages

Unit - 1 INTRODUCTION, DATA - 1: What Is Data Mining? Motivating Challenges The Origins of Data 6 Hours

This document contains a lesson plan for a Data Mining course taught over 8 units and 33 sessions. The plan lists the content covered in each session, including introductions to key data mining tasks, techniques for data preprocessing, classification algorithms like decision trees and nearest neighbors, association analysis methods, and applications. Sessions involve lectures, discussions, examples, and assignments to explain concepts and engage students.

Uploaded by

jain
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
310 views6 pages

Unit - 1 INTRODUCTION, DATA - 1: What Is Data Mining? Motivating Challenges The Origins of Data 6 Hours

This document contains a lesson plan for a Data Mining course taught over 8 units and 33 sessions. The plan lists the content covered in each session, including introductions to key data mining tasks, techniques for data preprocessing, classification algorithms like decision trees and nearest neighbors, association analysis methods, and applications. Sessions involve lectures, discussions, examples, and assignments to explain concepts and engage students.

Uploaded by

jain
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
You are on page 1/ 6

Aca Format X

BAHUBALI COLLEGE OF ENGINEERING, SHRAVANABELAGOLA


Lesson/Session Plan Template
Department of Information Science & Engineering
Sub code: 06IS74
Sub: DATA MINING Sem: VII
S Date Content Activity
N
UNIT - 1
INTRODUCTION, DATA – 1: What is Data Mining? Motivating Challenges; The origins of data
mining; Data Mining Tasks. Types of Data; Data Quality.
6 Hours
1 What is Data Mining? Participation
&discussion
2 Motivating Challenges;
3 The origins of data mining; Explain with
an example
4 Data Mining Tasks. Discussions
5 Types of Data;
6 Types of Data Discussions
7 Data Quality.
Measurement & data Collection Issues
Issues Related To Application
8 QP
Assignment
UNIT – 2
DATA – 2: Data Preprocessing; Measures of Similarity and Dissimilarity
6 Hours
9 Data Preprocessing Participation
Aggregation &discussion
Sampling

10 Data Preprocessing cont.. Discussions


• Dimensionality Reduction
• Feature Subset Selection
• Feature Creation

11 Data Preprocessing cont Explain with


Discretization & Binarization an example
Variable Transformation
12 Measures of Similarity and Dissimilarity
•Basics
•Similarity & Dissimilarity between Simple Attributes
•Dissimilarities Between Data Objects

13 Measures of Similarity and Dissimilarity Explain with


• Similarities Between Data Objects an example
• Examples if Proximity Measures
14 Issues in Proximity Calculation

1
Aca Format X

Selecting The Right Proximity Measure


15 Question Paper
16 Question Paper
UNIT - 7
FURTHER TOPICS IN DATA MINING: Multidimensional analysis and descriptive mining of
complex data objects; Spatial data mining; Multimedia data mining; Text mining; Mining the WWW.
Outlier analysis.
7 Hours
17 Multidimensional analysis and descriptive mining of complex data Participation
objects &discussion
•Generalization of Structured Data
•Aggregation and Approximation in Spatial and Multimedia
Data Generalization
•Generalization of Object Identifiers and Class/subclass
Hierarchies

18 Multidimensional analysis and descriptive mining of complex data


objects cont
• Generalization of Class Composition Hierarchies
• Construction and Mining of Object Cubes
• Generalization Based Mining of Plan Databases by Divide
and Conquer
19 Spatial data mining; Discussions
• Spatial data Cube Construction and Spatial OLAP
• Mining Spatial Association and Co-location Patterns
20 Spatial Clustering Methods
Spatial Classification and Spatial Trend Analysis
21 Multimedia data mining; Mining Raster Databases
• Similarity Search in Multimedia Data
• Multidimensional Analysis of Multimedia Data
22 Multimedia data mining; Mining Raster Databases cont..
• Classification and Predication Analysis of Multimedia Data
• Mining Association in Multimedia Data
• Audio & Video Data Mining
23 Text mining Text Data Analysis and Information Retrieval
• Dimensionality Reduction for Text
• Text Mining Approach
24 Mining the WWW.
• Mining the Web page layout structure
• Mining the Web link Structure to Identify Authoritative

2
Aca Format X

Web Pages
25 Mining Multimedia Data on the Web
•Automatic Classification of Web Documents
•Web Usage Mining
UNIT - 8
APPLICATIONS: Data mining applications; Data mining system products and research prototypes;
Additional themes on Data mining; Social impact of Data mining; Trends in Data mining.
6 Hours
26 Data mining applications; Participation
&discussion
•Data mining for Financial Data Analysis
•Retail Industry
•Telecommunication Industry
27 Biological Data Analysis
Other Scientific Application
Intrusion Detection
28 Data mining system products and research prototypes; Explain with
•How to Choose a Data mining System an example
•Examples of Commercial Data Mining Systems
29 Additional themes on Data mining Discussions
• Theoretical Foundation of Data Mining
• Statistical Data Mining
30 Visual and Audio Data Mining
Data Mining Privacy and Data Security
31 Social impact of Data mining;
• Ubiquitous and Invisible Data Mining
• Data Mining Privacy and Data Security
32 Trends in Data mining.
33 Question Bank Discussions
UNIT – 3
CLASSIFICATION: Preliminaries; General approach to solving a classification problem; Decision
tree induction; Rule-based classifier; Nearest-neighbor classifier.
8 Hours
34 General approach to solving a classification problem; Participation
&discussion
35 Decision tree induction; Discussions
• How a Decision Tree Works
• How To Build A Decision Tree
• Method for expressing attribute test conditions

36 Decision tree induction cont… Discussions

3
Aca Format X

• Measure for selecting the best split


• Algorithm for decision tree induction
• An example : web robot detection
• Characteristics Of decision tree induction

37 Rule-based classifier Discussions


• How a rule based classifier works
• Rule ordering schemes
• How to build a rule based classifier
38 Rule-based classifier cont..;
• Direct methods for rule extraction
• Indirect method for rule extraction
• Characteristics of rule based classifier
39 Nearest-neighbor classifier. Discussions
• Algorithm
• Characteristics Of Nearest Neighbor Classifier
40 Question Paper Assignement
UNIT - 4
ASSOCIATION ANALYSIS – 1: Problem Definition; Frequent Itemset generation; Rule
Generation; Compact representation of frequent itemsets; Alternative methods for generating frequent
itemsets.
6 Hours
41 Problem Definition; Participation
&discussion
42 Frequent Itemset generation;
• The Apriori Principal
• Frequent Itemset Generation in the Apriori Algorithm
• Candidate Generation and Pruning
• Support Counting
• Computational Complexity
43 Rule Generation; Discussions
•Confidence Based Pruning
•Rule Generation in Apriori Algorithm
•An Example: Congressional Voting Records
44 Compact representation of frequent itemsets;
•Maximal Frequent Itemsets
•Closed Frequent Itemsets
45 Alternative methods for generating frequent itemsets. Discussions
46 Alternative methods for generating frequent itemsets.
47 Question paper Assignment
UNIT - 5
ASSOCIATION ANALYSIS – 2: FP-Growth algorithm, Evaluation of association patterns; Effect

4
Aca Format X

of skewed support distribution; Sequential patterns.


6 Hours
48 FP-Growth algorithm, FP Tree Representation Participation
&discussion
Frequent Itemset
Generation in FP Growth Algorithm
49 Evaluation of association patterns; Discussions
• Objective Measures of Interestingness
• Measure beyond pairs of Objective measures of
Interestingness binary variables
• Simson’s Paradox
50 Effect of skewed support distribution;
51 Problem Formulation Explain with
an example
• Sequential Pattern Discovery
• Timing Constraints
• Alternative Counting Schemes
52 Sequential patterns
53 Question paper Assignment
UNIT - 6
CLUSTER ANALYSIS: Overview, K-means, Agglomerative hierarchical clustering, DBSCAN,
Overview of Cluster Evaluation.
7 Hours
54 Overview, Participation
&discussion
• What Is Cluster Analysis
• Different Types of Clustering
• Different Types of Clusters
55 K-means,
• The basic K-means Algorithm
• K-means: Additional issues
• Bisecting K-Means
• K-Means and Different Types of Cluster
• Strength and Weaknesses
• K-means as an Optimization Problem
56 Agglomerative hierarchical clustering Discussions
• Basic Agglomerative Hierarchical Clustering Algorithm
• Specific Techniques
• The Launce-Williams Formula for Cluster
• Key issue in Hierarchical Clustering
• Strength & Weakness
57 DBSCAN

5
Aca Format X

• Traditional Density: Center-Based Approach


• The DBSCAN Algorithm
• Strengths and Weaknesses
58 Overview of Cluster Evaluation. Discussions
• Overview
• Unsupervised Cluster Evaluation Using Cohesion and
Separation
• Unsupervised Cluster Evaluation Using Proximity Matrix
• Unsupervised Evaluation of Hierarchical Clustering

59 Overview of Cluster Evaluation.


• Determining the correct Number of Clusters
• Clustering Tendency
• Supervised Measures of Cluster Validity
• Assessing the Significance of Cluster Validity Measures
60 Question paper Assignment

TEXT BOOKS:
1. Introduction to Data Mining - Pang-Ning Tan, Michael Steinbach, Vipin Kumar,
Pearson Education, 2007
2. Data Mining – Concepts and Techniques - Jiawei Han and Micheline Kamber, 2nd
Edition, Morgan Kaufmann, 2006.

REFERENCE BOOKS:
1. Insight into Data Mining – Theory and Practice - K.P.Soman, Shyam Diwakar,
V.Ajay, PHI, 2006

You might also like