DataMining Syllabus

This document outlines two courses: Data Mining (BCS-DS-632), a 3-credit theory course, and Data Mining Lab using WEKA (BCS-DS-682), a 1-credit lab course in which students apply data mining techniques hands-on in WEKA. Both are assessed through continuous evaluation and an end-semester examination. The course outcomes include skills in data preprocessing, classification, clustering, association rule mining and evaluating model performance. The lab programs cover classification, clustering and association rule mining on various datasets. Students also design ensemble models and projects that analyze real-world datasets.

Uploaded by

shouryabiz07

MANAV RACHNA INTERNATIONAL INSTITUTE OF RESEARCH AND STUDIES

(Deemed to be University under section 3 of the UGC Act 1956)


NAAC 'A' Grade University

BCS-DS-632: Data Mining

Periods/week: L: 3, T: 0
Credits: 3.0
Max. Marks: 200
Continuous Evaluation: 100
End Sem Examination: 100
Duration of Examination: 3 Hrs

Pre-Requisite: Database Management Systems (BCS-DS-404)


Course Type: Program Electives

Course Outcomes: Students will be able to-


BCS-DS-632.1. Identify appropriate data mining algorithms to solve real world problems.
BCS-DS-632.2. Compare and evaluate different data mining techniques like classification, prediction, clustering and association rule mining.
BCS-DS-632.3. Describe complex data types with respect to spatial and web mining.
BCS-DS-632.4. Develop skills to write queries using DMQL.
BCS-DS-632.5. Develop skill in selecting the appropriate data mining algorithm for solving practical problems.
PART-A

Unit 1: Data Mining & Business Intelligence:


1.1 Introduction: Scope of Data Mining, What is Data Mining
1.2 How Data Mining Works, Predictive Modeling
1.3 Data Mining and Data Warehousing: Architecture for Data Mining
1.4 Profitable Applications, Data Mining Tools
1.5 Business Intelligence, Business Intelligence Tools
1.6 Business Intelligence Infrastructure, Business Intelligence Applications
1.7 BI versus Data Warehouse, BI versus Data Mining, Future of BI

Unit 2: Data Preprocessing:


2.1 Introduction, Data Preprocessing Overview
2.2 Data Cleaning, Data Integration and Transformation
2.3 Data Reduction, Discretization
2.4 Concept Hierarchy Generation
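
As an illustration of the discretization topic in Unit 2, the following is a minimal sketch of equal-width binning in plain Python. It is not the WEKA Discretize filter; the function name equal_width_bins and the sample temperature values are assumptions made for this example only.

```python
def equal_width_bins(values, n_bins):
    """Map each numeric value to the index of one of n_bins equal-width intervals."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so a single bin is enough.
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    labels = []
    for v in values:
        idx = int((v - lo) / width)
        # The maximum value would land in bin n_bins, so clamp it into the last bin.
        labels.append(min(idx, n_bins - 1))
    return labels


# Hypothetical numeric attribute (for example, a temperature column).
temperatures = [64, 65, 68, 69, 70, 71, 72, 75, 80, 85]
bins = equal_width_bins(temperatures, 3)
print(bins)
```

Each bin index can then be treated as a categorical value, which is the kind of numeric-to-categorical conversion that discretization performs.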

Unit 3: Data Mining Techniques- An Overview:


3.1 Introduction, Data Mining
3.2 Data Mining versus Database Management System
3.3 Data Mining Techniques:
3.4 Association Rules, Classification, Regression
3.5 Clustering, Neural Networks
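
To make the association rules topic in Unit 3 concrete, here is a small sketch of how the support of an itemset and the confidence of a rule can be computed. The function names and the toy transactions are assumptions for illustration; this is not a full rule-mining algorithm such as Apriori.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Estimate of P(consequent | antecedent) for the rule antecedent -> consequent."""
    ante = support(transactions, antecedent)
    both = support(transactions, set(antecedent) | set(consequent))
    return both / ante if ante else 0.0


# Hypothetical market-basket data.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]
sup = support(transactions, {"bread", "milk"})
conf = confidence(transactions, {"bread"}, {"milk"})
print(sup, conf)
```

A rule is usually reported only when both values exceed user-chosen minimum thresholds.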
PART-B

Unit 4: Clustering:
4.1 Introduction, Clustering
4.2 Cluster Analysis
4.3 Clustering Methods: K-means, Hierarchical Clustering
4.4 Agglomerative Clustering, Divisive Clustering
4.5 Clustering and Segmentation Software, Evaluating Clusters
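
The K-means method listed in Unit 4 can be sketched in a few lines of plain Python. This is a simplified version that assumes the caller supplies the initial centroids (real tools usually pick them at random); the function name kmeans and the sample points are hypothetical. It requires Python 3.8 or later for math.dist.

```python
import math


def kmeans(points, centroids, max_iter=100):
    """Alternate between assigning points to the nearest centroid and re-averaging."""
    centroids = [tuple(c) for c in centroids]
    clusters = [[] for _ in centroids]
    for _ in range(max_iter):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for i, members in enumerate(clusters):
            if members:
                new_centroids.append(
                    tuple(sum(coord) / len(members) for coord in zip(*members)))
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[i])
        if new_centroids == centroids:
            break  # Centroids stopped moving, so the assignment is stable.
        centroids = new_centroids
    return centroids, clusters


# Hypothetical 2-D data with two visually separate groups.
points = [(1, 1), (1, 2), (2, 1), (8, 8), (9, 8), (8, 9)]
centroids, clusters = kmeans(points, [(1, 1), (8, 8)])
print(centroids)
print(clusters)
```

The result depends on the initial centroids, which is one reason cluster evaluation appears as its own topic in this unit.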

Unit 5: Web Mining:


5.1 Introduction, Terminologies
5.2 Categories of Web Mining – Web Content Mining
5.3 Web Structure Mining
5.4 Web Usage Mining
5.5 Applications of Web Mining, Agent-based and Database Approaches
5.6 Web Mining Software

Unit 6: Applications of Data mining:


6.1 Introduction, Business Applications Using Data Mining
6.2 Risk Management and Targeted Marketing
6.3 Customer Profiles and Feature Construction
6.4 Medical Applications (Diabetic Screening)
6.5 Scientific Applications Using Data Mining, Other Applications

Text Books / Reference Books:


1. Arun K. Pujari, 2010, Data Mining Techniques, 2nd Edition, Orient Black Swan/Universities Press.
2. Alex Berson, 2004, Data Warehousing, Data Mining and OLAP, 1st Edition, MGH.
3. W.H. Inmon, 2005, Building the Data Warehouse, 4th Edition, John Wiley & Sons.
4. W.H. Inmon, C. Kelley, 1993, Developing the Data Warehouses, John Wiley & Sons.
5. Sourav S. Bhowmick, Sanjay K. Madria, Wee K. Ng, 2003, Web Data Management, Springer.
6. Pieter Adriaans & Dolf Zantinge, 1997, Data Mining, Pearson.

Instructions for paper setting: Seven questions are to be set in total. The first question will be conceptual, covering the entire syllabus, and will be compulsory. Three questions will be set from each of Part A and Part B (one from each unit). Students need to attempt two questions out of three from each part. Each question will carry 20 marks.

Distribution of Continuous Evaluation:


Sessional- I 30%
Sessional- II 30%
Assignment/Tutorial 20%
Class Work/ Performance 10%
Attendance 10%

Evaluation Tools:
Assignment/Tutorials
Sessional tests
Surprise questions during lectures/Class Performance
End Sem examination

COURSE ARTICULATION MATRIX:

CO Statement (BCS-DS-632)  PO1  PO2  PO3  PO4  PO5  PO6  PO7  PO8  PO9  PO10 PO11 PO12 PSO1 PSO2 PSO3
BCS-DS-632.1               3    -    2    -    1    2    1    -    2    1    1    1    -    1    2
BCS-DS-632.2               -    2    3    2    -    1    1    1    -    1    1    2    -    2    -
BCS-DS-632.3               3    -    2    -    1    1    -    2    2    -    2    3    1    -    1
BCS-DS-632.4               2    -    1    3    3    -    2    -    1    -    1    -    2    1    1
BCS-DS-632.5               -    2    3    2    2    -    -    1    -    1    -    -    3    1    -

BCS-DS-682: Data mining Lab using WEKA

Periods/week: P: 2
Credits: 1.0
Max. Marks: 100
Continuous Evaluation: 50
End Sem Examination: 50
Duration of Examination: 2 Hrs

Co-Requisite: Data Mining (BCS-DS-632)


Course Type: Program Electives

Course Outcomes: Students will be able to-


BCS-DS-682.1. Relate the concepts of loading, preprocessing, and analysis of large data sets.
BCS-DS-682.2. Describe the concept of various classification models and draw connections between these models.
BCS-DS-682.3. Implement various clustering techniques and draw connections between these techniques.
BCS-DS-682.4. Apply and explore the ideas to find associations among data.
BCS-DS-682.5. Design and analyze new models to solve various real world problems.
BCS-DS-682.6. Plan and generate alternate solutions to the existing framework, and study and analyze the results through data visualization.

Program 1: Use the Boston House Price Dataset, i.e. housing.arff. Apply all preprocessing algorithms and create a version of the initial data set in which the categorical data are converted into numerical data.
Program 2: Use all the above algorithms to classify weather data from the “weather.arff” file. Perform initial preprocessing and create a version of the initial data set in which all numeric attributes are converted to categorical data.
Program 3: Apply the k-means algorithm to bank data from the “bank.arff” file. Perform initial preprocessing and create a version of the initial data set in which the ID field is removed and the "children" attribute is converted to categorical data.
Program 4: Use the Apriori algorithm to generate association rules for Iris data from the “iris.arff” file. Perform preprocessing and convert categorical data into numeric attributes and analyze results.
Program 5: Use the “vote.arff” file to apply various attribute selection algorithms and evaluate various performance measures.
Program 6: Use the “diabetes.arff” file to evaluate various performance parameters for any three classifiers. Then generate and display a comparison of all the above performance parameters through various charts or graphs.
Program 7: Design and create an ensemble model using more than one classification algorithm (Bagging and Boosting).
Program 8: Design and create an ensemble model using more than two classification algorithms (Boosting).
Program 9: Design a project for the analysis and identification of handwritten digits using a neural network model.
Program 10: Design a project for the analysis and prediction of students’ behavior using suitable algorithms.
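
The performance measures that Programs 5 and 6 ask for are typically derived from a confusion matrix. The sketch below computes accuracy, precision, recall and F1 for a two-class problem in plain Python. It is independent of WEKA, and the function name binary_metrics, the labels "pos"/"neg" and the sample predictions are assumptions for illustration, not actual classifier output on diabetes.arff.

```python
def binary_metrics(actual, predicted, positive):
    """Compute common two-class performance measures from paired label lists."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == positive and p == positive)
    fp = sum(1 for a, p in pairs if a != positive and p == positive)
    fn = sum(1 for a, p in pairs if a == positive and p != positive)
    tn = sum(1 for a, p in pairs if a != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical true labels and predictions from one classifier.
actual = ["pos", "pos", "pos", "neg", "neg", "neg", "neg", "pos"]
predicted = ["pos", "neg", "pos", "neg", "pos", "neg", "neg", "pos"]
metrics = binary_metrics(actual, predicted, positive="pos")
print(metrics)
```

Running the same function on the predictions of each of the three classifiers gives a small table of values that can be plotted side by side for the comparison.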

Text Books/ Reference Books:


1. Bostjan Kaluza, 2013, Instant Weka How-to, Packt Publishing
2. Eric Goh, 2019, Learn By Examples - A Quick Guide to Data Mining with Weka and Java using Weka, SVBook Pte. Ltd.
3. Eric Goh, 2019, Learn By Examples - A Quick Guide to Data Mining with RapidMiner and Weka, SVBook Pte. Ltd.

Software required/Weblinks:
WEKA 3.8.3
www.cs.waikato.ac.nz
http://wekatutorial.com
www.tutorialspoint.com

Distribution of Continuous Evaluation:


Viva- I 30%
Viva- II 30%
File/Records 20%
Class Work/ Performance 10%
Attendance 10%

Evaluation Tools:
Experiments in lab
File work/Class Performance
Viva (Question and answers in lab)
End Term Practical Exam

COURSE ARTICULATION MATRIX:

CO Statement (BCS-DS-682) PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
BCS-DS-682.1 3 3 3 3 1 1 1 1 1 1 1 1 1 1 1
BCS-DS-682.2 3 3 3 3 3 3 2 1 3 2 2 3 3 2 2
BCS-DS-682.3 3 3 3 3 3 3 2 1 3 2 2 3 3 2 2
BCS-DS-682.4 3 3 3 3 3 3 2 1 2 2 2 3 3 2 2
BCS-DS-682.5 3 3 3 3 3 3 2 1 3 1 3 3 3 2 3
BCS-DS-682.6 3 3 3 3 3 3 2 1 2 1 3 3 3 2 3
