U23CBE509-Data Mining and Analytics

The document outlines the course structure for 'Data Mining and Analytics' in a B.Tech program, detailing objectives, outcomes, and evaluation methods. It includes topics such as data preprocessing, association rules, linear models, and time series analysis, with corresponding credit and marks distribution. Additionally, it lists recommended textbooks and web resources for further learning.

Uploaded by vijiperumal

Department: CSBS
Programme: B.Tech.
Semester: V
Course Category: PE
*End Semester Exam Type: TE

Course Code: U23CBE509
Course Name: DATA MINING AND ANALYTICS
Periods / Week: L 2, T 0, P 0
Credit: C 2
Maximum Marks: CAM 25, ESE 75, TM 100

Course Objectives
1) To introduce the fundamental concepts of data mining and data representation.
2) To learn the data preprocessing task and attribute oriented analysis.
3) To understand the association rules, classification and prediction algorithms.
4) To learn and apply the linear models of data analysis.
5) To understand the time series analysis and aspects of prescriptive analysis.

Course Outcomes
On completion of the course, the students will be able to (BT Mapping, Highest Level, in parentheses):
CO1 Understand the fundamentals of data mining and data representation. (K2)
CO2 Perform preprocessing tasks for the data set. (K2)
CO3 Apply association rules and predictive methods for data mining. (K3)
CO4 Build data models using linear regression techniques. (K3)
CO5 Gain knowledge on time series analysis and prescriptive analysis. (K2)

UNIT-I INTRODUCTION AND KNOWLEDGE REPRESENTATION (9 Hrs) CO1
Introduction - Related technologies - Machine Learning, DBMS, OLAP, Statistics, Stages of the Data Mining Process, Data Mining Techniques, Knowledge Representation Methods, Task relevant data, Background knowledge, Representing input data and output knowledge, Visualization techniques, Applications.
UNIT-II DATA PREPROCESSING (9 Hrs) CO2
Data preprocessing: Data cleaning, Data transformation, Data reduction, Discretization and generating concept hierarchies.
Attribute-oriented analysis: Attribute generalization, Attribute relevance, Class comparison, Statistical measures.
UNIT-III ASSOCIATION AND MINING METHODS (9 Hrs) CO3
Association rules: Motivation and terminology, Basic idea: item sets, Generating item sets and rules efficiently, Correlation analysis.
Classification: Basic learning/mining tasks, Inferring rudimentary rules: 1R algorithm, Decision trees, covering rules.
Prediction: The prediction task, Statistical (Bayesian) classification, Bayesian networks, Instance based methods (nearest neighbor), linear models.
UNIT-IV LINEAR MODELS (9 Hrs) CO4
Descriptive analytics: Data Modeling, Trend Analysis, Simple Linear Regression Analysis.
Forecasting models: Heuristic methods, predictive modeling and pattern discovery.
Logistic Regression: Logit transform, ML estimation, Tests of hypotheses, Wald test, LR test, score test, test for overall regression, multiple logistic regression, forward, backward method, interpretation of parameters, relation with categorical data analysis. Interpreting Regression Models, Implementing Predictive Models.
Generalized Linear model: Link functions such as Poisson, binomial, inverse binomial, inverse Gaussian, Gamma.
UNIT-V TIME SERIES ANALYSIS (9 Hrs) CO5
Time Series Analysis: Autocovariance, autocorrelation and their properties. Exploratory time series analysis, Test for trend and seasonality, Exponential and moving average smoothing, Holt-Winters smoothing, forecasting based on smoothing.
Linear time series models: Autoregressive, Moving Average, Autoregressive Moving Average and Autoregressive Integrated Moving Average models; Estimation of ARIMA models such as Yule-Walker estimation for AR Processes, Maximum likelihood and least squares estimation for ARIMA Processes, Forecasting using ARIMA models.
Prescriptive Analytics: Mathematical optimization, Networks modeling, Multi-objective optimization, Stochastic modeling, Decision and Risk analysis, Decision trees.
Content beyond Syllabus
Non-Linear Regression Models
Text Books
1. Jiawei Han and Micheline Kamber, “Data Mining Concepts and Techniques”, Third Edition, Elsevier, 2012.
2. Lior Rokach and Oded Maimon, “Data Mining and Knowledge Discovery Handbook”, Second Edition, Springer, 2010.
3. Ian H. Witten, Eibe Frank and Mark A. Hall, “Data Mining: Practical Machine Learning Tools and Techniques”, Fourth Edition, Elsevier, 2017.
Reference Books
1. Box, G. E. P. and Jenkins, G. M., “Time Series Analysis: Forecasting and Control”, Holden-Day, 1970.
2. Draper, N. R. and Smith, H., “Applied Regression Analysis”, Third Edition, John Wiley, 1998.
3. Hosmer, D. W. and Lemeshow, S., “Applied Logistic Regression”, Third Edition, Wiley, 2003.
Web References
1. https://nptel.ac.in/courses/106/105/106105174/
2. https://nptel.ac.in/courses/110/106/110106072/
3. https://www.tutorialspoint.com/data_mining/index.htm
4. https://www.javatpoint.com/data-mining
5. https://www.guru99.com/data-mining-tutorial.html
* TE – Theory Exam, LE – Lab Exam

COs/POs/PSOs Mapping

     Program Outcomes (POs)                                      Program Specific Outcomes (PSOs)
COs  PO1  PO2  PO3  PO4  PO5  PO6  PO7  PO8  PO9  PO10 PO11 PO12 PSO1 PSO2 PSO3
1    2    1    -    -    -    -    -    -    -    -    -    -    2    1    1
2    2    1    -    -    -    -    -    -    -    -    -    -    2    1    1
3    3    2    1    1    -    -    -    -    -    -    -    1    2    1    -
4    3    2    1    1    -    -    -    -    -    -    -    1    2    -    1
5    2    1    -    -    -    -    -    -    -    -    -    1    2    1    1

Correlation Level: 1 - Low, 2 - Medium, 3 – High

Evaluation Method

Assessment: Continuous Assessment Marks (CAM): CAT1, CAT2, Model Exam, Assignment*, Attendance | End Semester Examination (ESE) Marks | Total Marks
Marks: 10 5 5 5 | 75 | 100

* Application oriented / Problem solving / Design / Analytical in content beyond the syllabus
