Data Mining End 23 24

The document outlines the end semester examination for the Data Mining course at Motilal Nehru National Institute of Technology, covering various topics such as data mining tasks, frequent itemsets, SVM classification, decision tree algorithms, and neural networks. It includes specific questions related to data analysis techniques, sampling methods, and classification algorithms. The exam is structured to assess students' understanding of theoretical concepts and practical applications in data mining.

Uploaded by

Zeke. 1232

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views2 pages

Data Mining End 23 24

Uploaded by

Zeke. 1232

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

yRIG-211004 (4RGI)

Motilal Nehru National Institute of TechnologyAllahabad

Prayagraj-211004 [India]

Computer Science & Engineering

End Semester (Even) Examination 2024
Programme Name: B.Tech. Semester: VI

Course Code: CS16103 Course Name:Data Mining

Branch: Computer Science & Engineering Student Reg. No.: 2 0 2 | oo 3

Duration: 03 Hours Max. Marks: 60

Note: Attempt all guestions. In case of any doubts in any question, make suitable assumptions, state them, and
justify them
Marks

Q.1 (a) Discuss whether each of the following activities is a data mining task: (a) Dividing the [6]
customers of a company according to their profitability, (b) Sorting a student database
based on student identification numbers, (c) Predicting the outcomes of tossing a (fair)
pair of dice, (d) Predicting the future stock price of a company using historical records,
(e) Monitoring the heart rate of a patient for abnormalities, (f) Extracting the frequencies
of a sound wave.
(b) Using the data for age given (in increasing order) for the attribute age: 13, 15, 16, 16, 19, [6]
20, 20, 21, 22, 22) 25, 25, 25, 25, 30,, 33, 33, 35, 35, 35 35, 36, 40, 45, 46) 52, 70. Sketch
examples of each sampling technique: Sampling without Replacement (SRSWOR),
Sampling with Replacement (SRSVWR), and stratified sampling. Use samples of size five
and the strata "youth," "middle-aged," and "senior."

Q.2 (a) A database has 5 transactions. Let min sup= 60% and min_conf = 80%. [8]
TID Item_bought
T100 {M, O, N, K, E, Y}
T200 {D, O, N, K, E, Y}
T300 {M, A, K, E)
T400 (M,U, C, K, Y}
T500 {C, O, 0, K,I E)
Find all frequent itemsets using Apriori and FP-growth, respectively.
6) Discuss the advantages and disadvantages of Cosine, Jaccard and Simple Matching
Coefficient similarity measures in brief.

Q.3 (a) The support vector machine is a highly accurate classification method. However, SVM [6]
classifiers suffer from slow processing when training with a large set of data tuples.

Page 1 of 2
Discuss how to overcome this difficulty and develop a scalable SVM algorithm for
efficient SVM classification in large data sets.
(b) RID age income student credit rating Class: buyscomputer [6]
youth high no fair
2 youth high eKcellent
middleaged high no fair Ves
senior medium fair yes \
senio low yes fair yes
6 senio low yes excellent
middle aged Jow yes Cxcellent YeS

13
youth
youth
sernior
youth
medium
low
medium
medium
Yes

yes
tiiii:
fair
fair
fair
ecellent
ys
yes
12
middleaged medium no excellent yes
13
middle aged high yes air ves
14 senior medium no cxcellent no

Table 1: Computer Purchase Data

What would be the class label of the test tuple (age =youth, income = medium, student =
yes, credit rating = Excellent) using naive Bayesian classifier using the data in Table 1.
Q.4 (a) Consider the traditional decision tree algorithnm in your mind and provide the solution [8]
for the following issues related to the decision tree algorithm: (a) Handling continuous
attributes (b) Dealing with cost associated attributes, (c) Handling inherent bias
associated with information gain measure,(d) Handling missing values.
(b) Use these methods to normalize the following group of data: 200,300,400,600,1000 (a) [4]
min-max normalization by seting min= 0 and max = 1 (b) z-score normalization (c) z
score normalization using the mean absolute deviation instead of standard deviation (d)
normalization by decimal scaling

05 (a) Consider a multlayer feed-forward neural network that uses back propogation [8]
algorithm with given weight and bais values. Wi4 0.2, Wis =-.3, W24 =0.4, W2s =
0.1, W34=-0.5, W3s =0.2, W6 =-0.3, Ws6 =-0.2, 04 = -0.4, ; = 0.2, 6 = 0.1
1
The activation function 0, = 1+ei is used on each hidden or output unit j that receive an
input I, =ZiWyOi+, with respect to previous layer, i.
.2

W15
W46

WS6

What would be the predicted class for the test sample X= (1,0,0) using the mulilayer
feed-forward neural network classifier?
(b) Outline methods for addressing the class imbalance problem. Suppose a bank would like
[4]
to develop a classifier that guards against fraudulent credit card transactions. Illustrate
how you can induce a quality classifier based on a large set of non-fraudulent examples
and a very small set of fraudulent cases.

Page 2 of 2

Nptel ML Questions
No ratings yet
Nptel ML Questions
12 pages
Lecture 6 - Classification - SVM
No ratings yet
Lecture 6 - Classification - SVM
48 pages
Aam Ut-1 QB Ans - (Final)
No ratings yet
Aam Ut-1 QB Ans - (Final)
28 pages
Aam Ut-1 QB Ans (Final)
No ratings yet
Aam Ut-1 QB Ans (Final)
26 pages
Aam Ut-1 QB Ans
No ratings yet
Aam Ut-1 QB Ans
12 pages
Ia1 ML Scheme Common To Is, Ai, Cs
No ratings yet
Ia1 ML Scheme Common To Is, Ai, Cs
10 pages
23 24 Endsem
No ratings yet
23 24 Endsem
12 pages
ML Mu Qpapers 2022-2024
No ratings yet
ML Mu Qpapers 2022-2024
4 pages
DWDM Unit Wise Question Bank
No ratings yet
DWDM Unit Wise Question Bank
8 pages
ML SP24 Mid Term Exam - Solution
No ratings yet
ML SP24 Mid Term Exam - Solution
8 pages
Unit4 Mcqs
No ratings yet
Unit4 Mcqs
7 pages
MLfinal 1
No ratings yet
MLfinal 1
7 pages
MachineLearning MidTerm UMT Spring 2021
100% (1)
MachineLearning MidTerm UMT Spring 2021
12 pages
PCCCS504 Module 4
No ratings yet
PCCCS504 Module 4
4 pages
Previous Year Paper - Sem 7
No ratings yet
Previous Year Paper - Sem 7
12 pages
Survey Paper On Classification
No ratings yet
Survey Paper On Classification
6 pages
ML Papers
No ratings yet
ML Papers
10 pages
ML SP24 Mid Term Exam - Solution
No ratings yet
ML SP24 Mid Term Exam - Solution
8 pages
C3 DataMining
No ratings yet
C3 DataMining
3 pages
DMW MCQ
No ratings yet
DMW MCQ
388 pages
A H192009 Pages: 3: Answer All Questions, Each Carries 4 Marks
No ratings yet
A H192009 Pages: 3: Answer All Questions, Each Carries 4 Marks
3 pages
Machine Learning Question Bank
No ratings yet
Machine Learning Question Bank
7 pages
CAT2 Key
No ratings yet
CAT2 Key
10 pages
MFDS - Test 1 Problems
No ratings yet
MFDS - Test 1 Problems
9 pages
ML 20240315
No ratings yet
ML 20240315
8 pages
B.Tech May2022 Comp CSPE-64 Sem4
No ratings yet
B.Tech May2022 Comp CSPE-64 Sem4
4 pages
Unit 4 - Question Bank
No ratings yet
Unit 4 - Question Bank
11 pages
hw2 2011spring
0% (1)
hw2 2011spring
3 pages
Assignment Data Mining
No ratings yet
Assignment Data Mining
27 pages
DM-I Q Paper 2024
No ratings yet
DM-I Q Paper 2024
12 pages
DataMining - Workbook MCQ
No ratings yet
DataMining - Workbook MCQ
16 pages
Exam dm1 121017 Ans
No ratings yet
Exam dm1 121017 Ans
8 pages
Ijcsea 2
No ratings yet
Ijcsea 2
13 pages
ML Suggestion 2
No ratings yet
ML Suggestion 2
11 pages
Midterm F07 Solutions
No ratings yet
Midterm F07 Solutions
4 pages
COSC 6335 Data Mining (Dr. Eick) Solution Sketches Midterm Exam October 25, 2012
No ratings yet
COSC 6335 Data Mining (Dr. Eick) Solution Sketches Midterm Exam October 25, 2012
11 pages
Machine Learning CA 2
No ratings yet
Machine Learning CA 2
19 pages
Dsa - DK Question Paper
No ratings yet
Dsa - DK Question Paper
4 pages
CS246 Final Exam Solutions, Winter 2011
No ratings yet
CS246 Final Exam Solutions, Winter 2011
18 pages
B. Sc. H Computer S 3OWYH6v
No ratings yet
B. Sc. H Computer S 3OWYH6v
6 pages
Data Mining List of Important Question
No ratings yet
Data Mining List of Important Question
4 pages
Winsem2012-13 Cp0535 Modqst Model QP
No ratings yet
Winsem2012-13 Cp0535 Modqst Model QP
4 pages
Pyqp - Cs402-Qp-Jun21
No ratings yet
Pyqp - Cs402-Qp-Jun21
3 pages
Q1S 1
No ratings yet
Q1S 1
2 pages
ML Question Papers
No ratings yet
ML Question Papers
8 pages
1000099853
No ratings yet
1000099853
2 pages
HW1
No ratings yet
HW1
4 pages
CS 515 Data Warehousing and Data Mining
No ratings yet
CS 515 Data Warehousing and Data Mining
5 pages
Cluster Analysis: Classification Analysis, or Numerical Taxonomy
No ratings yet
Cluster Analysis: Classification Analysis, or Numerical Taxonomy
13 pages
Advantages:: Q.No 1.a Ans
No ratings yet
Advantages:: Q.No 1.a Ans
12 pages
Shivaji University, Kolhapur
No ratings yet
Shivaji University, Kolhapur
12 pages
Final Exam Review
No ratings yet
Final Exam Review
6 pages
B.Tech Degree S8 (S, FE) / S6 (PT) (S, FE) Examination June 2023 (2015 Scheme)
No ratings yet
B.Tech Degree S8 (S, FE) / S6 (PT) (S, FE) Examination June 2023 (2015 Scheme)
4 pages
DWM - END SEM LAB Questions
No ratings yet
DWM - END SEM LAB Questions
9 pages
Hands On Machine Learning With Scikit Learn and TensorFlow Techniques and Tools To Build Learning Machines 3rd Edition by OReilly Media ISBN 9781098122461 1098122461 Instant Download
100% (1)
Hands On Machine Learning With Scikit Learn and TensorFlow Techniques and Tools To Build Learning Machines 3rd Edition by OReilly Media ISBN 9781098122461 1098122461 Instant Download
75 pages
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
No ratings yet
1 Analytical Part (3 Percent Grade) : + + + 1 N I: y +1 I 1 N I: y 1 I
5 pages
Machine Learning: Feed Forward Neural Networks Backpropagation Algorithm Cnns and Rnns
No ratings yet
Machine Learning: Feed Forward Neural Networks Backpropagation Algorithm Cnns and Rnns
127 pages
Kernel PCA
No ratings yet
Kernel PCA
13 pages
Gujarat Technological University
0% (1)
Gujarat Technological University
2 pages
CEGP013091: 49.248.216.238 08/12/2018 13:08:58 Static-238
No ratings yet
CEGP013091: 49.248.216.238 08/12/2018 13:08:58 Static-238
3 pages
Text Document Classification Quiz: Q1. Classification Techniques Have Been Applied To
0% (3)
Text Document Classification Quiz: Q1. Classification Techniques Have Been Applied To
12 pages
Foundations Deep Learning Matt Monaco
No ratings yet
Foundations Deep Learning Matt Monaco
401 pages
Long Short-Term Memory Networks PDF
No ratings yet
Long Short-Term Memory Networks PDF
22 pages
SRM VALLIAMMAI 1924103-Machine-Learning
100% (1)
SRM VALLIAMMAI 1924103-Machine-Learning
10 pages
Week1 UDL CM20315 01 Intro
No ratings yet
Week1 UDL CM20315 01 Intro
49 pages
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
No ratings yet
Final Exam, Data Mining (CEN 871) : Name Surname: Student's ID
2 pages
4 4 Choosing The Right Activation Function For Neural Networks
No ratings yet
4 4 Choosing The Right Activation Function For Neural Networks
25 pages
Introduction To Radial Basis Function Networks
No ratings yet
Introduction To Radial Basis Function Networks
45 pages
DMDW
No ratings yet
DMDW
24 pages
Convolutional Neural Networks: Shusen Wang
No ratings yet
Convolutional Neural Networks: Shusen Wang
75 pages
Algorithm - Pseudocode of 2D CNN
No ratings yet
Algorithm - Pseudocode of 2D CNN
7 pages
Part 4 Mining Freqent Patterns
No ratings yet
Part 4 Mining Freqent Patterns
59 pages
Be Central
No ratings yet
Be Central
98 pages
Lecture 16-Multilayer Perceptron
No ratings yet
Lecture 16-Multilayer Perceptron
24 pages
CS8082 Unit 2
No ratings yet
CS8082 Unit 2
38 pages
Syl6 ML
No ratings yet
Syl6 ML
3 pages
Final Exam ANNFL 2015-1
No ratings yet
Final Exam ANNFL 2015-1
9 pages
Chapter 05 - Sharda 11e Full Accessible PPT 05
No ratings yet
Chapter 05 - Sharda 11e Full Accessible PPT 05
31 pages
Recurrent Neural Networks: Prof. Gheith Abandah
No ratings yet
Recurrent Neural Networks: Prof. Gheith Abandah
32 pages
Machine Learning Syllabus
No ratings yet
Machine Learning Syllabus
4 pages
Unit 5 - Cluster Analysis
No ratings yet
Unit 5 - Cluster Analysis
14 pages
ssw9 PS2-13 Wu
No ratings yet
ssw9 PS2-13 Wu
6 pages
Fixed Weight Competitive Networks Fixed Weight Competitive Nets
No ratings yet
Fixed Weight Competitive Networks Fixed Weight Competitive Nets
5 pages
What Is A Perceptron?
No ratings yet
What Is A Perceptron?
1 page
Neural Network Algorithm
No ratings yet
Neural Network Algorithm
2 pages
Flow Regime Prediction Using Artificial Neural Network
No ratings yet
Flow Regime Prediction Using Artificial Neural Network
8 pages
ML 06 Multiclass
No ratings yet
ML 06 Multiclass
11 pages
Student Solutions Manual for Mathematics for Economics, fourth edition
From Everand
Student Solutions Manual for Mathematics for Economics, fourth edition
Michael Hoy
No ratings yet
Data Interpretation Guide For All Competitive and Admission Exams
From Everand
Data Interpretation Guide For All Competitive and Admission Exams
Mohmmad Khaja Shareef
2.5/5 (6)