Data Mining - 2023 Solutions
Data Mining - 2023 Solutions
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-01-25, 21:45 IST
1) Which of the following is usually the last step in the data mining process? 1 point
Week 1 ()
a. Visualization
Lecture 1 : Introduction,
b. Preprocessing
Knowledge Discovery
Process (unit?
c. Modelling
unit=17&lesson=18) d. Deployment
Quiz: Week 1 :
3) HTML links are an example of: 1 point
Assignment 1
(assessment?name=99)
a) Record data
Week 01: Feedback Form b) Ordered data
(unit?unit=17&lesson=24)
c) Graph data
Week 01: Assignment d) None of the above
Solution (unit?
unit=17&lesson=104) Yes, the answer is correct.
Score: 1
Week 2 () Accepted Answers:
c) Graph data
Week 3 ()
4) Name of a place, can be considered an attribute of type? 1 point
Week 4 ()
a) Nominal
b) Ordinal
Week 5 ()
c) Interval
Week 6 () d) Ratio
DOWNLOAD VIDEOS () 5) A store sells 10 items. Maximum possible number of candidate 3-itemsets is: 1 point
a) 120
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=17&assessment=99 1/3
3/21/24, 9:58 PM Data Mining - - Unit 3 - Week 1
Problem Solving b) 6
Session () c) 15
d) 56
6) If a record data matrix has reduced number of columns after a transformation, the transformation has performed: 1 point
a) Data Sampling
b) Dimensionality Reduction
c) Noise Cleaning
d) Discretization
a) 1
b) 0.5
c) 0.25
d) 0
a) 1
b) 0.5
c) 0.25
d) 0.75
a) 2/3
b) 2/2
c) 1/4
d) 3/4
a) 2/3
b) 1
c) 0
d) 0.5
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=17&assessment=99 2/3
3/21/24, 9:58 PM Data Mining - - Unit 3 - Week 1
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=17&assessment=99 3/3
3/21/24, 9:58 PM Data Mining - - Unit 4 - Week 2
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
a. 2N-1
Week 1 ()
b. 2N-1
Week 2 () c. N/2
d. N-1
Lecture 6: Rule
generation (unit? No, the answer is incorrect.
Score: 0
unit=26&lesson=27)
Accepted Answers:
Lecture 7: Classification b. 2N-1
(unit?unit=26&lesson=28)
2) An association rule is valid if it satisfies: 1 point
Lecture 8: Decision Tree -
I (unit?
a. Support criteria
unit=26&lesson=29)
b. Confidence criteria
Lecture 9: Decision Tree -
c. Both support and confidence criteria
II (unit?
unit=26&lesson=30) d. None of these
Week 7 () 5) Consider three itemsets I1={bat, ball, wicket}, I2={bat, ball}, I3={bat}. Which of the following statements are correct? 1 point
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=26&assessment=101 1/3
3/21/24, 9:58 PM Data Mining - - Unit 4 - Week 2
For questions 6-10, consider the following small database of four transactions. The minimum support is 60% and the minimum
confidence is 80%.
Trans_id Itemlist
T1 {F, A, D, B}
T2 {D, A, C, E, B}
T3 {C, A, B, E}
T4 {B, A, D}
6) The 1-itemsets that satisfy the support criteria are: 1 point
a. A -> B
b. B -> A
c. A -> D
d. D -> A
a. A -> DB
b. D -> AB
c. AD -> B
d. DB -> A
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=26&assessment=101 2/3
3/21/24, 9:58 PM Data Mining - - Unit 4 - Week 2
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=26&assessment=101 3/3
3/21/24, 9:57 PM Data Mining - - Unit 5 - Week 3
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-02-13, 22:19 IST
1) 1 point
Week 1 ()
Week 2 ()
Week 3 ()
Quiz: Week 3 :
Assignment 3
(assessment?
name=102)
Week 6 ()
Week 7 ()
Week 8 ()
DOWNLOAD VIDEOS ()
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=36&assessment=102 1/4
3/21/24, 9:57 PM Data Mining - - Unit 5 - Week 3
3) 2 points
Problem Solving
Session ()
A.
B.
C.
D.
4) 1 point
A.
B.
C.
D.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=36&assessment=102 2/4
3/21/24, 9:57 PM Data Mining - - Unit 5 - Week 3
5) 1 point
A.
B.
C.
D.
6) 1 point
A.
B.
C.
D.
7) 1 point
A.
B.
C.
D.
8) 1 point
A.
B.
C.
D.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=36&assessment=102 3/4
3/21/24, 9:57 PM Data Mining - - Unit 5 - Week 3
9) 1 point
A.
B.
C.
D.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=36&assessment=102 4/4
3/21/24, 9:57 PM Data Mining - - Unit 6 - Week 4
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
(a) It always provides zero error when class distributions are known
(b)It always provides the lowest possible error when class distributions are known
(c) It may not always provide the lowest possible error when class distributions are known
(d) It always provides the lowest possible error when class distributions are estimated
2) Let A be an example, and C be a class. The probability P(C|A) is known as: 1 point
3) Let A be an example, and C be a class. The probability P(C) is known as: 1 point
4) Consider a binary classification problem with two classes C1 and C2. Class labels of ten other training set instances sorted in 1 point
increasing order of their distance to an instance x are as follows: {C1, C2, C1, C2, C2, C2, C1, C2, C1, C2}. How will a K=3 nearest neighbor
classifier classify x?
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=45&assessment=106 1/3
3/21/24, 9:57 PM Data Mining - - Unit 6 - Week 4
5) According to the following graph, what should be the appropriate value of K if KNN algorithm is used? 1 point
Course outline
Week 0 ()
Week 1 ()
Week 2 ()
Week 3 ()
Week 4 ()
Lecture 17 : K Nearest
Neighbor I (unit?
unit=45&lesson=46)
Lecture 18 : K Nearest
Neighbor II (unit? (a) 5
unit=45&lesson=47)
(b) 10
Lecture 19: K Nearest
(c) 15
Neighbor III (unit?
unit=45&lesson=48)
(d) 20
Week 5 () 7) Which of the following will be Manhattan Distance between the two data point A(1,3) and B(2,3)? 1 point
Week 6 () (a) 1
(b) 2
Week 7 () (c) 4
(d) 8
Week 8 ()
Yes, the answer is correct.
Score: 1
DOWNLOAD VIDEOS ()
Accepted Answers:
(a) 1
Problem Solving
Session ()
8) You are given the following set of training examples where x and y are the two inputs and Class is the target. What would be the 1 point
target class of a data point x=1, y=1 using Euclidean distance in 3-NN?
Y Class
X
-1 1 -
0 1 +
0 2 -
1 -1 -
1 0 +
1 2 +
2 2 -
2 3 +
(a) Class +
(b) Class –
(c) None of the above
(d) Can’t be determined
N h i i
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=45&assessment=106 2/3
3/21/24, 9:57 PM Data Mining - - Unit 6 - Week 4
(a) Class +
(b) Class –
(c) None of the above
(d) Can’t be determined.
10) In the following figure you are given the distances between the two points A(x1,y1) and B(x2,y2). 1 point
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=45&assessment=106 3/3
3/21/24, 9:57 PM Data Mining - - Unit 7 - Week 5
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-03-01, 21:21 IST
1) 1 point
Week 1 ()
Week 2 ()
Week 3 ()
Week 4 ()
Week 5 ()
A.
Lecture 22: Support B.
Vector Machine I (unit? C.
unit=54&lesson=55)
D.
Lecture 23: Support
Yes, the answer is correct.
Vector Machine II (unit? Score: 1
unit=54&lesson=56)
Accepted Answers:
Lecture 24: Support B.
Vector Machine III (unit?
unit=54&lesson=57) 2) 1 point
Week 7 ()
Week 8 ()
DOWNLOAD VIDEOS ()
A.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=54&assessment=108 1/4
3/21/24, 9:57 PM Data Mining - - Unit 7 - Week 5
Problem Solving B.
Session () C.
D.
4) 1 point
A.
B.
C.
D.
5) 1 point
A.
B.
C.
D.
6) 1 point
A.
B.
C.
D.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=54&assessment=108 2/4
3/21/24, 9:57 PM Data Mining - - Unit 7 - Week 5
7) 1 point
A.
B.
C.
D.
8) 1 point
A.
B.
C.
D.
9) 1 point
A.
B.
C.
D.
10) 1 point
A.
B.
C.
D.
S 1
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=54&assessment=108 3/4
3/21/24, 9:57 PM Data Mining - - Unit 7 - Week 5
Score: 1
Accepted Answers:
A.
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=54&assessment=108 4/4
3/21/24, 9:56 PM Data Mining - - Unit 8 - Week 6
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-03-06, 21:23 IST
1) Sufficient Number of output nodes required in an ANN used for two-class classification problem is: 1 point
Week 1 ()
a. Random number
Week 2 () b. Same as number of input nodes
c. 1
Week 3 ()
d. 2
Week 6 ()
2) How are the weights and biases initialized in an ANN in general? 1 point
Lecture 27: Kernel
a. Can be initialized randomly
Machines (unit?
unit=63&lesson=64) b. Always initialized to zero
c. Always initialized to infinity
Lecture 28: Artificial
Neural Networks I (unit? d. Always initialized as 1
unit=63&lesson=65)
Yes, the answer is correct.
Score: 1
Lecture 29: Artificial
Neural Networks II (unit? Accepted Answers:
unit=63&lesson=66) a. Can be initialized randomly
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=63&assessment=111 1/3
3/21/24, 9:56 PM Data Mining - - Unit 8 - Week 6
6) A neuron with 3 inputs has the weight vector [0.2 -0.1 0.1]^T and a bias θ = 0. If the input vector is X = [0.2 0.4 0.2]^T then the 1 point
total input to the neuron is:
a. 0.2
b. 0.02
c. 0.4
d. 0.10
7) A neural Network given below takes two binary inputs X1, X2 ϵ {0,1} and the activation function for each neuron is the binary 1 point
threshold function (g(a)= 1 if a >0; 0 otherwise). Which of the following logical functions does it compute?
a. AND
b. NAND
c. XOR
d. NOR
8) The neural network given bellow takes two binary valued inputs x1, x2 ϵ {0,1}, the activation function for each neuron is the binary 1 point
threshold function (g(a)= 1 if a >0; 0 otherwise). Which of the following logical functions does it compute?
a. AND
b. NAND
c. XOR
d. OR
9) The neural network given bellow takes two binary valued inputs x1, x2 ϵ {0,1} and the activation function is the binary threshold 1 point
function (h(z)=1 if z>0;0 otherwise). Which of the following logical functions does it compute?
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=63&assessment=111 2/3
3/21/24, 9:56 PM Data Mining - - Unit 8 - Week 6
a. OR
b. AND
c. NAND
d. NOR
10) Under which of the following situation would you expect overfitting to happen? 1 point
a. With training iterations error on training set as well as test set decreases
b. With training iterations error on training set decreases but test set increases
c. With training iterations error on training set as well as test set increases
d. With training iterations training set as well as test set error remains constant
Yes, the answer is correct.
Score: 1
Accepted Answers:
b. With training iterations error on training set decreases but test set increases
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=63&assessment=111 3/3
3/21/24, 9:56 PM Data Mining - - Unit 9 - Week 7
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-03-06, 22:08 IST
1) Which of the following statement is NOT true about clustering? 1 point
Week 1 ()
a. It is a supervised learning technique
Week 2 () b. It is an unsupervised learning technique
c. It is also known as exploratory data analysis
Week 3 ()
d. It groups data into homogeneous groups
Week 6 ()
2) Which of the following clustering technique start with the points as individual clusters and, at each step, merge the closest pair of 1 point
clusters
Week 7 ()
a. K-Means clustering
Lecture 32: Clustering I
(unit?unit=72&lesson=73) b. DBSCAN
c. Divisive clustering
Lecture 33: Clustering II
(unit?unit=72&lesson=74) d. Agglomerative clustering
Week 7 : Assignment
4) The Euclidean distance matrix between four 2-dimensional points, p1, p2, p3, and p4, is shown below. A possible set of co- 1 point
Solution (unit? ordinate values of these points are:
unit=72&lesson=118)
Week 8 ()
DOWNLOAD VIDEOS ()
Problem Solving
Session ()
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=72&assessment=112 1/3
3/21/24, 9:56 PM Data Mining - - Unit 9 - Week 7
6) Distance between two clusters in complete linkage clustering is defined as: 1 point
7) Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1), and p5=(0, 0.5). Euclide-an distance is the 1 point
distance function. Single linkage clustering is used to cluster the points into two clusters. The clusters are:
8) Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1), and p5=(0, 0.5). Euclide-an distance is the 1 point
distance function. Complete linkage clustering is used to cluster the points into two clus-ters. The clusters are:
9) Consider a set of five 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1), and p5=(0, 0.5). Euclidean distance is the 1 point
distance function. The k-means algorithm is used to cluster the points into two clusters. The initial cluster centers are p1 and p5. The clusters
after two iterations of k-means are:
10) Given a set of seven 2-dimensional points p1=(0, 0), p2=(5, 0), p3=(5, 1), p4=(0, 1), p5=(0, 0.5), p6=(0, 9), and p7=(5.5, 1). 1 point
Euclidean distance is the distance function. The DBSCAN algorithm is used to cluster the points. Epsilon = 1, and MinPts = 2 is used for
DBSCAN. The clusters and outliers obtained are:
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=72&assessment=112 2/3
3/21/24, 9:56 PM Data Mining - - Unit 9 - Week 7
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=72&assessment=112 3/3
3/21/24, 9:55 PM Data Mining - - Unit 10 - Week 8
(https://swayam.gov.in)
(https://swayam.gov.in/nc_details/NPTEL)
Announcements (announcements) About the Course (preview) Q&A (forum) Progress (student/home) Mentor (student/mentor)
Week 0 ()
Assignment submitted on 2023-03-22, 22:07 IST
1) Target variable in Regression is _________ 1 point
Week 1 ()
a. Continuous variable
Week 2 () b. Discrete variable
c. Character variable
Week 3 ()
d. All of the above
Week 6 ()
2) Regression is used in: 1 point
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=81&assessment=114 1/3
3/21/24, 9:55 PM Data Mining - - Unit 10 - Week 8
Week 08 Assignment
Solution (unit? x y
unit=81&lesson=119) 0 1
0.5 1.9
DOWNLOAD VIDEOS ()
1 2.5
1.25 3
Problem Solving
Session () a. 0.00
b. 0.25
c. 0.50
d. 0.51
Yes, the answer is correct.
Score: 1
Accepted Answers:
d. 0.51
6) The linear regression model y = a0 + a1x is to be fitted to the data in the table shown below. What is the optimal regression model 1 point
obtained by minimizing sum squared error?
x y
0 1
1 1.9
2 3.2
2.5 3.4
a. y = 1.01 –2.10x
b. y = 1.01 +2.10x
c. y = 1.01 – 0.98x
d. y = 1.01 + 0.98x
7) In the figures below the training instances are described by dots. The blue dotted lines indicate the actual functions and the red 1 point
lines indicate the regression model. Which of the following statement is correct?
a. 1,3
b. 2,3
c. 1,2,3
d. Eigenvalues cannot be found.
a. Multivariate regression
b. Autoregression
c. Logistic regression
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=81&assessment=114 2/3
3/21/24, 9:55 PM Data Mining - - Unit 10 - Week 8
d. Sinusoidal regression
10) In principal component analysis, the projected lower dimensional space corresponds to – 1 point
https://onlinecourses.nptel.ac.in/noc23_cs43/unit?unit=81&assessment=114 3/3