Quiz - 2 Set A
Instructions –
1. Consider the Perceptron algorithm applied to a binary classification task. Which of the following statements are correct?
(Select all that apply) (1 mark)
(A) The Perceptron algorithm is guaranteed to converge for any dataset, as long as the learning rate is appropriately
chosen.
(B) The weights are updated in the Perceptron algorithm based on the dot product of the input vector and the error
term, even when the sample is correctly classified.
(C) The bias in the Perceptron is updated when a sample is misclassified, but the magnitude of the update is not
dependent on the input values.
(D) If the Perceptron converges, it will find a decision boundary that minimizes the number of misclassifications on the
training data.
A. False
Reason: The Perceptron algorithm is guaranteed to converge only if the data is linearly separable. The basic algorithm does not require a separate learning rate, and no choice of learning rate can make it converge on data that is not linearly separable.
B. False
Reason: The weights are updated only when there is a misclassification. If the sample is classified correctly, no update
to the weights occurs. The update rule is:
w = w + error_i · x_i
C. True
Reason: The bias is updated when a sample is misclassified, using the rule:
b = b + error_i
The magnitude of the bias update is independent of the input values; it depends only on the error term. (A short Python sketch of these update rules follows this question's answers.)
D. True
Reason: If the Perceptron converges (for linearly separable data), it finds a decision boundary that minimizes the number
of misclassifications, reducing them to zero on the training data. However, it does not necessarily produce the most
optimal decision boundary, as there may be multiple valid solutions.
1 mark for correct option and correct reason
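For illustration, a minimal Python sketch of the update rules from (B) and (C) above, assuming binary labels in {0, 1}, a step activation, and error_i = y_i - y_hat (the function and variable names are illustrative, not part of the quiz):

import numpy as np

def perceptron_train(X, y, epochs=10):
    # Minimal Perceptron: weights and bias change only when a sample is misclassified.
    w = np.zeros(X.shape[1])  # weight vector
    b = 0.0                   # bias
    for _ in range(epochs):
        for x_i, y_i in zip(X, y):
            y_hat = 1 if np.dot(w, x_i) + b > 0 else 0  # step activation
            error_i = y_i - y_hat                       # 0 when correctly classified
            if error_i != 0:
                w = w + error_i * x_i  # weight update: w = w + error_i * x_i
                b = b + error_i        # bias update is independent of the input values
    return w, b

# Hypothetical usage on a tiny linearly separable AND-like dataset:
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 0, 0, 1])
print(perceptron_train(X, y))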
2. Which of the following statements regarding Random Forests and ensemble methods are not TRUE? (1 mark)
(A) Random Forests can handle both categorical and continuous variables, allowing for greater flexibility in modeling
various types of data.
(B) In a Random Forest, each tree is built in a sequential manner, where each tree depends on the results of the previous
tree.
(C) The primary advantage of using Random Forests over a single Decision Tree is the significant increase in bias while
keeping variance the same.
(D) Random Forests utilize an averaging scheme for classification tasks, where the final prediction is the average of the
predictions from all individual trees.
A. False: Random Forests can handle both categorical and continuous variables, making them versatile for different
types of datasets.
B. True: In a Random Forest, each tree is built independently and in parallel; there is no dependency between the trees,
which distinguishes them from boosting methods, where models are built sequentially.
C. True: The primary advantage of using Random Forests is a reduction in variance without significantly increasing
bias. They typically improve generalization compared to a single Decision Tree.
D. True: For classification tasks, Random Forests utilize a majority voting scheme, where the final prediction is based
on the mode of predictions from all individual trees, not an average.
1 mark for correct option
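To make the voting scheme in (D) concrete, here is a minimal Python sketch (the per-tree predictions below are made-up placeholders, not results from a real forest) of combining individual tree predictions by majority vote for a classification task:

from collections import Counter

def majority_vote(tree_predictions):
    # Final prediction is the mode (most common class) of the individual tree predictions.
    return Counter(tree_predictions).most_common(1)[0][0]

# Hypothetical class labels predicted by five independently built trees for one sample:
print(majority_vote(["Yes", "No", "Yes", "Yes", "No"]))  # -> Yes

For regression tasks, by contrast, the individual tree outputs would typically be averaged.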
3. When growing a decision tree using the ID3 algorithm, which of the following is TRUE about the role of information
gain? (1 mark)
(a) Information gain measures how well a given attribute separates the training examples according to their target
classification.
(b) The attribute with the highest information gain is always selected at each step while growing the tree.
(c) Information gain is based on the reduction in entropy after the data is split on an attribute.
(d) Information gain ensures that the tree will never overfit the training data.
A. True: Information gain measures how well a given attribute separates the training examples according to their target
classification. It evaluates the effectiveness of an attribute in classifying the training data.
B. True: The attribute with the highest information gain is always selected at each step while growing the tree in
the ID3 algorithm. This ensures that the attribute that best reduces uncertainty is chosen.
C. True: Information gain is calculated based on the reduction in entropy after the data is split on an attribute.
It measures how much information a feature contributes towards classifying the data.
D. False: Information gain does not prevent overfitting; a tree grown greedily with ID3 can still overfit the training data, which is why pruning is often applied afterwards.
A: False (because the attribute may become relevant further down the tree when the records are restricted to some
value of another attribute) (e.g. XOR)
B: False for same reason
C: True because the attributes are categorical and can each be split only once
D: False because the tree may be unbalanced
1 mark for correct option and correct reason
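Because entropy and information gain reappear in Question 7, here is a minimal Python sketch of both quantities; the function names and the toy split are illustrative only, not part of the quiz:

import math
from collections import Counter

def entropy(labels):
    # Entropy of a list of class labels: -sum over classes of p * log2(p).
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())

def information_gain(labels, groups):
    # Reduction in entropy after splitting `labels` into `groups` on some attribute.
    total = len(labels)
    weighted = sum(len(g) / total * entropy(g) for g in groups)
    return entropy(labels) - weighted

# Toy split of a 9-Yes / 5-No dataset into two groups by a hypothetical attribute:
labels = ["Yes"] * 9 + ["No"] * 5
groups = [["Yes"] * 6 + ["No"] * 1, ["Yes"] * 3 + ["No"] * 4]
print(round(entropy(labels), 3), round(information_gain(labels, groups), 3))  # roughly 0.94 and 0.152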
6. Consider a Naïve Bayes classifier with 3 boolean input variables, X1, X2, and X3, and one boolean output, Y. (5 marks)
1. How many parameters must be estimated to train such a Naïve Bayes classifier? (2.5 marks)
2. How many parameters would have to be estimated to learn the above classifier if we do not make the Naïve Bayes
conditional independence assumption? (2.5 marks)
Solutions:
a. For a naive Bayes classifier, we need to estimate parameters:
P (Y = 1),
P (X1 = 1 | Y = 0),
P (X2 = 1 | Y = 0),
P (X3 = 1 | Y = 0),
P (X1 = 1 | Y = 1),
P (X2 = 1 | Y = 1),
P (X3 = 1 | Y = 1).
The remaining probabilities can be obtained from the constraint that probabilities sum to 1 (for example, P (X1 = 0 | Y = 0) =
1 − P (X1 = 1 | Y = 0)). So we need to estimate 7 parameters.
1 mark for correct parameters and 1.5 for correct parameter number
b. Without the conditional independence assumption, we still need to estimate P (Y = 1). (0.5 mark)
For Y = 1, we need the probability of every combination of (X1, X2, X3), i.e., 2^3 = 8 possible assignments. (1 mark)
Considering the constraint that the probabilities sum up to 1, we must estimate 2^3 − 1 = 7 parameters for Y = 1, and likewise 7 for Y = 0.
Therefore, the total number of parameters is 1 + 2(2^3 − 1) = 15. (1 mark)
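As a quick arithmetic check, a minimal Python sketch of the two parameter counts derived above (variable names are illustrative):

n_features = 3      # boolean inputs X1, X2, X3
n_class_values = 2  # boolean output Y takes two values

# Naive Bayes: P(Y = 1) plus P(Xi = 1 | Y = y) for each feature and each value of Y.
naive_params = 1 + n_class_values * n_features            # = 7

# Without conditional independence: P(Y = 1) plus, for each value of Y,
# a full joint table over (X1, X2, X3) minus one entry (probabilities sum to 1).
full_params = 1 + n_class_values * (2 ** n_features - 1)  # = 15

print(naive_params, full_params)  # 7 15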
7. Using the dataset provided below, construct a decision tree to predict whether a person will play tennis or not. The
attributes available are: Outlook, Temperature, Humidity, and Wind.
The target variable is whether the person will Play Tennis (Yes or No). The dataset is as follows:
1. Calculate the initial entropy for the target variable Play Tennis. (2 marks)
2. Calculate the information gain for the attributes: Outlook, Temperature, Humidity, and Wind. Which attribute
would be chosen as the root of the decision tree based on the ID3 algorithm? (3 marks)
Day Outlook Temperature Humidity Wind Play Tennis
1 Sunny Hot High Weak No
2 Sunny Hot High Strong No
3 Overcast Hot High Weak Yes
4 Rain Mild High Weak Yes
5 Rain Cool Normal Weak Yes
6 Rain Cool Normal Strong No
7 Overcast Cool Normal Strong Yes
8 Sunny Mild High Strong No
9 Sunny Cool Normal Weak Yes
10 Rain Mild Normal Weak Yes
11 Sunny Mild Normal Strong Yes
12 Overcast Mild High Strong Yes
13 Overcast Hot Normal Weak Yes
14 Rain Mild High Strong No
1. Calculate the initial entropy for the target variable Play Tennis.
Initial Entropy of Play Tennis:
Entropy(S) = −(9/14) log2(9/14) − (5/14) log2(5/14)
Entropy(S) = −(9/14 × −0.6374) − (5/14 × −1.4854) ≈ 0.940
Initial Entropy = 0.940
2 marks for the correct answer or a correct log expression
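A quick check of this value in Python, using the 9 Yes / 5 No counts from the table:

import math

p_yes, p_no = 9 / 14, 5 / 14
initial_entropy = -p_yes * math.log2(p_yes) - p_no * math.log2(p_no)
print(round(initial_entropy, 3))  # 0.94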
2.
Information Gain for Outlook:
Entropy(Sunny) = −(2/5) log2(2/5) − (3/5) log2(3/5) ≈ 0.971
Entropy(Overcast) = 0
Entropy(Rain) = −(3/5) log2(3/5) − (2/5) log2(2/5) ≈ 0.971
Entropy(Outlook) = (5/14) × 0.971 + (4/14) × 0 + (5/14) × 0.971 ≈ 0.693
Entropy(Temperature) = (4/14) × 1 + (6/14) × 0.918 + (4/14) × 0.811 ≈ 0.911
Entropy(Humidity) = (7/14) × 0.985 + (7/14) × 0.592 ≈ 0.789