
Birla Institute of Technology & Science, Pilani

Work-Integrated Learning Programmes Division


MTech. Software Engineering at DSE (FC04, FA04_1-2021) Cluster

Second Semester 2021-2022


End-Semester Test
(EC-3 Regular)

Course No.       : DSECLZG565
Course Title     : Machine Learning
Nature of Exam   : Open Book
Weightage        : 40%
Duration         : 2 Hours
Date of Exam     : 25-09-22 (FN)
No. of Pages     = 2
No. of Questions = 8
Note:
1. Please follow all the instructions to candidates given on the cover page of the answer book.
2. All parts of a question should be answered consecutively. Each answer should start on a fresh page.
3. Assumptions made, if any, should be stated clearly at the beginning of your answer.

Q.1 The results of an election are to be predicted for candidates based on dataset D.
Three different hypotheses h1, h2 and h3 are used to predict whether a candidate
wins or loses the election. The probability of h1 given dataset D is 0.5, the
probability of h2 given dataset D is 0.3 and the probability of h3 given dataset D
is 0.2. Given a new candidate, h1 predicts that the candidate will win the
election whereas h2 and h3 predict that the candidate will lose the election. What is
the most probable classification of the new candidate? [3 Marks]

Solution:

+ = win, - = lose

P(h1|D) = .5, P(−|h1) = 0, P(+|h1) = 1


P(h2|D) = .3, P(−|h2) = 1, P(+|h2) = 0
P(h3|D) = .2, P(−|h3) = 1, P(+|h3) = 0

P(+|D) = 1*0.5+0*0.3+0*0.2=0.5 [1.5M]


P(-|D) = 0*0.5+1*0.3+1*0.2=0.5 [1.5M]
Winning and losing are equiprobable, so no single classification is most probable.
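The computation above can be sketched directly; this is a minimal illustration using the posteriors and per-hypothesis predictions from the solution.

```python
# Bayes-optimal classification: sum each hypothesis's vote for a class,
# weighted by its posterior P(h|D). Values are taken from the solution.
posteriors = {"h1": 0.5, "h2": 0.3, "h3": 0.2}   # P(h|D)
p_win = {"h1": 1.0, "h2": 0.0, "h3": 0.0}        # P(+|h): only h1 predicts a win

p_plus = sum(posteriors[h] * p_win[h] for h in posteriors)
p_minus = sum(posteriors[h] * (1.0 - p_win[h]) for h in posteriors)
# p_plus and p_minus both come to 0.5: win and lose are equiprobable.
```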

Q.2 Use the kernel trick to find the equation of the hyperplane using a nonlinear SVM.
Positive Points: {(1,0), (3,0), (5,0)} Negative Points: {(0,0), (2,0), (4,0), (6,0)}. Plot
the points before and after the transformation. [5 Marks]

Solution:
φ(x) = x mod 2 [3M]
Equation of hyperplane: y = 0.5 [2M]
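An illustrative check of the keyed transformation, applied to the first coordinate (all points lie on the x-axis): odd (positive) points map to 1 and even (negative) points map to 0, so the line y = 0.5 separates them in the transformed space.

```python
# phi(x) = x mod 2 maps the positive (odd) points above y = 0.5 and the
# negative (even) points below it.
positives = [(1, 0), (3, 0), (5, 0)]
negatives = [(0, 0), (2, 0), (4, 0), (6, 0)]

phi = lambda x1: x1 % 2
assert all(phi(x1) > 0.5 for x1, _ in positives)
assert all(phi(x1) < 0.5 for x1, _ in negatives)
```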

Q.3 Suppose we have the following one-dimensional data: -4.0, -3.0, -2.0, -1.0, 0.0,
1.0, 2.0, 3.0, 4.0. Use the EM algorithm to find a Gaussian mixture model consisting
of exactly one Gaussian that fits the data. Assume that the initial mean of the
Gaussian is 10.0 and the initial variance is 1.0. [7 Marks]
Answer
First we note that π1 = 1 since there is only one Gaussian in the mixture model.
Computing the posterior probabilities P(z_n1 = 1 | x_n) = γ(z_n1), we see that the
posterior probabilities are all equal to 1, since both the numerator and the
denominator are equal to π1 N(x_n | μ1, Σ1). [1.5M]

Also N1 = Σ_n γ(z_n1) = N, the number of data points. [0.5M]


This completes the E-step.

In the M-step, we see that


μ1_new = (1/N1) Σ_{n=1}^{N} γ(z_n1) x_n = (Σ_{n=1}^{N} x_n)/N
       = (-4.0 + -3.0 + -2.0 + -1.0 + 0.0 + 1.0 + 2.0 + 3.0 + 4.0)/9 = 0.0 [2M]

and Σ1_new = (1/N1) Σ_{n=1}^{N} (x_n - μ1_new)(x_n - μ1_new)^T.
Here the x_n and μ1_new are 1×1 matrices, so the expression for Σ1_new simplifies to
(Σ_{n=1}^{N} x_n^2)/N = 2*(4.0^2 + 3.0^2 + 2.0^2 + 1.0^2)/9 = 60/9 ≈ 6.67. [2M]

In the next iteration the E-step computes the posterior probabilities to be 1, and
the M-step computes the same mean and covariance matrix as above, so the
algorithm converges. [1M]
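The two M-step updates above can be verified with a short sketch; with a single component every responsibility γ is 1, so the updates reduce to the sample mean and biased variance.

```python
# One EM iteration for a one-component Gaussian mixture on the given data.
data = [-4.0, -3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0, 4.0]

gamma = [1.0] * len(data)        # E-step: all responsibilities equal 1
N1 = sum(gamma)                  # effective count N1 = N = 9
mu_new = sum(g * x for g, x in zip(gamma, data)) / N1
var_new = sum(g * (x - mu_new) ** 2 for g, x in zip(gamma, data)) / N1
# mu_new is 0.0 and var_new is 60/9, matching the worked answer.
```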

Q.4 A dataset 𝐷 consists of the results of 100 independent coin tosses of the same
coin where 30 turn out to be heads and 70 turn out to be tails. Let 𝑝 be the
probability of tossing a head. How many datasets on 100 coin tosses are possible
which have the same likelihood as the given dataset 𝐷? Determine the maximum
likelihood estimate of the parameter 𝑝 using appropriate calculations. [5 Marks]
Answer: The likelihood of the given data D given p is P(D|p) = p^30 (1 - p)^70.

Any other dataset D' with the same number of heads and tails as D will have the
same likelihood given the same probability p of tossing a head. [1M]

There are nCr ways of choosing r locations out of n to place heads.

Therefore the number of datasets that have the same likelihood as D is
100C30 = 100!/((70!)(30!)). [2M]

To calculate the value of p that maximizes the likelihood, we take the log:

log P(D|p) = 30 log p + 70 log(1 - p).

Then taking the derivative of log P(D|p) and setting it to zero,
we get 0 = 30/p - 70/(1 - p).
Solving for p we get p = 0.3. [2M]
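A numerical check of both parts: math.comb counts the equally likely datasets, and a simple grid search (the grid itself is illustrative) recovers the closed-form maximizer p = 0.3.

```python
import math

# Number of datasets with exactly 30 heads in 100 tosses.
n_datasets = math.comb(100, 30)
assert n_datasets == math.factorial(100) // (math.factorial(70) * math.factorial(30))

# Log-likelihood 30*log(p) + 70*log(1-p), maximized at p = 0.3.
loglik = lambda p: 30 * math.log(p) + 70 * math.log(1 - p)
grid = [i / 1000 for i in range(1, 1000)]
p_hat = max(grid, key=loglik)   # grid search lands on 0.3
```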

Q.5 Consider the following dataset [5 Marks]

X1   X2   Y
-1   -1   Positive class
-1    1   Negative class
 1   -1   Negative class
 1    1   Positive class
Answer the following with respect to the above dataset:
a) Comment on the separability of the dataset with an explanation. [0.5M]
b) Provide a 1-dimensional transformation of this dataset for each of the linearly
and non-linearly separable cases, with justification. [2M]
c) Model the above dataset with an Artificial Neural Network which has two hidden
layers, each of which contains two units. Assume that the weights in each layer are set
to 1 so that the top unit in each layer applies sigmoid activation to the sum of its inputs
and the bottom unit in each layer applies tanh activation to the sum of its inputs.
Finally, the single output node applies ReLU activation to the sum of its two inputs.
Write the output of this neural network in closed form as a function of x1 and x2. (no
need to calculate exact values) [2.5M]

Solution:
In (b), if only the transformation is given with no explanation, deduct 0.5M in each case.
In (c), any mistake in the function - 0M.
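For part (b), one possible 1-D transformation (illustrative; the keyed answer may use a different one) is the product feature φ(x1, x2) = x1·x2. The dataset is XNOR-like and therefore not linearly separable in 2-D, but the product maps both positive points to +1 and both negative points to -1, which a threshold at 0 separates.

```python
# Hypothesized 1-D transformation for part (b): phi(x1, x2) = x1 * x2.
data = [((-1, -1), "pos"), ((-1, 1), "neg"), ((1, -1), "neg"), ((1, 1), "pos")]

phi = lambda x1, x2: x1 * x2
for (x1, x2), label in data:
    # Positive points map to +1, negative points to -1: separable at 0.
    assert (phi(x1, x2) > 0) == (label == "pos")
```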
Q.6 Solve the following and find the equation of the hyperplane using the linear Support
Vector Machine method. Positive Points: {(3, 2), (4, 3), (2, 3), (3, -1)} Negative Points:
{(1, 0), (-1, -1), (0, 2), (-1, 2)} [5 Marks]
A. Find the support vectors.
B. Determine the equation of the hyperplane if it changes, or give a reason if it does
not change, for the following two cases:
a. If the point (2, 3) is removed.
b. If the point (5, 4) is added.
Solution:
A. Support vectors are (2,3), (1,0) and (3,-1) [1M - if one of the SVs is wrong then 0M]
Equation of the decision hyperplane - 2M
A solution obtained using the Lagrange method or by geometrical inspection is
acceptable.
B. a. If the point (2, 3) is removed [1M - 0.5M if no reason is given]:
The equation of the decision hyperplane will change, as the removed point is a
support vector.

b. If the point (5, 4) is added [1M - 0.5M if no reason is given]: The equation of the
decision hyperplane will not change, as the added point is not a support vector.
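As a check, geometric inspection of the keyed support vectors gives the candidate boundary 4x1 + x2 = 7.5, midway between the parallel margin lines through (2,3), (3,-1) and through (1,0). The specific equation is derived here by inspection, not stated in the key; the sketch verifies separation and the margins.

```python
# Candidate hyperplane 4*x1 + x2 = 7.5, consistent with the keyed support
# vectors: the positive margin line 4*x1 + x2 = 11 passes through (2,3) and
# (3,-1); the negative margin line 4*x1 + x2 = 4 passes through (1,0).
positives = [(3, 2), (4, 3), (2, 3), (3, -1)]
negatives = [(1, 0), (-1, -1), (0, 2), (-1, 2)]

f = lambda x1, x2: 4 * x1 + x2 - 7.5
assert all(f(*p) > 0 for p in positives)      # all positives on one side
assert all(f(*n) < 0 for n in negatives)      # all negatives on the other
assert f(2, 3) == f(3, -1) == 3.5             # positive SVs on one margin
assert f(1, 0) == -3.5                        # negative SV on the other margin
```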

Q.7 Consider training a boosting classifier using decision stumps on the following data set.
Circle the examples which will have their weights increased at the end of each
iteration. Run the iteration till zero training error is achieved. [3 Marks]

Solution:
Number of iterations: 3
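The dataset figure for this question is not reproduced here, so the following is a generic sketch of the AdaBoost weight update that decides which examples get circled: examples misclassified by the current stump have their weights increased. The round shown uses hypothetical data, not the exam's figure.

```python
import math

def update_weights(weights, correct):
    """One AdaBoost round. correct[i] is True if example i was classified
    correctly by this round's stump. Returns the new normalized weights;
    misclassified examples end up with total weight 0.5."""
    eps = sum(w for w, c in zip(weights, correct) if not c)   # weighted error
    alpha = 0.5 * math.log((1 - eps) / eps)                   # stump weight
    new = [w * math.exp(-alpha if c else alpha) for w, c in zip(weights, correct)]
    z = sum(new)                                              # normalizer
    return [w / z for w in new]

# Hypothetical round: 5 equally weighted examples, one misclassified.
w = update_weights([0.2] * 5, [True, True, False, True, True])
# The misclassified example's weight rises to 0.5; each correct one drops to 0.125.
```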
Q.8 Consider the following dataset. [7 Marks]
x1:           -1  -1  -1  -1   0   4   4
x2:            2   1  -1  -2   0   2  -2
Class label:   1   1   1   1   1   2   2
The given class label exhibits two natural clusters formed in the given dataset and acts as
a ground truth. Now remove class labels and use the K-means clustering algorithm to find
the 2 clusters by initializing two cluster centres as follows:
A. C1(-1,2) and C2(0,0)
B. C1(-0.5,0) and C2(0,0)
For both the above cases, run the algorithm till the centres do not change (convergence
criterion) and give the final cluster assignment [2+2M]. In each case, comment on the
correctness of the cluster assignment [1+1M]. Also, comment in no more than 20 words on
the drawback of k-means depicted in the above two cases. [1M]
Solution:
A.
Initial centres: c1 = (-1, 2), c2 = (0, 0)

Point:     (-1,2)  (-1,1)  (-1,-1)  (-1,-2)  (0,0)   (4,2)   (4,-2)
d to c1:   0       1       3        4        2.236   5       6.403
d to c2:   2.236   1.414   1.414    2.236    0       4.472   4.472
cluster:   1       1       2        2        2       2       2

New centres: c1 = (-1, 1.5), c2 = (1.2, -0.6)

d to c1:   0.5     0.5     2.5      3.5      1.803   5.025   6.103
d to c2:   3.406   2.720   2.236    2.608    1.342   3.821   3.130
cluster:   1       1       2        2        2       2       2

New centres: c1 = (-1, 1.5), c2 = (1.2, -0.6) (unchanged)
Comment on cluster assignment:
The algorithm has converged after 2 iterations, but the cluster assignment does not
depict the natural clusters in the dataset as given by the ground truth.
B.
Initial centres: c1 = (-0.5, 0), c2 = (0, 0)

Point:     (-1,2)  (-1,1)  (-1,-1)  (-1,-2)  (0,0)   (4,2)   (4,-2)
d to c1:   2.062   1.118   1.118    2.062    0.5     4.924   4.924
d to c2:   2.236   1.414   1.414    2.236    0       4.472   4.472
cluster:   1       1       1        1        2       2       2

New centres: c1 = (-1, 0), c2 = (2.667, 0)

d to c1:   2       1       1        2        1       5.385   5.385
d to c2:   4.177   3.801   3.801    4.177    2.667   2.404   2.404
cluster:   1       1       1        1        1       2       2

New centres: c1 = (-0.8, 0), c2 = (4, 0)

d to c1:   2.010   1.020   1.020    2.010    0.8     5.2     5.2
d to c2:   5.385   5.099   5.099    5.385    4       2       2
cluster:   1       1       1        1        1       2       2

New centres: c1 = (-0.8, 0), c2 = (4, 0) (unchanged)

Comment on cluster assignment:

The algorithm has converged after 3 iterations and the cluster assignment shows the
natural clusters in the dataset as given by the ground truth.
The drawback of k-means demonstrated by the above two cases:
The k-means algorithm is sensitive to the initialization of the cluster centres.
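Both runs can be reproduced with a minimal k-means sketch using only the data and initial centres from the question (assumes no cluster ever becomes empty, which holds here):

```python
points = [(-1, 2), (-1, 1), (-1, -1), (-1, -2), (0, 0), (4, 2), (4, -2)]
truth = [1, 1, 1, 1, 1, 2, 2]   # ground-truth labels from the question

def kmeans(points, c1, c2):
    """Two-cluster k-means; iterate until the centres stop changing."""
    while True:
        # Assign each point to the nearer centre (squared distances suffice).
        assign = [1 if (x - c1[0])**2 + (y - c1[1])**2
                       <= (x - c2[0])**2 + (y - c2[1])**2 else 2
                  for x, y in points]
        def mean(k):
            pts = [p for p, a in zip(points, assign) if a == k]
            return (sum(x for x, _ in pts) / len(pts),
                    sum(y for _, y in pts) / len(pts))
        new1, new2 = mean(1), mean(2)
        if (new1, new2) == (c1, c2):   # centres unchanged: converged
            return assign
        c1, c2 = new1, new2

run_a = kmeans(points, (-1, 2), (0, 0))    # case A: misses the ground truth
run_b = kmeans(points, (-0.5, 0), (0, 0))  # case B: recovers the ground truth
```

The two runs converge to different partitions from different starting centres, which is exactly the initialization sensitivity the question asks about.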
