Quiz3 2024
Question 1: Write T or F for True/False in the box next to each question given below, with a brief
(1-2 sentences at most) explanation in the provided space in the box below the question. Marks will
be awarded only when the answer (T/F) and explanation both are correct. (3 x 2 = 6 marks)
1.1 To predict the label using a generative classification model, comparing the probabilities 𝑝(𝑦 = 𝑘|𝒙) for different values of 𝑘 is equivalent to comparing the class-conditional probability densities 𝑝(𝒙|𝑦 = 𝑘) for different values of 𝑘. Answer: F
𝑝(𝑦 = 𝑘|𝒙) ∝ 𝑝(𝑦 = 𝑘)𝑝(𝒙|𝑦 = 𝑘), so the posterior also incorporates the class prior (class marginal) distribution 𝑝(𝑦 = 𝑘), which comparing only the class-conditionals would ignore.
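For example (hypothetical numbers, purely for illustration): if 𝑝(𝑦 = 1) = 0.9, 𝑝(𝑦 = 2) = 0.1, 𝑝(𝒙|𝑦 = 1) = 0.2, and 𝑝(𝒙|𝑦 = 2) = 0.3, then comparing the class-conditionals alone favors class 2, but 𝑝(𝑦 = 1|𝒙) ∝ 0.9 × 0.2 = 0.18 > 0.1 × 0.3 = 0.03 ∝ 𝑝(𝑦 = 2|𝒙), so the posterior favors class 1.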
1.2 A Gaussian prior 𝑝(𝒘) = 𝒩(𝒘|𝒘0, 𝜆−1𝑰) on the weight vector 𝒘 ∈ ℝ𝐷 will cause a regularization effect and encourage the entries in 𝒘 to take small values. Answer: F
This prior corresponds to a regularizer of the form 𝜆‖𝒘 − 𝒘0‖2, which encourages each entry of the vector 𝒘 to be close to the corresponding entry in the vector 𝒘0. Only when 𝒘0 is the zero vector would the statement above be true; in general it is false.
1.3 Even though the MAP estimate is the mode of the posterior distribution, to compute the MAP estimate, it is not necessary to compute the posterior distribution. Answer: T
Recall that the posterior is 𝑝(𝜃|𝑦) = 𝑝(𝜃)𝑝(𝑦|𝜃)/𝑝(𝑦). Because the denominator (the marginal likelihood 𝑝(𝑦)) is independent of 𝜃, maximizing the posterior only requires maximizing the numerator 𝑝(𝜃)𝑝(𝑦|𝜃) (or log 𝑝(𝜃) + log 𝑝(𝑦|𝜃)), so we do not need to compute the full posterior for the maximization.
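As a minimal sketch (a hypothetical one-parameter model with a Bernoulli likelihood and a standard Gaussian prior, values chosen only for illustration), the MAP estimate can be found by maximizing the unnormalized log-posterior, without ever computing 𝑝(𝑦):

import numpy as np
from scipy.optimize import minimize_scalar

# Toy observations; likelihood is Bernoulli with success probability sigmoid(theta)
y = np.array([1, 0, 1, 1, 0, 1])

def neg_log_joint(theta):
    p = 1.0 / (1.0 + np.exp(-theta))                      # sigmoid(theta)
    log_lik = np.sum(y * np.log(p) + (1 - y) * np.log(1 - p))
    log_prior = -0.5 * theta ** 2                         # log N(theta | 0, 1), up to a constant
    return -(log_lik + log_prior)                         # evidence p(y) never appears

theta_map = minimize_scalar(neg_log_joint).x              # mode of the (unnormalized) posterior
print(theta_map)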
Question 2: Answer the following questions concisely in the space provided below the question.
2.1 Consider the RBF kernel 𝑘(𝒙𝑖, 𝒙𝑗) = exp(−𝛾‖𝒙𝑖 − 𝒙𝑗‖2) where 𝒙𝑖 and 𝒙𝑗 are 𝐷-dimensional
inputs. Consider two cases: (1) when bandwidth hyperparameter 𝛾 is set as very-very large,
and (2) when 𝛾 is set as very-very small. For each of these two cases, answer (with brief
justification) whether the resulting kernel function would be practically useful. (4 marks)
(1) When 𝛾 is very-very large, 𝑘(𝒙𝑖 , 𝒙𝑗 ) will be nonzero (will equal 1) only when 𝒙𝑖 and 𝒙𝑗 are
nearly identical. For all other pairs of inputs, the kernel will give 0 similarity.
(2) When 𝛾 is very-very small, 𝑘(𝒙𝑖 , 𝒙𝑗 ) will be close to 1 for all pairs of inputs, thus treating
all pairs of inputs as equally similar to each other.
Clearly, neither of these two extreme cases is desirable.
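A minimal numerical sketch (toy 2-D inputs chosen only for illustration) of these two extremes:

import numpy as np

def rbf(xi, xj, gamma):
    # RBF kernel k(xi, xj) = exp(-gamma * ||xi - xj||^2)
    return np.exp(-gamma * np.sum((xi - xj) ** 2))

x1, x2 = np.array([0.0, 0.0]), np.array([0.5, 0.5])

print(rbf(x1, x2, gamma=1e6))   # ~0: with a huge gamma, distinct inputs look totally dissimilar
print(rbf(x1, x1, gamma=1e6))   # 1.0: only (near-)identical inputs get nonzero similarity
print(rbf(x1, x2, gamma=1e-6))  # ~1: with a tiny gamma, all pairs of inputs look equally similar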
2.2 Briefly explain why using kernels with the landmarks approach or the random features
approach is faster at test time than using kernels in the standard manner? (3 marks)
When using the landmarks or random features approach, we use the kernel to construct an 𝐿-dimensional feature representation 𝜓(𝒙𝑛) and train a linear model on these representations to get a weight vector 𝒘 that is 𝐿-dimensional. Thus, for a test input 𝒙∗, the cost of computing the prediction 𝒘⊤𝜓(𝒙∗) is also 𝑂(𝐿). In contrast, when using the kernel in the standard manner, this cost is 𝑂(𝑁), which can be very high if the number of training inputs is very large.
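A minimal sketch (a hypothetical landmark-based feature map plugged into ridge regression; names and values are illustrative) contrasting the two prediction costs:

import numpy as np

def rbf(A, B, gamma=1.0):
    # Pairwise RBF kernel matrix between the rows of A and the rows of B
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

rng = np.random.default_rng(0)
N, D, L = 1000, 5, 20
X, y = rng.normal(size=(N, D)), rng.normal(size=N)
Z = X[rng.choice(N, L, replace=False)]            # L landmarks chosen from the training inputs

# Landmarks: psi(x) = [k(x, z_1), ..., k(x, z_L)], then a linear (here ridge) model on psi
Psi = rbf(X, Z)                                   # N x L feature matrix
w = np.linalg.solve(Psi.T @ Psi + 1e-3 * np.eye(L), Psi.T @ y)

x_star = rng.normal(size=(1, D))
pred_fast = rbf(x_star, Z) @ w                    # needs only L kernel evaluations: O(L)

# Standard kernel method (kernel ridge regression): needs k(x_star, x_n) for all N inputs: O(N)
alpha = np.linalg.solve(rbf(X, X) + 1e-3 * np.eye(N), y)
pred_standard = rbf(x_star, X) @ alpha
print(pred_fast, pred_standard)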
2.3 Given a dataset 𝑿 as the 𝑁 × 𝐷 input matrix with 𝑁 inputs and 𝐷 features, write down the 𝐾-means hard-clustering problem for this dataset in the form of an equivalent matrix factorization problem, clearly specifying the meanings of the variables involved in the matrix factorization, their dimensions, and constraints on them, if any. (4 marks)
{𝒁̂, 𝝁̂} = argmin𝒁,𝝁 ‖𝑿 − 𝒁𝝁‖2 (squared Frobenius norm). Here 𝒁 is the 𝑁 × 𝐾 matrix whose row 𝑛 (𝒛𝑛) is a one-hot vector denoting which cluster the input 𝒙𝑛 belongs to, and 𝝁 is the 𝐾 × 𝐷 matrix whose row 𝑘 (𝝁𝑘) is the mean of the 𝑘th cluster.
Constraints on 𝒛𝑛 : Must be a one-hot vector
Constraints on 𝝁𝑘 : None
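A minimal sketch (toy data and a fixed hard assignment, purely for illustration) verifying that the matrix-factorization objective equals the usual 𝐾-means sum of squared distances:

import numpy as np

rng = np.random.default_rng(0)
N, D, K = 6, 2, 3
X = rng.normal(size=(N, D))
labels = np.array([0, 1, 2, 0, 1, 2])            # a hard cluster assignment for each input

Z = np.eye(K)[labels]                            # N x K one-hot assignment matrix
mu = np.stack([X[labels == k].mean(axis=0) for k in range(K)])  # K x D matrix of cluster means

frob_obj = np.sum((X - Z @ mu) ** 2)             # ||X - Z mu||^2 (squared Frobenius norm)
kmeans_obj = sum(np.sum((X[n] - mu[labels[n]]) ** 2) for n in range(N))
print(np.isclose(frob_obj, kmeans_obj))          # True: the two objectives coincide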
2.4 Why is it difficult to compute the predictive distribution of a logistic regression model, which, by definition, is given by 𝑝(𝑦∗ = 1|𝒙∗, 𝑿, 𝒚) = ∫ 𝑝(𝑦∗ = 1|𝒘, 𝒙∗)𝑝(𝒘|𝑿, 𝒚)𝑑𝒘? Suggest a method to approximate it and clearly show the necessary equations. (3 marks)
It is difficult because the integral here is not tractable: it involves integrating 𝑝(𝑦∗ = 1|𝒘, 𝒙∗), which is a sigmoid function, against the posterior 𝑝(𝒘|𝑿, 𝒚), and even if the latter is Gaussian (as in the Laplace approximation), the integral is still intractable. To approximate the integral, one way is to use a Monte-Carlo approximation where we draw 𝑆 i.i.d. samples 𝒘(1), 𝒘(2), …, 𝒘(𝑆) from the posterior and approximate the predictive distribution as
𝑝(𝑦∗ = 1|𝒙∗, 𝑿, 𝒚) ≈ (1/𝑆) ∑_{𝑠=1}^{𝑆} 𝑝(𝑦∗ = 1|𝒘(𝑠), 𝒙∗)
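A minimal sketch (assuming a Gaussian, e.g. Laplace, approximation to the posterior; the mean w_map and covariance H_inv below are hypothetical values, not learned from data):

import numpy as np

rng = np.random.default_rng(0)
D, S = 3, 1000

# Hypothetical Gaussian (e.g. Laplace) approximation of the posterior p(w | X, y)
w_map = np.array([0.5, -1.0, 2.0])        # posterior mean (illustrative values)
H_inv = 0.1 * np.eye(D)                   # posterior covariance (illustrative values)

x_star = np.array([1.0, 0.2, -0.3])
sigmoid = lambda a: 1.0 / (1.0 + np.exp(-a))

# Monte-Carlo approximation: average the sigmoid over S posterior samples of w
w_samples = rng.multivariate_normal(w_map, H_inv, size=S)   # S x D
p_pred = np.mean(sigmoid(w_samples @ x_star))
print(p_pred)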
2.5 Show that, for generative classification with uniform class marginal and Gaussian class conditionals 𝒩(𝒙|𝝁𝑘, 𝚺), the posterior probability of input 𝒙 belonging to class 𝑘 satisfies 𝑝(𝑦 = 𝑘|𝒙) ∝ exp(𝒘𝑘⊤𝒙 + 𝑏𝑘), and write down the expressions for 𝒘𝑘 and 𝑏𝑘. (5 marks)
Since the class marginal is uniform, the posterior probability is
𝑝(𝑦 = 𝑘|𝒙) = 𝑝(𝑦 = 𝑘)𝑝(𝒙|𝑦 = 𝑘)/𝑝(𝒙) ∝ 𝑝(𝒙|𝑦 = 𝑘)
Since the class conditional 𝑝(𝒙|𝑦 = 𝑘) = 𝒩(𝒙|𝝁𝑘, 𝚺), we have
𝑝(𝒙|𝑦 = 𝑘) ∝ exp(−(1/2)(𝒙 − 𝝁𝑘)⊤𝚺−1(𝒙 − 𝝁𝑘)) ∝ exp(𝝁𝑘⊤𝚺−1𝒙 − (1/2)𝝁𝑘⊤𝚺−1𝝁𝑘),
where in the last expression (after the proportionality sign) we have dropped the terms that are not specific to class 𝑘 (the quadratic term 𝒙⊤𝚺−1𝒙 and the normalization constant). Thus
𝑝(𝑦 = 𝑘|𝒙) ∝ 𝑝(𝒙|𝑦 = 𝑘) ∝ exp(𝝁𝑘⊤𝚺−1𝒙 − (1/2)𝝁𝑘⊤𝚺−1𝝁𝑘),
which is clearly of the form exp(𝒘𝑘⊤𝒙 + 𝑏𝑘) with 𝒘𝑘 = (𝝁𝑘⊤𝚺−1)⊤ = 𝚺−1𝝁𝑘 and 𝑏𝑘 = −(1/2)𝝁𝑘⊤𝚺−1𝝁𝑘.
Side note (not required for the answer): The above implies that this generative classification model has a similar form to the softmax classification model, although here the weights are obtained in a generative manner (from the estimated class means and shared covariance), not via gradient descent as in softmax classification.
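A minimal sketch (toy two-class data; all names and values are illustrative) of computing 𝒘𝑘 and 𝑏𝑘 from estimated class means and a shared covariance:

import numpy as np

rng = np.random.default_rng(0)
D = 2
X0 = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(50, D))   # inputs from class 0
X1 = rng.normal(loc=[2.0, 2.0], scale=1.0, size=(50, D))   # inputs from class 1

mu = np.stack([X0.mean(axis=0), X1.mean(axis=0)])           # K x D matrix of class means
Sigma = np.cov(np.vstack([X0 - mu[0], X1 - mu[1]]).T)       # shared covariance estimate
Sigma_inv = np.linalg.inv(Sigma)

# w_k = Sigma^{-1} mu_k   and   b_k = -(1/2) mu_k^T Sigma^{-1} mu_k
W = mu @ Sigma_inv                                          # row k holds w_k^T
b = -0.5 * np.einsum('kd,de,ke->k', mu, Sigma_inv, mu)

x = np.array([1.5, 1.8])
scores = W @ x + b                                          # log p(y = k | x) up to a constant
print(scores.argmax())                                      # predicted class (uniform class marginal)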