Tut3 Questions
Parts of this tutorial sheet are based on previous versions by Amos Storkey, Charles Sutton, and Chris Williams.
Reminders: Attempt the tutorial questions, and ideally discuss them, before your tutorial.
You can seek clarifications and hints on the class forum. Full answers will be released.
This week has less linear algebra! Try to spend some time preparing clear explanations.
1. A Gaussian classifier:
A training set consists of one-dimensional examples from two classes. The training
examples from class 1 are {0.5, 0.1, 0.2, 0.4, 0.3, 0.2, 0.2, 0.1, 0.35, 0.25} and the examples
from class 2 are {0.9, 0.8, 0.75, 1.0}.
a) Fit a one-dimensional Gaussian to each class by matching the mean and variance.
Also estimate the class probabilities π1 and π2 by matching the observed class
fractions. (This procedure fits the model with maximum likelihood: it selects the
parameters that give the training data the highest probability.) Sketch a plot of
the scores p(x, y) = P(y) p(x | y) for each class y, as functions of input location x.
b) What is the probability that the test point x = 0.6 belongs to class 1? Mark the
decision boundary or boundaries on your sketch, i.e., the location(s) where
P(class 1 | x) = P(class 2 | x) = 0.5. You are not required to calculate the
location(s) exactly. (A numerical sketch for checking parts a) and b) follows after
this question.)
c) Are the decisions that the model makes reasonable for very negative x and very
positive x? Are there any changes we could consider making to the model if we
wanted to change the model’s asymptotic behaviour?
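As a rough numerical check for parts a) and b), the NumPy sketch below fits each class-conditional Gaussian by matching its mean and variance, estimates the class probabilities from the observed class fractions, and evaluates the scores and the posterior at x = 0.6. The variable names and plotting grid are illustrative choices, not part of the question.

```python
import numpy as np
import matplotlib.pyplot as plt

x1 = np.array([0.5, 0.1, 0.2, 0.4, 0.3, 0.2, 0.2, 0.1, 0.35, 0.25])  # class 1
x2 = np.array([0.9, 0.8, 0.75, 1.0])                                  # class 2

# Maximum likelihood fit: match the mean and variance of each class.
# (np.var uses the 1/N estimator by default, which is the maximum likelihood choice.)
mu1, var1 = x1.mean(), x1.var()
mu2, var2 = x2.mean(), x2.var()

# Class probabilities pi1 and pi2 from the observed class fractions.
N1, N2 = len(x1), len(x2)
pi1, pi2 = N1 / (N1 + N2), N2 / (N1 + N2)

def gauss_pdf(x, mu, var):
    return np.exp(-0.5 * (x - mu)**2 / var) / np.sqrt(2 * np.pi * var)

# Part a): scores p(x, y) = P(y) p(x | y) over a grid of input locations.
xx = np.linspace(-0.5, 1.5, 401)
plt.plot(xx, pi1 * gauss_pdf(xx, mu1, var1), label='class 1')
plt.plot(xx, pi2 * gauss_pdf(xx, mu2, var2), label='class 2')
plt.xlabel('x'); plt.legend(); plt.show()

# Part b): posterior P(class 1 | x = 0.6) by Bayes' rule.
s1 = pi1 * gauss_pdf(0.6, mu1, var1)
s2 = pi2 * gauss_pdf(0.6, mu2, var2)
print('P(class 1 | x = 0.6) =', s1 / (s1 + s2))
```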
2. Gradient descent:
Let E(w) be a differentiable function. Consider the gradient descent procedure
w(t+1) ← w(t) − η ∇w E.
a) Are the following true or false? Prepare a clear explanation, stating any necessary
assumptions:
i) Let w(1) be the result of taking one gradient step from an initial weight
vector w(0). Then the error never gets worse, i.e., E(w(1)) ≤ E(w(0)).
ii) There exists some choice of the step size η such that E(w(1) ) < E(w(0) ).
b) A common programming mistake is to forget the minus sign in either the descent
procedure or in the gradient evaluation, so that one unintentionally writes a
procedure that does w(t+1) ← w(t) + η ∇w E. What happens? (A small numerical
illustration follows after this question.)
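If it helps to experiment with parts a) and b), here is a minimal sketch using the made-up function E(w) = 0.5 w², whose gradient is w and whose minimizer is w = 0; the function and step sizes are arbitrary choices, not part of the question.

```python
# Illustrative objective: E(w) = 0.5 * w**2, so grad E(w) = w and the minimizer is w = 0.
def grad_E(w):
    return w

eta = 0.1
w_descent, w_flipped = 1.0, 1.0
for _ in range(25):
    w_descent -= eta * grad_E(w_descent)   # correct sign: gradient descent
    w_flipped += eta * grad_E(w_flipped)   # forgotten minus sign: gradient ascent
print(w_descent, w_flipped)  # descent shrinks towards 0; the flipped version grows geometrically

# Relevant to a) i): even with the correct sign, too large a step can make E worse.
w_big = 1.0
for _ in range(5):
    w_big -= 2.5 * grad_E(w_big)           # step size too large: overshoots and diverges
print(w_big, 0.5 * w_big**2)               # E has increased from E(1.0) = 0.5
```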
3. Maximum likelihood and logistic regression:
Maximum likelihood logistic regression maximizes the log-likelihood
ℓ(w) = ∑n log p(y(n) | x(n), w),
with respect to the weights w. As usual, y(n) is a binary label at input location x(n).
The training data is said to be linearly separable if the two classes can be completely
separated by a hyperplane decision boundary.
a) Show that if the training data is linearly separable with a decision hyperplane
specified by w and b, the data is also separable with the boundary given by w̃
and b̃, where w̃ = cw and b̃ = cb for any scalar c > 0.
b) What consequence does the above result have for maximum likelihood training
of logistic regression for linearly separable data? (A numerical illustration follows
after this question.)
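A quick numerical illustration of both parts, using a tiny made-up separable dataset and separating parameters (none of these numbers come from the question): scaling w and b by c > 0 leaves every point on the same side of the boundary, while the log-likelihood keeps creeping up towards 0 as c grows.

```python
import numpy as np

# Tiny illustrative separable dataset and a separating (w, b); all values are made up.
X = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.2]])
y = np.array([0, 0, 1, 1])
w = np.array([1.0, 1.0])
b = -1.1

for c in [1.0, 10.0, 100.0]:
    a = X @ (c * w) + c * b                  # activations with the scaled parameters
    same_side = ((a > 0) == (y == 1)).all()  # the decision boundary is unchanged
    # Log-likelihood of the labels, computed stably: log sigmoid(a) = -logaddexp(0, -a).
    log_lik = -np.sum(np.logaddexp(0.0, -np.where(y == 1, a, -a)))
    print(f'c = {c:5.0f}:  still separated: {same_side},  log-likelihood = {log_lik:.6f}')
```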
4. Logistic regression with a regularized bias weight:
a) Suppose that we fit a logistic regression model with a bias weight w0, that
is p(y = 1 | x, w) = σ(w0 + w1 x1 + w2 x2), by maximum likelihood, obtaining
parameters ŵ. Sketch a possible decision boundary corresponding to ŵ. Is your
answer unique? How many classification errors does your method make on the
training set?
b) Now suppose that we regularize only the w0 parameter, that is, we minimize
J0(w) = −ℓ(w) + λw0²,
where ℓ is the log-likelihood of w (the log-probability of the labels given those
parameters).
Suppose λ is a very large number, so we regularize w0 all the way to 0, but
all other parameters are unregularized. Sketch a possible decision boundary.
How many classification errors does your method make on the training set?
(A code sketch contrasting the two settings follows after the hint.)
Hint: consider the behaviour of simple linear regression, w0 + w1 x1 + w2 x2 when
x1 = x2 = 0.
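For this last question the training set isn't reproduced in the text above, so the sketch below uses a made-up 2D dataset with a similar flavour: it is linearly separable, but not by any boundary through the origin. It fits the model by plain gradient descent on the negative log-likelihood, once with w0 free and once with w0 pinned at 0 (the limit of a very large λ). The data, step size and number of steps are all illustrative assumptions.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

# Made-up 2D dataset standing in for the question's training set: it is linearly
# separable, but not by any decision boundary that passes through the origin.
X = np.array([[1.0, 1.0], [1.5, 1.0], [1.0, 1.5],    # class 0
              [3.0, 3.0], [3.5, 3.0], [3.0, 3.5]])   # class 1
y = np.array([0, 0, 0, 1, 1, 1])
X_aug = np.hstack([np.ones((len(X), 1)), X])         # column of ones carries the bias w0

def fit_logreg(pin_bias_at_zero, eta=0.05, steps=20_000):
    """Gradient descent on the negative log-likelihood, with w = [w0, w1, w2]."""
    w = np.zeros(3)
    for _ in range(steps):
        grad = X_aug.T @ (sigmoid(X_aug @ w) - y)    # sum_n (sigma(a_n) - y_n) x_n
        if pin_bias_at_zero:
            grad[0] = 0.0                            # very-large-lambda limit: w0 stays at 0
        w = w - eta * grad
    return w

for pin in [False, True]:
    w = fit_logreg(pin)
    errors = np.sum(((X_aug @ w) > 0) != (y == 1))
    print(f'w0 pinned at 0: {pin!s:5}  w = {np.round(w, 2)}  '
          f'training errors = {errors}  p(y=1 | x=(0,0)) = {sigmoid(w[0]):.2f}')
```

With w0 free, maximum likelihood finds a separating boundary (and, as in question 3, would keep growing the weights if the optimizer ran for longer). With w0 pinned at 0 the boundary must pass through the origin, where the predicted probability is σ(0) = 0.5, and for a dataset like this one it can no longer separate the classes.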