Tutorial 4 Solutions
1. Flexible vs Inflexible Method (Modified from An Introduction to Statistical Learning)
For each of parts (a) through (d), indicate whether we would generally expect the performance of a flexible statistical learning method to be better or worse than an inflexible method. Justify your answer in terms of bias and variance.
(a) The sample size n is extremely large, and the number of features is small.
Solution: If the sample size is large and the number of features is small, the variance can usually be kept small even with flexible methods. Flexible methods can also achieve smaller bias, so they will likely be preferable in this case.
(b) The number of features is extremely large, and the number of observations n is small.
Solution: If the number of features is extremely large and the number of observations is small, it may be difficult to keep the variance small with flexible methods. Inflexible methods may perform better, even though they are more biased, provided their variance is much smaller.
(c) The relationship between the features and response is highly non-linear.
Solution: Inflexible methods may not be able to represent highly non-linear functions and thus have high bias. In this case, using flexible methods to reduce the bias may be helpful, although the resulting variance must still be kept in check.
(d) The variance of the error terms, i.e. σ² = Var(ϵ), is extremely high.
Solution: If the data is very noisy, then it is easy for flexible methods to overfit and
have high variance. An inflexible method would be less likely to overfit the noise.
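These scenarios can be probed with a quick simulation. The sketch below is a minimal illustration, assuming Python with NumPy and scikit-learn (the tutorial prescribes no code); the non-linear target, the polynomial degree standing in for "flexible", and the sample sizes are all arbitrary choices, contrasting scenarios (a) and (b) under the non-linear target of (c).

```python
# Minimal simulation contrasting a flexible and an inflexible method.
# All names, degrees, and sample sizes here are illustrative choices.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)

def run_trial(n_train, f, noise_sd=1.0):
    """Fit an inflexible (linear) and a flexible (degree-10 polynomial)
    model on n_train points and return their test MSEs."""
    x_tr = rng.uniform(-2, 2, size=(n_train, 1))
    y_tr = f(x_tr).ravel() + rng.normal(0, noise_sd, size=n_train)
    x_te = rng.uniform(-2, 2, size=(2000, 1))
    y_te = f(x_te).ravel() + rng.normal(0, noise_sd, size=2000)

    linear = LinearRegression().fit(x_tr, y_tr)
    flexible = make_pipeline(PolynomialFeatures(10), LinearRegression()).fit(x_tr, y_tr)
    return (mean_squared_error(y_te, linear.predict(x_te)),
            mean_squared_error(y_te, flexible.predict(x_te)))

f = lambda x: np.sin(2 * x)            # a non-linear target, as in part (c)
print("large n:", run_trial(5000, f))  # flexible wins: low bias, variance controlled
print("small n:", run_trial(15, f))    # inflexible often wins: variance dominates
```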
2. Bias-Variance Trade-off
(a) Provide a sketch of typical (squared) bias, variance, training error, test error, and irreducible error curves, on a single plot, as we go from less flexible statistical learning methods towards more flexible approaches. The x-axis should represent the amount of flexibility in the method, and the y-axis should represent the values for each curve. There should be five curves. Make sure to label each one.
Solution:
Figure 1: Curves showing bias, variance, training error, test error and Bayes error against flexibility.
(b) Explain why each of the five curves has the shape displayed in part (a).
Solution: As flexibility increases, the model is better able to approximate the target conditional distribution, so bias decreases; for universal approximators, the bias should decrease to zero. As flexibility increases, variance increases, since the number of ways to fit the same training dataset also increases. Training error decreases monotonically, eventually reaching zero once the function class can interpolate the data exactly. Test error decreases initially, while the reduction in bias dominates, but eventually increases once variance grows faster than any further reduction in bias. Irreducible error is external data noise, unaffected by the approximator, so it does not change with the flexibility of the method.
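These shapes can be reproduced with a small Monte-Carlo sketch. The snippet below is illustrative only and assumes Python with NumPy (not prescribed by the tutorial); polynomial degree stands in for the flexibility axis, and the target f, noise level σ, and sample size are arbitrary choices. For each degree it averages squared bias, variance, and training error over many simulated training sets, and estimates the expected test error via the decomposition test ≈ bias² + variance + σ².

```python
# Monte-Carlo sketch of the five curves: sweep flexibility (polynomial
# degree) and average the error components over simulated training sets.
import numpy as np

rng = np.random.default_rng(1)
f = lambda x: np.sin(3 * x)               # illustrative target
sigma = 0.3                               # irreducible noise level
x_grid = np.linspace(-1, 1, 50)           # fixed test inputs

for degree in [1, 3, 6, 9]:
    preds, train_errs = [], []
    for _ in range(200):                  # 200 simulated training sets
        x = rng.uniform(-1, 1, 40)
        y = f(x) + rng.normal(0, sigma, 40)
        coefs = np.polyfit(x, y, degree)
        train_errs.append(np.mean((y - np.polyval(coefs, x)) ** 2))
        preds.append(np.polyval(coefs, x_grid))
    preds = np.array(preds)
    bias2 = np.mean((preds.mean(axis=0) - f(x_grid)) ** 2)
    var = np.mean(preds.var(axis=0))
    print(f"degree {degree}: bias^2={bias2:.3f}  var={var:.3f}  "
          f"train={np.mean(train_errs):.3f}  test~{bias2 + var + sigma**2:.3f}")
```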
3. k-Nearest Neighbours Regression. Assume each training point is generated as y_i = f(x_i) + ϵ_i, with ϵ_i ∼ N(0, σ²) drawn independently for each i.
(a) Let’s first try to understand the assumption about the training dataset. Which of these statements are correct about the assumptions on the training set?
A) all ϵ_i have the same mean
B) all y_i have the same mean
C) all y_i have the same variance
Solution: A and C.
Reason: Consider the data-generation process y_i = f(x_i) + ϵ_i. Here f(x_i) is a function of x_i, which is not a random variable, so E[y_i] = f(x_i) generally differs across i, and B is false. However, ϵ_i is a random variable which follows the Gaussian distribution ϵ_i ∼ N(0, σ²) and is sampled independently for each i, so every ϵ_i has mean 0, and A is true. Moreover, Var[f(x_i)] = 0 because we assume the nearest neighbours x_i are fixed, hence the values of f(x_i) are also fixed; thus Var[y_i] = Var[ϵ_i] = σ² is the same for every i, and C is true.
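A quick numerical check of this reasoning, as a sketch (Python with NumPy is an assumption here, and f, the fixed inputs, and σ are arbitrary choices): simulating many draws of ϵ_i at three fixed inputs shows E[ϵ_i] ≈ 0 for all i (A), differing E[y_i] (B false), and equal Var[y_i] ≈ σ² (C).

```python
# Numerical check of statements A, B, C (illustrative sketch only).
import numpy as np

rng = np.random.default_rng(2)
f = lambda x: x ** 2
x = np.array([0.0, 1.0, 2.0])               # fixed inputs x_i, as assumed
sigma = 0.5
eps = rng.normal(0, sigma, size=(100_000, 3))
y = f(x) + eps                              # y_i = f(x_i) + eps_i, broadcast

print("E[eps_i] :", eps.mean(axis=0))       # ~0 for every i      -> A true
print("E[y_i]   :", y.mean(axis=0))         # ~0, 1, 4: differ    -> B false
print("Var[y_i] :", y.var(axis=0))          # ~sigma^2 for all i  -> C true
```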
• Bias squared: Assuming fixed neighbours x_i,
  E[(1/k) ∑_{i=1}^{k} y(x_i)] = (1/k) ∑_{i=1}^{k} f(x_i).
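To spell out the step behind this identity: E[y(x_i)] = f(x_i) + E[ϵ_i] = f(x_i), so linearity of expectation gives the displayed equality, and the squared bias of the kNN estimate follows. A short worked form, assuming (as the setup suggests) the kNN estimator f̂(x_0) = (1/k) ∑_{i=1}^{k} y(x_i) at a query point x_0, where x_0 is notation introduced here:

```latex
\mathrm{Bias}^2\!\left[\hat{f}(x_0)\right]
  = \left(\mathbb{E}\!\left[\hat{f}(x_0)\right] - f(x_0)\right)^{2}
  = \left(\frac{1}{k}\sum_{i=1}^{k} f(x_i) - f(x_0)\right)^{2}.
```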
(c) How would the equations from Part (b) change when k is varied? How would you
choose an optimal value of k using the above equations?
Solution:
• Choice of k will not affect the irreducible error.
• The variance will decrease as k increases: with fixed neighbours, Var[f̂(x_0)] = Var[(1/k) ∑_{i=1}^{k} ϵ_i] = σ²/k.
• As the value of k increases, bias will likely increase: as the number of neighbours grows, we consider points further away from x_0, so the average of the f(x_i) drifts further from f(x_0) (in the extreme case where k equals the number of points in the training set, f̂(x) would just give the mean of the training set outputs).
• The optimal k balances these two effects: since the irreducible error is constant in k, choose the k that minimises the sum of the squared bias and variance terms.
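In practice this choice of k is usually made by a validation sweep. Below is a minimal sketch, assuming scikit-learn's KNeighborsRegressor (the target function, noise level, and candidate k values are illustrative):

```python
# Sweep k and pick the value with the lowest held-out error
# (illustrative sketch; scikit-learn is an assumption here).
import numpy as np
from sklearn.neighbors import KNeighborsRegressor
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(3)
f = lambda x: np.sin(4 * x)
x_tr = rng.uniform(0, 1, (200, 1))
y_tr = f(x_tr).ravel() + rng.normal(0, 0.3, 200)
x_va = rng.uniform(0, 1, (1000, 1))
y_va = f(x_va).ravel() + rng.normal(0, 0.3, 1000)

for k in [1, 5, 20, 50, 200]:
    model = KNeighborsRegressor(n_neighbors=k).fit(x_tr, y_tr)
    mse = mean_squared_error(y_va, model.predict(x_va))
    print(f"k={k:3d}  validation MSE={mse:.3f}")
# small k: low bias, high variance; k=200 (=n) predicts the training mean
```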