lecture03b_overfitting_annotated
Martin Jaggi
Last updated on: September 24, 2024
credits to Mohammad Emtiyaz Khan & Rüdiger Urbanke
Motivation
Models can be too limited or they can be too rich. In the
first case we cannot find a function in our model that is a good
fit for the data; we then say that we underfit. In the second
case the model is so rich that it fits not only the underlying
signal but also the noise, and therefore generalizes poorly;
we then say that we overfit.
[Figure: the simplest case, a constant model f_0(x) = w_0 (M = 0), plotted as t versus x on [0, 1] together with the data points.]
More generally, we can fit a polynomial of degree M to the data:

y_n ≈ w_0 + w_1 x_n + w_2 x_n^2 + ... + w_M x_n^M =: φ(x_n)^T w.

This is a polynomial feature expansion (a "lifting") of the scalar input x_n ∈ R; the model is still linear in the parameters w ∈ R^(M+1).
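As a concrete illustration (not part of the original notes), the polynomial lifting and a fit of w could look as follows in NumPy; the helper names poly_features and fit_least_squares are hypothetical, and ordinary least squares is assumed as the fitting criterion.

import numpy as np

def poly_features(x, M):
    # Lift each scalar x_n to phi(x_n) = (1, x_n, x_n^2, ..., x_n^M); one row per sample.
    return np.vander(np.asarray(x, dtype=float), M + 1, increasing=True)

def fit_least_squares(x, y, M):
    # The model is linear in w, so w can be fitted by ordinary least squares.
    Phi = poly_features(x, M)
    w, *_ = np.linalg.lstsq(Phi, y, rcond=None)
    return w

For example, fit_least_squares(x, y, 3) returns the four coefficients w_0, ..., w_3 of a cubic fit.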
Overfitting with Linear Models
In the following four figures, circles are data points, the green
line represents the “true function”, and the red line is the
model. The parameter M is the maximum degree in the
polynomial basis.
[Four panels: polynomial fits for M = 0, 1, 3, and 9, each plotting t versus x on [0, 1], showing the data points (circles), the true function (green), and the fitted model (red). Annotations note that the M = 9 fit achieves a training loss of zero, i.e. it passes through every training point, while a held-out test point can lie far from the fitted curve.]
For M = 0 the model is a constant and underfits, and the
same is true for M = 1. For M = 3 the model fits the data
fairly well and is not yet so rich that it also fits the small
"wiggles" caused by the noise. But for M = 9 the model is
so rich that it can fit every single data point exactly, and we
see severe overfitting taking place.
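The following sketch (my own illustration, not from the lecture) reproduces this behaviour numerically, assuming the true function is sin(2πx) sampled at N = 10 points with Gaussian noise; the exact data behind the figures is not given in the notes.

import numpy as np

rng = np.random.default_rng(0)
true_f = lambda x: np.sin(2 * np.pi * x)            # assumed "true function"

x_tr = rng.uniform(0.0, 1.0, 10)                    # N = 10 noisy training points
y_tr = true_f(x_tr) + 0.2 * rng.standard_normal(10)
x_te = rng.uniform(0.0, 1.0, 200)                   # fresh test points
y_te = true_f(x_te) + 0.2 * rng.standard_normal(200)

def rmse(x, y, w, M):
    return np.sqrt(np.mean((np.vander(x, M + 1, increasing=True) @ w - y) ** 2))

for M in [0, 1, 3, 9]:
    Phi = np.vander(x_tr, M + 1, increasing=True)   # polynomial feature lifting
    w, *_ = np.linalg.lstsq(Phi, y_tr, rcond=None)  # least-squares fit of w
    print(f"M={M}: train RMSE {rmse(x_tr, y_tr, w, M):.3f}, "
          f"test RMSE {rmse(x_te, y_te, w, M):.3f}")

Typically the training error keeps shrinking with M (reaching roughly zero at M = 9, which has as many parameters as data points), while the test error eventually grows again.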
What can we do to avoid overfitting? If we increase the
amount of data (increase N but keep M fixed), the overfitting
might be reduced. This is shown in the following two figures,
where we again use the same model complexity M = 9 but have
more data (N = 15 or even N = 100); a small numerical sketch
of this experiment follows the figures.
[Two panels: M = 9 polynomial fits with more data, N = 15 and N = 100, plotting t versus x on [0, 1]. With more data the fitted red curve stays much closer to the green true function.]
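A sketch of this "more data" experiment under the same assumed setup: keep M = 9 fixed and grow N (again an illustration, not the lecture's actual data).

import numpy as np

rng = np.random.default_rng(1)
true_f = lambda x: np.sin(2 * np.pi * x)            # same assumed true function
x_te = np.linspace(0.0, 1.0, 500)                   # dense grid for evaluation

M = 9                                               # model complexity stays fixed
for N in [10, 15, 100]:
    x_tr = rng.uniform(0.0, 1.0, N)
    y_tr = true_f(x_tr) + 0.2 * rng.standard_normal(N)
    Phi = np.vander(x_tr, M + 1, increasing=True)
    w, *_ = np.linalg.lstsq(Phi, y_tr, rcond=None)
    pred = np.vander(x_te, M + 1, increasing=True) @ w
    print(f"N={N:4d}: test RMSE {np.sqrt(np.mean((pred - true_f(x_te)) ** 2)):.3f}")

With M fixed, more data typically pulls the fitted curve toward the true function, so the test error shrinks even though the model is very rich.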
Additional Materials
Read about overfitting in the paper by Pedro Domingos (Sections 3 and 5
of “A few useful things to know about machine learning”).