Fundamentals Part 3

1) The document discusses assessing model accuracy, specifically comparing training error to test error. Training error is calculated on the same data used to train the model, while test error is calculated on unseen data, providing a better measure of how the model will generalize. 2) There is a bias-variance tradeoff when choosing a flexible machine learning model - more flexible models will have higher variance and fit the training data better, but may not generalize as well, resulting in higher test error. 3) The expected test error can be decomposed into components of variance, bias, and irreducible error. The goal is to minimize total test error by balancing the bias-variance tradeoff.

Uploaded by

Agustin Agustin

Machine Learning and Data Analytics

Fundamentals – Part 3

Dr. Rossana Cavagnini

Deutsche Post Chair – Optimization of Distribution Networks (DPO)

RWTH Aachen University

[email protected]
Assessing model accuracy

Agenda

1 Assessing model accuracy

Training vs test errors
The bias-variance trade-off

DPO MLDA 2
Assessing model accuracy

Measuring the quality of fit (regression)

Performance of a learning method: how well its predictions actually match the observed data
Training Mean Squared Error: computed on the same data used to train the model
1 Fit our learning method to the training observations: $\{(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)\}$
2 Obtain the estimate $\hat{f}$ and compute $\hat{f}(x_1), \hat{f}(x_2), \ldots, \hat{f}(x_n)$
3 Compute:
\[
\text{Training MSE} = \frac{1}{n} \sum_{i=1}^{n} \big(y_i - \hat{f}(x_i)\big)^2
\]

- Small: predictions are close to the true responses ($\hat{f}(x_i) \approx y_i$)

- Large: predictions differ substantially from the true responses
Not a useful measure of predictive performance (a method can drive it down simply by memorizing the training data); it is only used to fit the parameters
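
The three steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data set is synthetic (a noisy straight line invented for the example), the learning method is an ordinary least-squares line, and the helper name `training_mse` is hypothetical.

```python
import numpy as np

def training_mse(y, y_hat):
    """Average squared difference between observed and fitted responses."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    return float(np.mean((y - y_hat) ** 2))

# Hypothetical training observations (x_i, y_i): a noisy straight line.
rng = np.random.default_rng(0)
x_train = np.linspace(0.0, 1.0, 20)
y_train = 2.0 * x_train + 1.0 + rng.normal(scale=0.1, size=x_train.size)

# Steps 1 and 2: fit a line by least squares, then evaluate the
# estimate f_hat at the same training inputs.
slope, intercept = np.polyfit(x_train, y_train, deg=1)
y_fit = slope * x_train + intercept

# Step 3: average the squared residuals over the n training observations.
mse_train = training_mse(y_train, y_fit)
print(f"Training MSE: {mse_train:.4f}")
```

The value is small here because the fitted line is evaluated on the very points it was fitted to, which is exactly why it says little about new data.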

Test Mean Squared Error: use previously unseen observations to check performance
1 Fit our learning method to the training observations: $\{(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)\}$
2 Get $(x_0, y_0)$: previously unseen test observation (not used to train the model)
3 Compute:
\[
\text{Test MSE} = \operatorname{Ave}\big(\hat{f}(x_0) - y_0\big)^2
\]
What if no test observations are available?
Choose the learning method which minimizes the Training MSE?
No guarantee that the method with the lowest Training MSE will also have the lowest Test MSE!
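
A minimal sketch of the test-MSE procedure, assuming a synthetic data set and a cubic polynomial as the learning method (both are choices made for this example, not taken from the slides). The average "Ave" is taken over a set of held-out observations that play the role of $(x_0, y_0)$.

```python
import numpy as np

def mse(y, y_hat):
    """Average squared prediction error."""
    return float(np.mean((np.asarray(y) - np.asarray(y_hat)) ** 2))

rng = np.random.default_rng(1)

# Hypothetical data: y = sin(2x) + noise.
x = rng.uniform(-1.0, 1.0, size=200)
y = np.sin(2.0 * x) + rng.normal(scale=0.3, size=x.size)

# Keep the last 50 observations aside; they are never used for fitting.
x_train, y_train = x[:150], y[:150]
x_test, y_test = x[150:], y[150:]

# Step 1: fit on the training observations only.
coeffs = np.polyfit(x_train, y_train, deg=3)

# Steps 2 and 3: predict the unseen observations and average the squared errors.
train_mse = mse(y_train, np.polyval(coeffs, x_train))
test_mse = mse(y_test, np.polyval(coeffs, x_test))
print(f"Training MSE: {train_mse:.4f}  Test MSE: {test_mse:.4f}")
```

Only `test_mse` estimates how the fitted model behaves on data it has not seen; the two numbers need not rank competing methods in the same order.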

[Figure: two panels. Left: Y plotted against X. Right: Mean Squared Error plotted against Flexibility.]

L: black = true function, orange = linear regression, blue and green = smoothing spline fits

R: grey = Training MSE, red = Test MSE, dotted = irreducible error ($\operatorname{Var}(\varepsilon)$)

Training MSE decreases as the method is more flexible

Test MSE initially decreases with flexibility, but then increases
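
The two observations above can be explored with a small simulation. The sketch below is an assumption-laden stand-in for the figure: it uses polynomial degree as the flexibility knob (the figure uses smoothing splines), and the true function, noise level, and sample sizes are invented for the example.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(2)

def true_f(x):
    # Hypothetical true function, chosen only for illustration.
    return np.sin(3.0 * x)

def sample(n):
    """Draw n observations y = f(x) + noise."""
    x = rng.uniform(-1.0, 1.0, size=n)
    y = true_f(x) + rng.normal(scale=0.3, size=n)
    return x, y

x_train, y_train = sample(40)
x_test, y_test = sample(2000)  # large test set for a stable estimate

degrees = list(range(1, 11))   # higher degree = more flexible method
train_curve, test_curve = [], []
for d in degrees:
    f_hat = Polynomial.fit(x_train, y_train, deg=d)
    train_curve.append(float(np.mean((y_train - f_hat(x_train)) ** 2)))
    test_curve.append(float(np.mean((y_test - f_hat(x_test)) ** 2)))

for d, tr, te in zip(degrees, train_curve, test_curve):
    print(f"degree={d:2d}  training MSE={tr:.4f}  test MSE={te:.4f}")
```

Because each polynomial family contains the previous one, the training MSE can only go down as the degree grows. The test MSE carries no such guarantee, and its exact shape depends on the random draw.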

The bias-variance trade-off

Our predictor $\hat{f}$ depends on:
1 the parameters we found solving the training problem
2 the points in the training set (their selection)
Two sources of randomness: $\varepsilon$ and the training set
\[
\underbrace{E\big(y_0 - \hat{f}(x_0)\big)^2}_{\text{Expected test MSE}}
= \underbrace{\operatorname{Var}\big(\hat{f}(x_0)\big)}_{\text{Variance of } \hat{f}(x_0)}
+ \underbrace{\big[\operatorname{Bias}\big(\hat{f}(x_0)\big)\big]^2}_{\text{Squared bias of } \hat{f}(x_0)}
+ \underbrace{\operatorname{Var}(\varepsilon)}_{\text{Variance of the error terms}}
\]

Variance: the amount by which $\hat{f}$ would change if estimated with a different training set (more flexible methods have higher variance)
Bias: error obtained by approximating a complex real-life problem with a simpler
model (more flexible methods have less bias)
To minimize the expected test error, select a learning method with low variance and
low bias
Bad news: models with low variance tend to have high bias (and vice versa)
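
The decomposition can be checked numerically with a Monte Carlo sketch. Everything specific here is an assumption made for illustration: the true function, the noise level, the test input `x0`, and the use of polynomial degree 1 versus degree 9 as a rigid and a flexible method. As a simplification, the training inputs are held fixed and each "different training set" differs only in its noise draws.

```python
import numpy as np
from numpy.polynomial import Polynomial

rng = np.random.default_rng(3)
sigma = 0.3                            # std. dev. of the error term (assumed)
x0 = 0.5                               # test input where the error is decomposed
x_train = np.linspace(-1.0, 1.0, 30)   # fixed training inputs (simplification)
n_sets = 2000                          # number of simulated training sets

def true_f(x):
    # Hypothetical true function, chosen only for illustration.
    return np.sin(3.0 * x)

def decompose(degree):
    """Estimate each term of the decomposition at x0 for one polynomial degree."""
    preds = np.empty(n_sets)
    for k in range(n_sets):
        # A new training set: same inputs, fresh noise on the responses.
        y_train = true_f(x_train) + rng.normal(scale=sigma, size=x_train.size)
        preds[k] = Polynomial.fit(x_train, y_train, deg=degree)(x0)
    # Independent test responses y0 at x0, one per simulated training set.
    y0 = true_f(x0) + rng.normal(scale=sigma, size=n_sets)
    return {
        "variance": float(np.var(preds)),
        "bias_sq": float((np.mean(preds) - true_f(x0)) ** 2),
        "noise": sigma ** 2,
        "test_mse": float(np.mean((y0 - preds) ** 2)),
    }

rigid = decompose(1)      # low flexibility
flexible = decompose(9)   # high flexibility
for name, r in (("degree 1", rigid), ("degree 9", flexible)):
    total = r["variance"] + r["bias_sq"] + r["noise"]
    print(f"{name}: variance={r['variance']:.4f}  bias^2={r['bias_sq']:.4f}  "
          f"sum of terms={total:.4f}  simulated test MSE={r['test_mse']:.4f}")
```

In this setup the rigid fit shows the larger squared bias and the flexible fit the larger variance, and for both the sum of the three terms agrees with the simulated test MSE up to Monte Carlo error.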

A model with high variance and low bias is an overfitted model (the model is too complex and too specific to the training data, so it does not generalize well).
A model with low variance and high bias is an underfitted model (the model is not complex enough).
Its low variance helps the model generalize, but its high bias causes a systematic deviation between the predicted values and the true ones.
From a practical point of view:
- Overfitting an estimator is easy: add new parameters (e.g., add quadratic terms to a linear regression)
- Making an overfitted model generalize better is hard (regularization may help)
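
Both practical points can be illustrated with a short sketch. The slides name regularization only in general terms; ridge regression (an L2 penalty on the coefficients) is used here as one common example, and the data, polynomial degrees, and penalty strength `lam` are assumptions made for the illustration. For brevity the intercept is penalized too, whereas common practice leaves it unpenalized.

```python
import numpy as np

rng = np.random.default_rng(4)

# Hypothetical data generated from a truly linear relationship plus noise.
x = np.linspace(-1.0, 1.0, 25)
y = 1.0 + 2.0 * x + rng.normal(scale=0.3, size=x.size)

def design(x, degree):
    """Design matrix with columns 1, x, x^2, ..., x^degree."""
    return np.vander(x, N=degree + 1, increasing=True)

def fit_ridge(X, y, lam):
    """Closed-form ridge solution; lam = 0 gives ordinary least squares."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

def train_mse(X, y, beta):
    return float(np.mean((y - X @ beta) ** 2))

X1, X8 = design(x, 1), design(x, 8)
beta_linear = fit_ridge(X1, y, lam=0.0)    # the adequate model
beta_overfit = fit_ridge(X8, y, lam=0.0)   # extra polynomial terms, no penalty
beta_ridge = fit_ridge(X8, y, lam=1.0)     # same terms, with an L2 penalty

print("training MSE, degree 1:       ", train_mse(X1, y, beta_linear))
print("training MSE, degree 8:       ", train_mse(X8, y, beta_overfit))
print("training MSE, degree 8 ridge: ", train_mse(X8, y, beta_ridge))
print("coefficient norm, degree 8:       ", np.linalg.norm(beta_overfit))
print("coefficient norm, degree 8 ridge: ", np.linalg.norm(beta_ridge))
```

Adding terms lowers the training MSE even though the extra terms only chase noise. The penalty gives up some training fit in exchange for smaller coefficients; whether that improves the test error depends on the choice of `lam`, which is not tuned here.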

Measuring the quality of fit (classification)

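$y_i$ is no longer numerical
Training error rate:
\[
\frac{1}{n} \sum_{i=1}^{n} I(y_i \neq \hat{y}_i)
\]
$I(y_i \neq \hat{y}_i)$: indicator function
\[
I(y_i \neq \hat{y}_i) =
\begin{cases}
1 & \text{if } y_i \neq \hat{y}_i \\
0 & \text{otherwise}
\end{cases}
\]
Test error rate:
\[
\operatorname{Ave}\big(I(y_0 \neq \hat{y}_0)\big)
\]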
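
The error rate is simply the fraction of misclassified observations. A minimal sketch, with invented labels and a hypothetical helper name `error_rate`:

```python
import numpy as np

def error_rate(y_true, y_pred):
    """Fraction of observations whose predicted label differs from the true one."""
    y_true = np.asarray(y_true)
    y_pred = np.asarray(y_pred)
    # (y_true != y_pred) is the indicator I(y_i != y_hat_i), one entry per observation.
    return float(np.mean(y_true != y_pred))

# Hypothetical true and predicted class labels for five training observations.
y_train = ["spam", "ham", "ham", "spam", "ham"]
y_train_hat = ["spam", "ham", "spam", "spam", "ham"]

# One of the five labels is wrong.
print(error_rate(y_train, y_train_hat))
```

Applying the same function to held-out labels and their predictions gives the test error rate.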
