
INTRODUCTION TO MACHINE LEARNING
Prof. Eduardo Bezerra
(CEFET/RJ)
[email protected]
2

MODEL EVALUATION
AND SELECTION
Overview
3

• Model Evaluation
• Model Selection
4
Model Evaluation
Generalization error
5

• Error we would get if we could evaluate the model over the entire population.
• Just because a hypothesis fits the training set well does not mean that it is a good hypothesis.
• The training error (empirical error) most likely will be less than the generalization error.
Model Evaluation
6

• A hypothesis may present a low training error, but still be poor (due to overfitting).
• Therefore, it is appropriate to evaluate the performance of an algorithm on unseen data.
• Model evaluation refers to the process of estimating the generalization error of an ML model.
Evaluation metrics
7

• Accuracy
• Precision, recall, F1 measure
• Squared error
• Likelihood
• Posterior probability
• Cost/utility
• Margin
• KL divergence
• …
Evaluation techniques
8

• Common techniques for estimating the generalization performance of an ML model:
  • holdout method
  • k-fold cross-validation
Holdout method (training/test)
9

• Randomly split the original dataset into separate training and test datasets:
  • training dataset: used for training a model M
  • test dataset: used to estimate the generalization error of M
• Can lead to a misleading estimate of the generalization error if the test data is also used for model selection (see the sketch below).
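A rough sketch of the holdout method, assuming scikit-learn and a synthetic dataset rather than the slides' own data:

# Sketch of the holdout method: one random 70%/30% train/test split.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)   # synthetic data for illustration

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)            # train model M
print("estimated generalization accuracy:", model.score(X_test, y_test))   # evaluate on unseen data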
Holdout method (training/validation/test)
10

• A better way: separate the data into three parts.
  • training set: used to fit several models
  • validation set: performance on this set is used for model selection
  • test set: used only to estimate the generalization error
• Typical proportions are 70%/15%/15% and 60%/20%/20% (a split along these lines is sketched below).
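A minimal sketch of a 60%/20%/20% split, done with two successive calls to scikit-learn's train_test_split (synthetic data assumed):

# Sketch: 60%/20%/20% train/validation/test split via two train_test_split calls.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, random_state=0)

# First hold out 20% for the test set; then split the remaining 80%
# into 75%/25%, which yields 60%/20% of the original data.
X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.20, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_tmp, y_tmp, test_size=0.25, random_state=0)

print(len(X_train), len(X_val), len(X_test))   # 600 200 200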
Holdout method (training/validation/test)
11

Source: Python Machine Learning, 2nd ed., p. 191


k-fold cross-validation
12

• Randomly split the training dataset into k folds without replacement:
  • k−1 folds are used for model training
  • one fold is used for performance evaluation
• The procedure is repeated k times so that we obtain k models and k performance estimates (see the sketch below).
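A sketch of the procedure done "by hand" with scikit-learn's KFold splitter, assuming a synthetic dataset and a logistic-regression model:

# Sketch of k-fold cross-validation: k models, one held-out fold each.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

X, y = make_classification(n_samples=500, random_state=0)

scores = []
for train_idx, val_idx in KFold(n_splits=10, shuffle=True, random_state=0).split(X):
    model = LogisticRegression(max_iter=1000).fit(X[train_idx], y[train_idx])  # train on k-1 folds
    scores.append(model.score(X[val_idx], y[val_idx]))                         # evaluate on the held-out fold

print("mean CV accuracy:", np.mean(scores))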
k-fold cross-validation
13

Source: Python Machine Learning, 2nd ed., p. 191


k-fold cross-validation
14

• Keep in mind that, as k increases:
  • the bias in the estimation of the generalization error decreases;
  • the computational cost also increases.
• Empirical evidence shows that k=10 is a good value for datasets of moderate size.
• For large datasets, k can be safely decreased (the sketch below compares k=5 and k=10).
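To see the effect of k in practice, a small sketch (same assumptions as before) comparing k=5 and k=10 with cross_val_score:

# Sketch: same model evaluated with different numbers of folds.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)

for k in (5, 10):
    scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=k)
    print(f"k={k}: mean accuracy = {scores.mean():.3f} (std = {scores.std():.3f})")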
Other techniques
15

• Leave-one-out cross-validation
• Stratified k-fold cross-validation
• Bootstrap validation
B. Efron and R. Tibshirani, "Improvements on Cross-Validation: The .632+ Bootstrap Method," Journal of the American Statistical Association, 92(438): 548-560, 1997.

R. Kohavi, "A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection," International Joint Conference on Artificial Intelligence (IJCAI), 14(12): 1137-1143, 1995.
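For reference, a sketch of how two of the listed variants map onto scikit-learn splitters (bootstrap validation has no direct splitter there and is omitted):

# Sketch: leave-one-out and stratified k-fold cross-validation with scikit-learn.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import LeaveOneOut, StratifiedKFold, cross_val_score

X, y = make_classification(n_samples=100, random_state=0)
clf = LogisticRegression(max_iter=1000)

loo_scores = cross_val_score(clf, X, y, cv=LeaveOneOut())                  # n models, one sample left out each time
skf_scores = cross_val_score(clf, X, y, cv=StratifiedKFold(n_splits=10))   # folds preserve class proportions

print("leave-one-out accuracy:     ", loo_scores.mean())
print("stratified 10-fold accuracy:", skf_scores.mean())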
Notation
16

• m: training set size
• D_train: training dataset
• m_test: test set size
• D_test: test dataset
• E_test: error computed on the test set (test error)
17
Model Selection
Model Selection
18

• Hyperparameters are parameters that are not directly learned within models.
• Model selection is the process of selecting the optimal hyperparameter values for an ML algorithm.
• Examples:
  • degree of the polynomial in polynomial regression;
  • regularization term; learning rate;
  • many more…
Model Selection - example
19

• Given many models, we apply a systematic approach to identify the "best" model.
• The "best" model is chosen by using some quality measure (e.g., MSE in linear regression).
• Let us see an example in the context of polynomials with different degrees…
Model Selection - example (cont.)
20

• Suppose we want to choose the right degree to fit a polynomial regression.
Model Selection - example (cont.)
21

• To choose one of these models, we select the one with the smallest validation error.
Model Selection - example (cont.)
22

• Finally, we calculate the test error on the polynomial that produced the smallest validation error.
Model selection - general procedure
23

1. Train a model on the training set for each hyperparameter setting in Θ (e.g., each polynomial degree).
2. Find Θ*, the hyperparameter setting with the smallest error on the validation set.
3. Estimate the generalization error by computing the error of the final model on the test set (see the sketch below).
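A sketch of the full procedure for the polynomial-regression example, assuming synthetic data and a scikit-learn pipeline:

# Sketch: select the polynomial degree on the validation set,
# then report the test error of the chosen model only once.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.RandomState(0)
X = rng.uniform(-3, 3, size=(300, 1))
y = np.sin(X).ravel() + 0.3 * rng.randn(300)          # synthetic data for illustration

# 60%/20%/20% train/validation/test split, as in the earlier slides.
X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_tmp, y_tmp, test_size=0.25, random_state=0)

best_degree, best_val_mse, best_model = None, np.inf, None
for degree in range(1, 11):                            # step 1: one model per hyperparameter setting
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression()).fit(X_train, y_train)
    val_mse = mean_squared_error(y_val, model.predict(X_val))
    if val_mse < best_val_mse:                         # step 2: smallest validation error
        best_degree, best_val_mse, best_model = degree, val_mse, model

test_mse = mean_squared_error(y_test, best_model.predict(X_test))   # step 3: generalization estimate
print(f"chosen degree: {best_degree}, validation MSE: {best_val_mse:.3f}, test MSE: {test_mse:.3f}")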
Hyperparameter search
24

• How do we search?
• There are two main approaches to searching the space of hyperparameters (both are sketched below):
  • Grid search exhaustively considers all hyperparameter combinations for a given set of values.
  • Randomized search samples a given number of candidates from the hyperparameter space according to a specified distribution.
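Both strategies are available in scikit-learn; the sketch below uses a logistic-regression model, and the parameter grid and distribution are only illustrative choices:

# Sketch: grid search vs. randomized search over logistic-regression hyperparameters.
from scipy.stats import uniform
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV

X, y = make_classification(n_samples=500, random_state=0)
clf = LogisticRegression(max_iter=1000, solver="liblinear")

# Grid search: every combination of the listed values is tried.
grid = GridSearchCV(clf, param_grid={"C": [0.01, 0.1, 1, 10], "penalty": ["l1", "l2"]}, cv=5)
grid.fit(X, y)
print("grid search best params:  ", grid.best_params_)

# Randomized search: n_iter candidates sampled from the given distribution.
rand = RandomizedSearchCV(clf, param_distributions={"C": uniform(0.01, 10)}, n_iter=20, cv=5, random_state=0)
rand.fit(X, y)
print("random search best params:", rand.best_params_)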
Hyperparameter search
25
