Week 7 Lecture 1 ML SPR25
with R
Instructor: Babu Adhimoolam
Resampling Methods

Learning objectives:
• Cross-validation methods
• Bootstrapping method
Estimating the test error rate!
• The accuracy of a machine learning model depends on how well it predicts the response y given x on a completely novel dataset (not on the training dataset).
• In real-life situations we don’t have access to such a novel dataset, so we must devise methods to estimate this error rate in order to know whether the model we developed is dependable.
• We overcome this issue by holding out some data (often termed test data) from model fitting during training, and finally estimating the model’s error on this hold-out dataset.
A simple Validation Set approach
• It involves randomly dividing the observations into a training set and a validation set (or hold-out set).
• The model is fit on the training set, and the fitted model is used to predict the responses in the validation set. The resulting validation set error rate provides an estimate of the test error rate.
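The validation set approach can be sketched as follows. This is a minimal Python illustration (the course uses R, but the logic is identical); the data here are simulated for the example, not from the course.

```python
import numpy as np

rng = np.random.default_rng(0)

# Simulated data (hypothetical): a noisy quadratic relationship.
n = 100
x = rng.uniform(-2, 2, n)
y = 1 + 2 * x - x**2 + rng.normal(0, 0.5, n)

# Randomly divide the observations into a training set and a validation set.
idx = rng.permutation(n)
train, val = idx[: n // 2], idx[n // 2:]

# Fit the model (a degree-2 polynomial) on the training set only.
coef = np.polyfit(x[train], y[train], deg=2)

# Predict the responses in the validation set; the resulting MSE
# is the validation-set estimate of the test error rate.
pred = np.polyval(coef, x[val])
val_mse = np.mean((y[val] - pred) ** 2)
print(val_mse)
```

Because the split is random, rerunning with a different seed gives a different estimate, which is exactly the variability discussed later in this lecture.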
Leave-One-Out Cross-Validation (LOOCV)
• Unlike the validation set approach, we leave out only one observation for testing, and all the remaining (n − 1) observations are used for training. The process is repeated until every observation has been used for testing, one by one.
• If x1, x2, …, xn are the observations, then in the first round (x1, y1) is the test data, the remaining (x2, y2), …, (xn, yn) are the training data, and the test mean squared error is MSE1 = (y1 − ŷ1)^2. Similarly, in the second round, with (x2, y2) as the test data and the remainder as the training data, MSE2 = (y2 − ŷ2)^2.
• The LOOCV estimate of the test MSE is the average of these n errors: CV(n) = (1/n)(MSE1 + MSE2 + … + MSEn).
[Figure: schematic of the LOOCV splits of n observations]
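The LOOCV loop above can be sketched in a few lines. Again this is a Python illustration with simulated (hypothetical) data, fitting a simple linear model at each round.

```python
import numpy as np

rng = np.random.default_rng(1)

# Simulated data (hypothetical): a noisy linear relationship.
n = 30
x = rng.uniform(0, 10, n)
y = 3 + 0.7 * x + rng.normal(0, 1, n)

# Leave-one-out: each observation i serves as the test set exactly once,
# and the model is fit on the remaining n - 1 observations.
mses = []
for i in range(n):
    mask = np.ones(n, dtype=bool)
    mask[i] = False                       # hold out observation i
    coef = np.polyfit(x[mask], y[mask], deg=1)
    pred_i = np.polyval(coef, x[i])
    mses.append((y[i] - pred_i) ** 2)     # MSE_i = (y_i - yhat_i)^2

# The LOOCV estimate CV(n) is the average of the n squared errors.
cv_n = np.mean(mses)
print(cv_n)
```

Note that the model is refit n times, which is why LOOCV can be expensive for large datasets or slow-to-fit models.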
k-Fold Cross-Validation
• The observations are randomly divided into k groups, or folds, of approximately equal size. First, the first fold is used as the validation set and the model is fit on the remaining k − 1 folds. On the next iteration, the second fold is used as the validation set and the procedure is repeated, and so on.
• The procedure results in k estimates of the test MSE (MSE1, MSE2, …, MSEk), and the average of these k estimates is the k-fold cross-validation estimate: CV(k) = (1/k)(MSE1 + MSE2 + … + MSEk).
[Figure: schematic of the k-fold splits of n observations]
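The k-fold procedure can be sketched the same way. This Python illustration uses simulated (hypothetical) data and k = 10 folds.

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated data (hypothetical).
n, k = 100, 10
x = rng.uniform(-2, 2, n)
y = 1 + 2 * x - x**2 + rng.normal(0, 0.5, n)

# Randomly assign each observation to one of k equally sized folds.
folds = rng.permutation(np.repeat(np.arange(k), n // k))

fold_mses = []
for j in range(k):
    val = folds == j                              # fold j is the validation set
    coef = np.polyfit(x[~val], y[~val], deg=2)    # fit on the other k - 1 folds
    pred = np.polyval(coef, x[val])
    fold_mses.append(np.mean((y[val] - pred) ** 2))

# The k-fold CV estimate CV(k) is the average of the k fold MSEs.
cv_k = np.mean(fold_mses)
print(cv_k)
```

With k = n this reduces to LOOCV, so k-fold can be viewed as the general procedure, with k = 5 or 10 being the common practical choices.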
[Figure: left, the validation set method repeated 10 times with different random splits; right, the 10-fold method repeated with different random splits each time]
Comparison of true MSE and cross-validated MSE
[Figure: linear regression (orange) and two smoothing splines (green and blue); training MSE in gray, test MSE in red; true test MSE in blue, LOOCV estimate as a black dashed line, 10-fold estimate in orange]
The true MSE is closely approximated by the cross-validated MSE
True test MSE in blue; LOOCV estimate as a black dashed line; 10-fold estimate in orange.
• Despite underestimating the true MSE, all the CV estimates come close to identifying the degree of flexibility associated with the lowest test error.
Bias-Variance Trade-off with Cross-Validation Methods
• The validation set procedure fits a model to only part of the data, so its estimate of the test error is biased.
• Since the LOOCV procedure fits the model to almost all of the data, it has the lowest bias in test error estimation. The k-fold method has an intermediate level of bias.
• In terms of variance, LOOCV estimates suffer from high variance in comparison to k-fold estimates.
Parvandeh, S., Yeh, H. W., Paulus, M. P., & McKinney, B. A. (2020). Consensus features nested cross-validation. Bioinformatics, 36(10), 3093–3098.
Varma, S., & Simon, R. (2006). Bias in error estimation when using cross-validation for model selection. BMC Bioinformatics, 7(1), 1–8.
Bootstrapping
• In bootstrapping, we repeatedly sample data with replacement from the original dataset to create many bootstrap datasets.
• The model is then fit on each of the datasets so created, and the statistic of interest is calculated across all these datasets.
Graphical illustration of sampling with replacement
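Sampling with replacement can be sketched as follows. This Python illustration (hypothetical simulated data; the course uses R) bootstraps the sample mean and uses the spread across bootstrap datasets to estimate its standard error.

```python
import numpy as np

rng = np.random.default_rng(3)

# Original sample (hypothetical simulated data).
data = rng.normal(loc=5.0, scale=2.0, size=200)

# Draw B bootstrap datasets: each samples n observations WITH replacement
# from the original data, then computes the statistic of interest (here,
# the mean) on that dataset.
B = 1000
boot_means = np.array([
    rng.choice(data, size=data.size, replace=True).mean()
    for _ in range(B)
])

# The spread of the statistic across the bootstrap datasets estimates
# its standard error.
se_hat = boot_means.std(ddof=1)
print(se_hat)
```

Because each draw is with replacement, some observations appear multiple times in a bootstrap dataset while others are left out entirely, which is what the graphical illustration above depicts.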