
Cross-Validation

Outline
• Context
• Different Approaches of Cross-Validation
  – Validation Set Approach
  – Leave-One-Out Cross-Validation
  – k-Fold Cross-Validation
• An application
Context
Advertising Data Set
The Advertising data set consists of the sales (in thousands of units) of a particular product in 200 different markets. It also contains the advertising budgets (in thousands of dollars) for the product in each of the markets for three different media: TV, Radio, and Newspaper.
Regression Problem

[Diagram: a linear regression model uses the quantitative predictors TV, Radio, and Newspaper to produce predicted Sales (a quantitative response).]
Possible Models
Model   Predictors
1       TV
2       Radio
3       Newspaper
4       TV and Radio
5       TV and Newspaper
6       Radio and Newspaper
7       TV, Radio and Newspaper
Model Selection
Model   Predictors                 R²     Adjusted R²   RSE
1       TV                         0.61   0.61          3.26
2       Radio                      0.33   0.33          4.28
3       Newspaper                  0.05   0.05          5.09
4       TV & Radio                 0.90   0.90          1.68
5       TV & Newspaper             0.65   0.64          3.12
6       Radio & Newspaper          0.33   0.33          4.28
7       TV, Radio & Newspaper      0.90   0.90          1.69
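The R², adjusted R², and RSE values above come from ordinary least-squares fits. A minimal R sketch for one row of the table, assuming the Advertising.csv file from the ISLR book website has been read into a data frame named Advertising with columns Sales, TV, Radio, and Newspaper (the file name and column capitalization are assumptions):

```r
# Assumed file name and column names; Advertising.csv is distributed on the ISLR website
Advertising <- read.csv("Advertising.csv")

# Model 4: Sales regressed on the TV and Radio budgets
fit4 <- lm(Sales ~ TV + Radio, data = Advertising)
s <- summary(fit4)

s$r.squared       # R-squared (about 0.90 in the table above)
s$adj.r.squared   # adjusted R-squared
s$sigma           # residual standard error (RSE, about 1.68)
```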


Cross-validation
• Cross-validation is an alternative method for assessing the
performance of a model.
• It can also be used for model selection.
Case Data Set
Auto Data Set
• A data frame with 392 observations on 9 variables.
• Our discussion will be focused on the following two variables.
• mpg (miles per gallon): Dependent Variable (𝑌)
• horsepower (Engine horsepower): Independent Variable (𝑋)
• We have to fit a model that predicts mpg using horsepower.
Auto Data Set
Model (1)

                  Coefficient   Std. error   t-statistic   p-value
Intercept         39.936        0.717        55.66         <0.0001
horsepower        −0.158        0.006        −24.49        <0.0001

Multiple R-squared: 0.6059
Adjusted R-squared: 0.6049
Residual standard error: 4.906

Model (2)

                  Coefficient   Std. error   t-statistic   p-value
Intercept         56.900        1.800        31.60         <0.0001
horsepower        −0.466        0.031        −14.98        <0.0001
I(horsepower^2)   0.001         0.000        10.08         <0.0001

Multiple R-squared: 0.688
Adjusted R-squared: 0.686
Residual standard error: 4.374
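The two summaries above are ordinary least-squares fits of mpg on horsepower. A minimal R sketch of how such fits could be produced, assuming the Auto data frame from the ISLR package:

```r
library(ISLR)   # provides the Auto data frame (392 obs.)

# Model (1): mpg as a linear function of horsepower
fit1 <- lm(mpg ~ horsepower, data = Auto)
summary(fit1)   # coefficients, R-squared, residual standard error

# Model (2): add a quadratic term; I() protects the arithmetic inside the formula
fit2 <- lm(mpg ~ horsepower + I(horsepower^2), data = Auto)
summary(fit2)
```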
Possible Models
• Consider 10 possible models:
  Model (1): Predictor: horsepower
  Model (2): Predictors: horsepower and horsepower²
  ⋮
  Model (10): Predictors: horsepower, horsepower², …, horsepower¹⁰
Training Error & Test Error
Training Error
• In order to assess the performance of a method, we must compare the predictions with the observed data.
• A common measure of accuracy is the Mean Squared Error (MSE), i.e.,

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(\mathrm{Actual}_i - \mathrm{Predicted}_i\right)^2,$$

where n is the number of observations.
• The MSE above is computed on the training data that was used to fit the model.
• This is referred to as the training MSE.
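As a concrete illustration, the training MSE can be computed directly from a fitted model's fitted values. A minimal R sketch for Model (1) on the Auto data, assuming the ISLR package:

```r
library(ISLR)

fit <- lm(mpg ~ horsepower, data = Auto)        # fit on the full (training) data
train_mse <- mean((Auto$mpg - fitted(fit))^2)   # average squared training error
train_mse
```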
Test Error
• Our method has generally been designed to make the MSE small on the training data; e.g., in linear regression, we obtain the regression line such that the training MSE is minimized.
• What really matters is how well the method works on new data.
• We call this new data “Test Data”.
• There is no guarantee that the method with the smallest training 𝑀𝑆𝐸 will have
the smallest test (i.e. new data) 𝑀𝑆𝐸.
Estimation of Test Error
• We here consider a class of methods that estimate the test error rate by holding out a subset of the training observations from the fitting process, and then applying the method to those held-out observations.
• This approach is more formally known as cross-validation.
• We now discuss three popular approaches to cross-validation:
  – Validation Set Approach
  – Leave-One-Out Cross-Validation
  – k-Fold Cross-Validation
Validation Set Approach
Validation Set Approach

[Diagram: the observations are randomly split into a Training Data set and a Testing (Validation) Data set.]
Validation Set Approach
• If we have a large data set, we randomly split the data into training and validation (testing) parts.
• We use the training data to fit each possible model, and then choose the model that gives the lowest error rate when applied to the validation data.
• For a quantitative response, the validation (test) error rate is typically assessed using the MSE.
Models for Auto Data
• Consider 10 possible models:
  Model (1): Predictor: horsepower
  Model (2): Predictors: horsepower and horsepower²
  ⋮
  Model (10): Predictors: horsepower, horsepower², …, horsepower¹⁰
Validation Set Approach for Auto Data
1. Randomly split the Auto data set into a training data set (196 obs.) and a validation (testing) data set (196 obs.).
2. Fit all the candidate models (Models (1) to (10)) using the training data set.
3. Use the fitted models to predict mpg for the validation data set.
4. The model with the lowest validation (testing) MSE is the winner! (A sketch of these steps is given below.)
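A minimal R sketch of these four steps, assuming the Auto data frame from the ISLR package and using the polynomial degree as the model index (the seed is an arbitrary choice for reproducibility):

```r
library(ISLR)
set.seed(1)                                  # make the random split reproducible

train <- sample(nrow(Auto), 196)             # indices of the 196 training observations

val_mse <- sapply(1:10, function(d) {
  fit  <- lm(mpg ~ poly(horsepower, d), data = Auto, subset = train)
  pred <- predict(fit, Auto[-train, ])       # predictions for the held-out half
  mean((Auto$mpg[-train] - pred)^2)          # validation MSE for Model (d)
})

which.min(val_mse)                           # model with the lowest validation MSE
```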
Auto Data
• Left: validation error rate for a single split into training and validation data sets.
• Right: the validation method repeated 10 times, with a different random split each time.
Validation Set Approach: Advantages
• Conceptually simple
• Easy to implement
Validation Set Approach: Disadvantages
• The validation MSE can be highly variable (see the right-hand panel of the figure in the previous slide).
• Only a subset of the observations (the training data) is used to fit the model. Statistical methods tend to perform worse when trained on fewer observations.
Leave-One-Out Cross-Validation
Leave-One-Out Cross-Validation (LOOCV)
LOOCV
1. Split the data set of size n into a training data set of size n − 1 and a validation data set of size 1.
2. Fit the model using the training data.
3. Validate the model using the validation data and compute the corresponding MSE.
4. Repeat this process n times to obtain $MSE_1, \ldots, MSE_n$.
5. The MSE for the model is computed as $CV_{(n)} = \frac{1}{n}\sum_{i=1}^{n} MSE_i$. (See the sketch below.)
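A minimal R sketch of LOOCV for the ten Auto models, using glm() together with cv.glm() from the boot package (this mirrors the lab in the reading material; leaving K at its default of n gives LOOCV):

```r
library(ISLR)   # Auto data frame
library(boot)   # cv.glm()

loocv_mse <- sapply(1:10, function(d) {
  fit <- glm(mpg ~ poly(horsepower, d), data = Auto)   # default family gives an lm-style fit
  cv.glm(Auto, fit)$delta[1]                           # K defaults to n, i.e. LOOCV
})

loocv_mse   # CV_(n) for Models (1) to (10)
```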
LOOCV: Advantages
LOOCV has less bias
• We repeatedly fit the statistical learning method using training data
that contains 𝑛 − 1 obs., i.e. almost all the data set is used.
LOOCV produces a less variable MSE estimate
• The validation set approach produces different MSE values when applied repeatedly due to randomness in the splitting process, while performing LOOCV multiple times will always yield the same results, because there is no randomness in the splits: every observation is left out exactly once.
LOOCV: Disadvantages
• LOOCV is computationally intensive: we fit each model n times!
𝐾-fold Cross Validation
𝐾-fold Cross Validation
𝐾-fold Cross Validation
1. We divide the data set into K different parts (e.g., K = 5 or K = 10).
2. We remove the first part, fit the model on the remaining K − 1 parts, and compute the MSE on the held-out first part.
3. We repeat this K times, taking out a different part each time.
4. The K-fold CV error is given by $CV_{(K)} = \frac{1}{K}\sum_{i=1}^{K} MSE_i$. (See the sketch below.)
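A minimal R sketch of 10-fold CV for the same ten models, again using cv.glm() from the boot package (K = 10 and the seed are illustrative choices):

```r
library(ISLR)
library(boot)

set.seed(17)   # the folds are chosen at random, so fix the seed for reproducibility

kfold_mse <- sapply(1:10, function(d) {
  fit <- glm(mpg ~ poly(horsepower, d), data = Auto)
  cv.glm(Auto, fit, K = 10)$delta[1]   # 10-fold CV estimate of the test MSE
})

which.min(kfold_mse)   # model with the lowest 10-fold CV error
```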
Auto Data: LOOCV & K-fold CV
• LOOCV is a special case of K-fold CV, with K = n.
• Both are stable, but LOOCV is more computationally intensive!
Homework
• Obtain the cross-validation error rates of all 7 models for the Advertising data
set.
Reading Material
• James, G., Witten, D., Hastie, T. & Tibshirani, R. (2021). An Introduction to
Statistical Learning: with Applications in R. New York: Springer-Verlag.
Chapter 2: Sub-section 2.2.1
Chapter 5: Section 5.1, Sub-sections 5.1.1, 5.1.2, 5.1.3, 5.3.1, 5.3.2, 5.3.3.
