
Reference guide: Validation and cross-validation
Earlier in this course, you learned that using your model to predict on data that wasn’t used to
train the model is an important part of the model development process known as validation.
You learned about validation using a separate holdout dataset; you also learned about cross-
validation. Model validation is one of the most important parts of predictive modeling, a
process that any data professional must understand. This reading is meant to serve as a
reference guide: a collection of useful tools and processes, along with tips on how to use them
to perform model validation.

A note about validation and hyperparameter tuning


It’s important to remember that even though validation and hyperparameter tuning are closely
related, they are two separate things. It’s possible to perform model validation without tuning
hyperparameters, and it’s also possible to tune hyperparameters without performing
validation. Most often, however, both steps are undertaken during the model development
process.

Import statements
The following are some of the most commonly used tools related to validation and cross-
validation using scikit-learn.

from sklearn.model_selection import train_test_split


● This function is used to split data. It can be used as many times as needed to achieve
the desired sets. For example, you could split the dataset 80/20 (train/test), then use
the function again on the train set, splitting it 75/25 (train/validate). This would result in
a final ratio of 60/20/20 (train, validate, test).
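
For instance, here is a minimal sketch of the two-step split described above; the
make_classification dataset and the variable names are illustrative assumptions, not part of
the course's code:

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Illustrative data; in practice, X and y would be your own features and labels
X, y = make_classification(n_samples=1000, random_state=0)

# First split: 80% train, 20% test
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Second split: 75/25 on the training data (0.25 * 0.80 = 0.20 of the full dataset),
# giving a final 60/20/20 train/validate/test ratio
X_tr, X_val, y_tr, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=0)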

from sklearn.model_selection import GridSearchCV


● GridSearchCV is a class. You use it to create a GridSearchCV object. When you call the
fit() method on that object, it partitions the data into a user-specified number of folds
and, for each combination of hyperparameters you specify, fits a model to the non-holdout
folds (all folds except one) and evaluates it against the holdout fold, rotating the holdout
fold until each fold has been used once. Scores on each fold and a mean final score are
captured for inspection.
● This is a very useful tool for cross-validation, and it can also be used to tune
hyperparameters with a single holdout validation set.

from sklearn.model_selection import PredefinedSplit


● PredefinedSplit is a class that allows you to specify which rows of a dataset to
hold out as validation data. Among other things, it’s useful for tuning hyperparameters
using a single holdout validation set.
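
For instance, here is a minimal standalone sketch of how PredefinedSplit interprets a list of
fold indicators (the four-row list here is purely illustrative):

from sklearn.model_selection import PredefinedSplit

# -1 means the row is always used for training; 0 means it is held out for validation
test_fold = [-1, 0, 0, -1]
custom_split = PredefinedSplit(test_fold)

for train_idx, val_idx in custom_split.split():
    print(train_idx, val_idx)   # prints: [0 3] [1 2]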

Cross-validate/tune hyperparameters with GridSearchCV


Strengths:
● Provides a rigorous estimation of model performance
● More thorough than tuning hyperparameters with a separate holdout dataset
● Good for maximizing utility of limited amounts of data

Weaknesses:
● More time consuming and computationally expensive than tuning against a holdout
validation set

Here are the steps to cross-validate using GridSearchCV. Note that you can cross-validate
without tuning hyperparameters. In that case, instead of indicating multiple values of each
hyperparameter to search over (in step 2 below), just enter the single value that you want to
use for each hyperparameter.
1. Instantiate the model (set the random_state parameter if you want reproducible
results).
2. Create a dictionary of hyperparameters to search over.
3. Create a set of scoring metrics to capture.
4. Instantiate the GridSearchCV object. Pass as arguments:
○ estimator = the model from step 1
○ param_grid = the dictionary of hyperparameters to search over from step 2
○ scoring = the set of scoring metrics you want to capture
○ cv = the number of cross-validation folds you want to use
○ refit = the scoring metric that you want GridSearchCV to use when it
selects the "best" model (i.e., the model that performs best on average over all
validation folds). When it’s done, GridSearchCV will refit the best-scoring
model to all of the data you give it in the step below.
5. Fit the GridSearchCV object to the data (X, y).

Example:
from sklearn.ensemble import RandomForestClassifier

rf = RandomForestClassifier(random_state=0)

cv_params = {'max_depth': [2, 3, 4, 5, None],
             'min_samples_leaf': [1, 2, 3],
             'min_samples_split': [2, 3, 4],
             'max_features': [2, 3, 4],
             'n_estimators': [75, 100, 125, 150]
             }

scoring = {'accuracy', 'precision', 'recall', 'f1'}

rf_cv = GridSearchCV(estimator=rf, param_grid=cv_params, scoring=scoring,
                     cv=5, refit='f1')

rf_cv.fit(X_train, y_train)
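
After fitting, the captured scores can be inspected through the fitted object's attributes. Here
is a short sketch, assuming the rf_cv object from the example above (the pandas import is
only needed to view the full results table):

import pandas as pd

# Best hyperparameter combination and its mean validation score on the refit metric (f1)
print(rf_cv.best_params_)
print(rf_cv.best_score_)

# Per-fold and mean scores for every metric and hyperparameter combination
results = pd.DataFrame(rf_cv.cv_results_)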

Use GridSearchCV and PredefinedSplit to tune hyperparameters on a separate validation set

Strengths:
● Faster and less computationally expensive than a multi-fold (k-fold) cross-validation
● Allows you to choose exactly which samples to include in the validation set (for
example, if one of your features is “year,” you might want to ensure that an equal
number of samples from each year is represented in the validation set)

Weaknesses:
● Less rigorous than a k-fold cross-validation
● Not as efficient with data usage (works best with very large datasets)

If you want to tune a model’s hyperparameters using a separate validation set, one way to do
so is by designating which rows of your training data you want to use as your validation set.
Here is one way of doing it:

1. Use train_test_split to separate your data into training and testing data.
Example:
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25,
stratify=y, random_state=42)

2. Use train_test_split again to separate your training data into training and validation
data.
Example:
X_tr, X_val, y_tr, y_val = train_test_split(X_train, y_train, test_size=0.2,
stratify=y_train, random_state=42)

3. Use a list comprehension to make a list of length len(X_train) where each element is
either 0 or -1. A 0 at index i indicates that row i of X_train is to be held out for
validation; a -1 indicates that the row is to be used as training data.

The list comprehension checks the index of each row in X_train: if that index is among
the validation set's indices, it adds a 0 to the list; otherwise, it adds a -1. If the
training data is:
[A, B, C, D],
and your list is:
[-1, 0, 0, -1],
then your training set will contain [A, D] and your validation set will contain [B, C].
Example:
split_index = [0 if x in X_val.index else -1 for x in X_train.index]

4. Pass this list as a parameter to PredefinedSplit and assign the result to a variable.
Example:
custom_split = PredefinedSplit(split_index)

5. Designate this variable as the cv parameter when you instantiate your GridSearchCV
object.
Example:
grid_search = GridSearchCV(estimator=rf, param_grid=cv_params,
scoring=scoring, cv=custom_split, refit='f1')
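
Once the GridSearchCV object is instantiated, fit it to X_train and y_train (the rows flagged
-1 and 0 together); during the search, only the rows flagged -1 are used for fitting, and the
rows flagged 0 are used for scoring. A sketch, continuing with the names from the steps above:
Example:
grid_search.fit(X_train, y_train)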

Selection of a champion model with a separate validation set


“Validation” can also refer to the process of choosing a champion model. Note that this is a
related but (usually) distinct concept from hyperparameter tuning. In most cases, if you’re
tuning hyperparameters of different model architectures (e.g., logistic regression, decision
tree, and random forest) and then selecting one of these architectures as a champion, it’s
worthwhile to perform cross-validation first to tune, and validation later with a separate
validation set to select the champion model. The cross-validation is performed using the
training data, and it’s done to tune the hyperparameters of a particular model architecture.
The data held out for validation is then used to compare the tuned model of each different
architecture to get an objective comparison of their performance. It ensures that the model
you choose as the champion model indeed generalizes well and does not simply overfit the
training data.
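
As an illustration, here is a hedged sketch of that comparison step. The names tuned_lr,
tuned_dt, and tuned_rf are hypothetical stand-ins for each architecture's tuned model (for
example, the best_estimator_ from its GridSearchCV), X_val and y_val are the held-out
validation set, and F1 is used as the comparison metric only as an example:

from sklearn.metrics import f1_score

# Hypothetical tuned candidates, e.g., each architecture's best_estimator_
candidates = {'logistic regression': tuned_lr,
              'decision tree': tuned_dt,
              'random forest': tuned_rf
              }

# Score every tuned model on the same held-out validation set
val_scores = {name: f1_score(y_val, model.predict(X_val))
              for name, model in candidates.items()}

# The champion is the architecture with the highest validation score
champion_name = max(val_scores, key=val_scores.get)
champion = candidates[champion_name]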
After validating

When performing hyperparameter tuning, GridSearchCV will automatically refit the model
with the best hyperparameters on all of the training data. However, if you have a holdout
validation set to compare different model architectures, once you’ve selected a champion
model, go back and train it on the training data + validation data together. Then use that model
(and no other) to predict on the test data to get a measure of future performance. If you then
deploy the model, you might want to finally retrain it using the full dataset (train + validate +
test) so it can learn from as much data as possible before being deployed.
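
Here is a minimal sketch of that final refit, assuming pandas data and reusing the illustrative
names from the examples above (X_tr, X_val, y_tr, y_val from the split steps, champion for the
selected model):

import pandas as pd
from sklearn.metrics import f1_score

# Recombine the training and validation data for the final fit
X_train_full = pd.concat([X_tr, X_val])
y_train_full = pd.concat([y_tr, y_val])

# Refit the champion model on train + validation data, then score it once on the test set
champion.fit(X_train_full, y_train_full)
test_f1 = f1_score(y_test, champion.predict(X_test))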

Key takeaways

There are different ways of validating machine learning models, and each way can be
executed using different workflows, techniques, functions, and coding approaches. The
methods demonstrated in this course are just some of them. What’s important is that you
understand that validation is performed to help prevent overfitting models to the training data
and to provide a meaningful way of comparing different models to one another. It’s also
important to understand the strengths and weaknesses of different approaches so you’re
better equipped to make these decisions yourself. Be inquisitive and try different approaches
on your own!

Resources for more information

More detailed information about validation and cross-validation can be found here:
● scikit-learn documentation:
○ scikit-learn cross-validation documentation
● developers.google.com:
○ Validation Sets
