Training, Validation, and Test Sets: 2019 Philipp Krähenbühl and Chao-Yuan Wu
Training, Validation, and Test Sets: 2019 Philipp Krähenbühl and Chao-Yuan Wu
sets
ⓒ 2019 Philipp Krähenbühl and Chao-Yuan Wu
Dataset
• Training set
• Validation set
• Learn hyper-parameters
• Test set
• Measure generalization
performance
Why split the data?
• Overfitting
• Optimization objective:
Learn a model that
works well in training
data
Training set
• Used to measure
performance of model
on unseen data
• Random sampling
without replacement
Distribution of data
Low dimensions High dimensions
automated