ECS171: Machine Learning: Lecture 13: Validation, Model Selection
Cho-Jui Hsieh
UC Davis
Variance:
$\mathrm{var}[e(h(x), y)] = \sigma^2$
Validation error
So we (roughly) have
$E_{val}(h) = E_{out}(h) \pm \underbrace{O\left(\frac{1}{\sqrt{K}}\right)}_{\text{standard deviation}}$
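The $1/\sqrt{K}$ scaling can be checked numerically. The sketch below is an illustration only: it assumes a fixed hypothesis whose pointwise error $e(h(x), y)$ is a 0/1 variable with mean $E_{out} = 0.3$ (a made-up value), and all function names are hypothetical.

```python
import random
import statistics

def validation_error(k, e_out, rng):
    """E_val on one validation set: the average of k i.i.d. 0/1 pointwise errors."""
    return sum(rng.random() < e_out for _ in range(k)) / k

def spread_of_eval(k, e_out=0.3, trials=2000, seed=0):
    """Empirical standard deviation of E_val over many validation sets of size k.

    e_out = 0.3 is an assumed value chosen only for illustration.
    """
    rng = random.Random(seed)
    samples = [validation_error(k, e_out, rng) for _ in range(trials)]
    return statistics.pstdev(samples)

if __name__ == "__main__":
    for k in (25, 100, 400):
        print(k, round(spread_of_eval(k), 4))
```

Under these assumptions the spread should be close to $\sigma/\sqrt{K}$ with $\sigma^2 = 0.3 \cdot 0.7$, so quadrupling $K$ roughly halves it.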
Validation is taken out of training set
$D \longrightarrow \underbrace{D_{train}}_{N-K} \cup \underbrace{D_{val}}_{K}$
$D_{train} \Longrightarrow g^-$
Validation error: $E_{val} = E_{val}(g^-)$
Final model: $D \Longrightarrow g$
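The split-train-validate-retrain procedure above can be sketched as follows. This is a minimal illustration, assuming a 1-D least-squares line as the learner and squared error as the error measure; neither choice comes from the lecture, and `make_data` is a hypothetical toy generator.

```python
import random

def fit_line(points):
    """Least-squares fit of y = a*x + b to a list of (x, y) pairs."""
    n = len(points)
    mx = sum(x for x, _ in points) / n
    my = sum(y for _, y in points) / n
    sxx = sum((x - mx) ** 2 for x, _ in points)
    sxy = sum((x - mx) * (y - my) for x, y in points)
    a = sxy / sxx
    return a, my - a * mx

def squared_error(model, points):
    """Mean squared error of the line (a, b) on the given points."""
    a, b = model
    return sum((a * x + b - y) ** 2 for x, y in points) / len(points)

def holdout_validation(data, k, seed=0):
    """Split D into D_val (K points) and D_train (N-K points).

    Returns (E_val(g^-), g): g^- is trained on D_train only,
    and the final model g is retrained on all of D.
    """
    shuffled = data[:]
    random.Random(seed).shuffle(shuffled)
    d_val, d_train = shuffled[:k], shuffled[k:]
    g_minus = fit_line(d_train)            # D_train => g^-
    e_val = squared_error(g_minus, d_val)  # E_val = E_val(g^-)
    g = fit_line(data)                     # D => g
    return e_val, g

def make_data(n, noise=0.5, seed=1):
    """Hypothetical toy target y = 2x + 1 plus Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1, 1) for _ in range(n)]
    return [(x, 2 * x + 1 + rng.gauss(0, noise)) for x in xs]
```

Calling `holdout_validation(make_data(200), k=40)` returns the validation error of $g^-$ together with the final line fitted on all 200 points.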
Model selection by validation
$D = D_{train} \cup D_{val}$
$M$ models $H_1, \cdots, H_M$
Use $D_{train}$ to learn $g_m^-$ for each model
Evaluate each $g_m^-$ using $D_{val}$:
$E_m = E_{val}(g_m^-), \quad m = 1, \cdots, M$
Pick model $m = m^*$ with smallest $E_m$
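The selection loop above can be sketched in a few lines. As an assumption for illustration, the $M$ candidate models here are k-nearest-neighbour regressors with different neighbour counts (not the lecture's hypothesis sets), and `make_data` is a hypothetical toy generator.

```python
import math
import random

def knn_predict(train, neighbors, x):
    """Average the targets of the `neighbors` training points nearest to x."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:neighbors]
    return sum(y for _, y in nearest) / len(nearest)

def validation_error(train, val, neighbors):
    """E_m = E_val(g_m^-): mean squared error on D_val of the model built from D_train."""
    return sum((knn_predict(train, neighbors, x) - y) ** 2 for x, y in val) / len(val)

def select_model(data, candidates, n_val, seed=0):
    """Return (m_star, errors): the candidate with the smallest E_m, and every E_m."""
    shuffled = data[:]
    random.Random(seed).shuffle(shuffled)
    d_val, d_train = shuffled[:n_val], shuffled[n_val:]
    errors = {m: validation_error(d_train, d_val, m) for m in candidates}
    m_star = min(errors, key=errors.get)
    return m_star, errors

def make_data(n, noise=0.3, seed=1):
    """Hypothetical toy target y = sin(x) plus Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(0, 2 * math.pi) for _ in range(n)]
    return [(x, math.sin(x) + rng.gauss(0, noise)) for x in xs]
```

For example, `select_model(make_data(200), [1, 5, 25, 150], n_val=50)` evaluates four candidates on the same 50 validation points and returns the one with the smallest validation error.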
The bias in validation
How much bias
For $M$ models $H_1, \cdots, H_M$
Assume $D_{val}$ is used for "training" on the finalist models
Selecting the best model from
The dilemma about K
$E_{out}(g) \underbrace{\approx}_{\text{small } K} E_{out}(g^-) \underbrace{\approx}_{\text{large } K} E_{val}(g^-)$
Leave one out
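Training: $D_n \longrightarrow g_n^-$, where $D_n$ is $D$ with the $n$-th point $(x_n, y_n)$ left out
$e_n = E_{val}(g_n^-) = e(g_n^-(x_n), y_n)$
Cross validation error: $E_{CV} = \frac{1}{N} \sum_{n=1}^{N} e_n$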
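The leave-one-out computation can be sketched directly from these definitions. The learner (`fit_constant`) and the error measure (`squared_error`) are hypothetical placeholders chosen to keep the example small; any learner that returns a callable would fit the same loop.

```python
def loo_cv_error(data, fit, pointwise_error):
    """E_CV = (1/N) * sum_n e_n, with e_n = e(g_n^-(x_n), y_n)."""
    errors = []
    for n, (x_n, y_n) in enumerate(data):
        d_n = data[:n] + data[n + 1:]   # D with the n-th point left out
        g_n_minus = fit(d_n)            # train on the remaining N-1 points
        errors.append(pointwise_error(g_n_minus(x_n), y_n))
    return sum(errors) / len(errors)

def fit_constant(points):
    """Hypothetical learner: always predict the mean training target."""
    mean_y = sum(y for _, y in points) / len(points)
    return lambda x: mean_y

def squared_error(prediction, y):
    """Pointwise squared error e(h(x), y)."""
    return (prediction - y) ** 2
```

On the three points `[(0, 1.0), (1, 2.0), (2, 3.0)]` with the constant learner, the individual errors are 2.25, 0 and 2.25, so the cross validation error is 1.5. Note that the learner is run $N$ times, once per left-out point.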
Illustration of cross validation
Model selection using CV
Leave more than one out
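Leaving more than one point out is commonly organized as V-fold cross validation: the data is partitioned into V folds, and each fold serves once as the validation set. The sketch below is written under that assumption; the name `v`, the interleaved fold assignment, and the placeholder learner are illustrative choices, not taken from the slides.

```python
def v_fold_cv_error(data, fit, pointwise_error, v):
    """Average validation error over v folds; each fold is held out once."""
    folds = [data[i::v] for i in range(v)]   # interleaved partition into v folds
    fold_errors = []
    for i, d_val in enumerate(folds):
        d_train = [p for j, fold in enumerate(folds) if j != i for p in fold]
        g_minus = fit(d_train)               # train on the other v-1 folds
        fold_errors.append(
            sum(pointwise_error(g_minus(x), y) for x, y in d_val) / len(d_val)
        )
    return sum(fold_errors) / v

def fit_constant(points):
    """Hypothetical learner: always predict the mean training target."""
    mean_y = sum(y for _, y in points) / len(points)
    return lambda x: mean_y

def squared_error(prediction, y):
    """Pointwise squared error e(h(x), y)."""
    return (prediction - y) ** 2
```

With `v` equal to the number of data points this reduces to leave-one-out; smaller `v` needs only `v` training runs instead of $N$.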
Logistic regression:
$\min_w \frac{1}{N} \sum_{n=1}^{N} \log\left(1 + e^{-y_n w^T x_n}\right) + \lambda w^T w$
Linear SVM:
$\min_w \frac{1}{N} \sum_{n=1}^{N} \max\left(1 - y_n w^T x_n, 0\right) + \lambda w^T w$
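Both objectives share the same shape: an average loss over the $N$ points plus the penalty $\lambda w^T w$, with $\lambda$ being the kind of parameter one would choose by validation. The sketch below only evaluates the two objectives for a given $w$; it does not minimize them. It assumes labels $y_n \in \{-1, +1\}$ and plain Python lists for vectors, and the function names are hypothetical.

```python
import math

def dot(w, x):
    """Inner product w^T x for two equal-length lists."""
    return sum(wi * xi for wi, xi in zip(w, x))

def logistic_objective(w, xs, ys, lam):
    """(1/N) * sum_n log(1 + exp(-y_n w^T x_n)) + lam * w^T w."""
    n = len(xs)
    loss = sum(math.log1p(math.exp(-y * dot(w, x))) for x, y in zip(xs, ys)) / n
    return loss + lam * dot(w, w)

def svm_objective(w, xs, ys, lam):
    """(1/N) * sum_n max(1 - y_n w^T x_n, 0) + lam * w^T w."""
    n = len(xs)
    loss = sum(max(1 - y * dot(w, x), 0) for x, y in zip(xs, ys)) / n
    return loss + lam * dot(w, w)
```

At $w = 0$ every point contributes $\log 2$ to the logistic loss and $1$ to the hinge loss, regardless of $\lambda$. The direct `math.exp` call can overflow for very large negative margins, which a production implementation would guard against.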
Classification/regression with cross validation
Questions?