Lasso and Ridge Regression
Lasso and Ridge Regression
Shrinkage (Regularization)
Involves shrinking the estimated coefficients toward zero relative to the OLS
estimates; has the effect of reducing variance and performs variable selection.
Methods: ridge regression, lasso
Dimension Reduction
Involves projecting the p predictors into a M-dimensional subspace, where M
< p, and fit the linear regression model using the M projections as predictors.
Methods: principal components regression, partial least squares
Best Subset Selection
The RSS (R2) will always decline (increase) as the number of
predictors included in the model increases, so they are not
very useful statistics for selecting the best model.
The red line tracks the best model for a given number of
predictors, according to RSS and R2
Best Subset Selection
While best subset selection is a simple and conceptually
appealing approach, it suffers from computational
limitations.
The larger the search space, the higher the chance of finding
models that look good on the training data, even though
they might not have any predictive power on future data.
Thus, RSS and R2 are not suitable for selecting the best
model among a collection of models with different numbers
of predictors.
Estimating Test Error
1. We can indirectly estimate test error by making an
adjustment to the training error to account for the
bias due to overfitting.
These techniques adjust the training error for the model size,
and can be used to select among a set of models with
different numbers of variables.
Increased λ leads to
increased bias but
decreased variance
Ridge Regression
In general, the ridge
regression estimates will be
more biased than the OLS
ones but have lower
variance.
With ridge regression, for any given λ we only need to fit one
model and the computations turn out to be very simple.
Ridge
Lasso Regression
Lasso vs. Ridge Regression
The lasso has a major advantage over ridge regression, in
that it produces simpler and more interpretable models
that involved only a subset of predictors.