Model Selection R Chap 4
Model Selection R Chap 4
• Given the data set with many potential predictors we need to decide which
ones to include in out model and which ones to leave out.
• Statistical algorithms may be used to find the best set of predictors.
Common selection methods are:
• Best Subsets (All possible models)
• Forward Selection (Automatic procedure)
• Backward Elimination (Automatic procedure)
• Stepwise Selection (Automatic procedure)
Best Subsets
To consider all possible models is time consuming unless there are only a
small number of models because there are 2p possible linear regression
models and we require procedures for choosing one (or a small number) of
them.
Still difficult to choose “best” model as lots of test results will be available
giving conflicting information.
Can select the best models based on Adjusted R2, Mallows Cp, AIC or BIC.