Forward Stepwise Subset Selection (FsSS)
- Let M0 denote the null model (no predictors)
- For k=1,...,p:
- Fit all p−(k−1) predictors not in model Mk−1
- Select the predictor that raises R2 the most and add it to model Mk−1 to create model Mk
- Select the model among M0,...,Mk that minimizes validation error (or some estimate of it)
- When p=20, best subset selection requires fitting 1,048,576 models, whereas forward stepwise selection requires fitting only 211 models.