Backward Stepwise Subset Selection (BsSS)
backward stepwise selection provides an efficient alternative to best subset selection.
It’s begins with the full least squares model containing all p predictors, and then iteratively removes the least useful predictor, one-at a-time
- Make sure that n>p
- Let Mp denote the full model with all p predictors
- For k=p,p−1,...,1:
- Consider all k models that result in dropping a single predictor from Mk (thus containing k−1 predictors)
- Choose the best among these k models, and christen it Mk−1
- Select the model among M0,...,Mk that minimizes validation error (or some estimate of it)