Choosing the best model
You have to punish models for having too many predictors
Whatever the method, RSS decreases / R2 increases as we go from Mk to Mk+1. Thus, Mp always wins that contest.
Going with Mp doesn’t provide either of the benefits: model interpretability and variance reduction (overfitting)
We’ll need to estimate test error!