8.7 Recursive binary splitting (continued)
We first select the predictor \(X_j\) and the cutpoint \(s\) such that splitting the predictor space into the regions \({\{X|X_j<s\}}\) and \({\{X|X_j{\ge}s}\}\) leads to the greatest possible reduction in RSS
Repeat the process looking for the best predictor and best cutpoint to split data further (i.e., split one of the 2 previously identified regions - not the entire predictor space) minimizing the RSS within each of the resulting regions
Continue until a stopping criterion is reached, e.g., no region contains more than five observations
Again, we predict the response for a given test observation using the mean of the training observations in the region to which that test observation belongs