8.5 Tree-building process (regression)
- Divide the predictor space (the set of possible values for $X_1, X_2, \dots, X_p$) into $J$ distinct and non-overlapping regions, $R_1, R_2, \dots, R_J$
- In theory, regions can have any shape; in practice we restrict them to high-dimensional rectangles (boxes), which are simpler and easier to interpret
For every observation that falls into the region $R_j$, we make the same prediction: the mean of the response values in $R_j$
The goal is to find regions (here boxes) $R_1, \dots, R_J$ that minimize the RSS, given by
$$\mathrm{RSS} = \sum_{j=1}^{J} \sum_{i \in R_j} \left(y_i - \hat{y}_{R_j}\right)^2$$
where $\hat{y}_{R_j}$ is the mean response for the training observations within the $j$th box
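
To make the RSS criterion concrete, here is a minimal sketch that evaluates it for one fixed partition. The function name `partition_rss`, the region labels, and the toy data are assumptions made up for illustration; they do not come from the text.

```python
from statistics import mean

def partition_rss(y, region_ids):
    """RSS of a partition: squared deviations from each region's mean, summed over regions."""
    # Group the responses by the (hypothetical) region label of each observation.
    groups = {}
    for yi, region in zip(y, region_ids):
        groups.setdefault(region, []).append(yi)

    rss = 0.0
    for values in groups.values():
        y_hat = mean(values)  # the single prediction shared by the whole region
        rss += sum((yi - y_hat) ** 2 for yi in values)
    return rss

# Toy data (invented): one predictor, two boxes separated at x = 3.
x = [1.0, 2.0, 4.0, 5.0]
y = [1.0, 3.0, 10.0, 12.0]
regions = [0 if xi < 3 else 1 for xi in x]

rss_two_boxes = partition_rss(y, regions)   # region means are 2 and 11
rss_one_box = partition_rss(y, [0] * 4)     # a single region with mean 6.5
```

With these numbers the two-box partition has a much smaller RSS than the single box, which is the sense in which one partition is "better" than another.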
- Unfortunately, it is computationally infeasible to consider every possible partition of the feature space into J boxes.
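
A rough sense of why exhaustive search is infeasible, as a sketch under a strong simplifying assumption: a single predictor with distinct sorted values, where each box is an interval. Cutting $n$ sorted points into $J$ non-empty intervals means choosing $J-1$ of the $n-1$ gaps. The helper `count_interval_partitions` is hypothetical, and the count only gets larger with several predictors.

```python
from math import comb

def count_interval_partitions(n_points, n_regions):
    # Choose n_regions - 1 cut positions among the n_points - 1 gaps
    # between consecutive sorted observations.
    return comb(n_points - 1, n_regions - 1)

# Number of candidate partitions into 5 intervals as the sample size grows.
counts = {n: count_interval_partitions(n, 5) for n in (10, 100, 1000)}
for n, c in counts.items():
    print(n, c)
```

Even in this one-dimensional case, every candidate would need its own RSS evaluation, so the work grows quickly with the number of observations and regions.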