8.5 Tree-building process (regression)

  1. Divide the predictor space — that is, the set of possible values for X1,X2,...,Xp — into J distinct and non-overlapping regions, R1,R2,...,RJ
  • Regions can have ANY shape - they don’t have to be boxes
  1. For every observation that falls into the region Rj, we make the same prediction: the mean of the response values in Rj

  2. The goal is to find regions (here boxes) R1,...,RJ that minimize the RSS, given by

RSS=Jj=1iRj(yiˆyRj)2

where ˆyRj is the mean response for the training observations within the jth box

  • Unfortunately, it is computationally infeasible to consider every possible partition of the feature space into J boxes.