8.5 Tree-building process (regression)

  1. Divide the predictor space — that is, the set of possible values for \(X_1,X_2, . . . ,X_p\) — into \(J\) distinct and non-overlapping regions, \(R_1,R_2, . . . ,R_J\)
  • Regions can have ANY shape - they don’t have to be boxes
  1. For every observation that falls into the region \(R_j\), we make the same prediction: the mean of the response values in \(R_j\)

  2. The goal is to find regions (here boxes) \(R_1, . . . ,R_J\) that minimize the \(RSS\), given by

\[\mathrm{RSS}=\sum_{j=1}^{J}\sum_{i{\in}R_j}^{}(y_i - \hat{y}_{R_j})^2\]

where \(\hat{y}_{R_j}\) is the mean response for the training observations within the \(j\)th box

  • Unfortunately, it is computationally infeasible to consider every possible partition of the feature space into \(J\) boxes.