13.1 Basics

Regression is the most common way in which we fit a line to explain variation.

  • also used for causal effects: closing back doors (controlling)
  • use the values of one variable (X) to predict the values of another (Y)
  • one way: fit a line that describes the relationship
  • interpretation of coefficient: slope
  • plugging prediction in, we get prediction: ˆY
  • difference between Y and ˆY is the residual
  • we can make the line curvy by adding polynomials (i.e. β1X+β2X2)