13.2 Error terms
There’s going to be a difference between the line that we fit and the observation we get.
- Error: theoretical, difference between the actual outcome and prediction we’d make if we had infinite observations to estimate our prediction (true best-fit line)
- Residual: observed, difference between the actual outcome and the prediction
- error contains everything that causes Y that is not included in the model
- if our model is Y=β0+β1X+ϵ, then what’s included in epsilon?