4.2 Prediction using Regression

Caution: Be careful when extrapolating predictions beyond the range of the data used to fit the model.

A prediction interval (uncertainty around a single value) combines two sources of uncertainty:

  • Confidence Intervals (Uncertainty around a statistic)

  • Individual data point error
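The difference between the two intervals can be sketched with base R's `predict()` on a fitted `lm` object. This is a minimal illustration on simulated data, not the housing dataset used in this section: the data frame `sim`, its columns `lot_area` and `sale_price`, and the coefficients used to simulate them are all assumptions made up for the example.

```r
# Simulated data (hypothetical; not the housing data used in this section)
set.seed(42)
n <- 200
lot_area <- runif(n, min = 5000, max = 15000)
sale_price <- 50000 + 10 * lot_area + rnorm(n, sd = 30000)
sim <- data.frame(lot_area = lot_area, sale_price = sale_price)

# Simple linear regression of price on lot area
fit <- lm(sale_price ~ lot_area, data = sim)

# A new property to predict for
new_home <- data.frame(lot_area = 10500)

# Guard against extrapolation: is the new value inside the observed range?
in_range <- new_home$lot_area >= min(sim$lot_area) &
  new_home$lot_area <= max(sim$lot_area)

# Confidence interval: uncertainty around the mean price at this lot area
conf_int <- predict(fit, newdata = new_home,
                    interval = "confidence", level = 0.95)

# Prediction interval: uncertainty around one individual sale price
pred_int <- predict(fit, newdata = new_home,
                    interval = "prediction", level = 0.95)

conf_int
pred_int
```

Both calls return the same fitted value, but the prediction interval is wider because it adds the individual data point error on top of the uncertainty in the estimated mean.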

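Here is an example of individual data point error. If we filter for properties with 4 bedrooms, 3 bathrooms, and a lot area between 10,000 and 11,000 square feet, the sale price still ranges from $100,000 to $218,500, a spread of more than $100k. Properties that look identical on these predictors sell for very different prices. This variation is error that a model built on these predictors cannot explain.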

# Keep near-identical properties and list their sale prices, highest first
dat %>%
  dplyr::filter(Bedroom_AbvGr == 4 &
                  bath == 3 &
                  Lot_Area >= 10000 &
                  Lot_Area < 11000) %>%
  dplyr::select(Sale_Price) %>%
  dplyr::arrange(dplyr::desc(Sale_Price))
## # A tibble: 9 × 1
##   Sale_Price
##        <int>
## 1     218500
## 2     211000
## 3     170000
## 4     165150
## 5     157000
## 6     139000
## 7     127500
## 8     103500
## 9     100000
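To put a number on this spread, the nine sale prices printed above can be summarised directly. This is a small sketch that retypes the values from the output into a vector; the name `prices` is introduced here for illustration only.

```r
# The nine sale prices from the filtered output above
prices <- c(218500, 211000, 170000, 165150, 157000,
            139000, 127500, 103500, 100000)

# Lowest and highest sale price among these similar properties
price_range <- range(prices)

# Gap between the most and least expensive property
price_spread <- diff(price_range)
price_spread  # 118500

# Average price and standard deviation around it
price_mean <- mean(prices)
price_sd <- sd(prices)
```

The standard deviation is a rough guide to how far an individual sale can sit from the average of comparable properties, which is the kind of error a prediction interval has to account for.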