Imputation

Imputation is the process of replacing a missing value with a substituted, “best guess” value. Imputation should be one of the first feature engineering steps you take as it will affect any downstream preprocessing.

  • Estimated statistic (mean, median, mode) <– avoid this method!

  • KNN

  • Tree-based

  • MICE (not available in tidymodels recipes)