16.2 Why do dimensionality reduction?

  • Visualisation and exploratory data analysis: understand the structure of your data
  • Avoid having too many predictors, which can improve model performance
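    • Linear regression: the number of predictors should be less than the number of data points, otherwise the least-squares solution is not unique
    • Multicollinearity: predictor variables that are highly correlated with each other make the coefficient estimates unstable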
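
The two problems with too many predictors can be seen on a small synthetic example. The sketch below is only illustrative: the data are randomly generated, and the names (X_wide, X_collinear, X_independent) and sizes are assumptions made for the demonstration, not part of the original notes. It checks the rank of a design matrix with more predictors than data points, and compares the condition number of a design matrix with two nearly identical predictors against one with independent predictors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Case 1: more predictors (p) than data points (n).
# The rank of the design matrix cannot exceed n, so it is below p
# and the least-squares coefficients are not uniquely determined.
n_wide, p_wide = 5, 8
X_wide = rng.normal(size=(n_wide, p_wide))
rank_wide = np.linalg.matrix_rank(X_wide)

# Case 2: multicollinearity.
# x2 is almost a copy of x1; x3 is drawn independently of x1.
n = 200
x1 = rng.normal(size=n)
x2 = x1 + 1e-3 * rng.normal(size=n)
x3 = rng.normal(size=n)
X_collinear = np.column_stack([x1, x2])
X_independent = np.column_stack([x1, x3])

# A large condition number means small changes in the data can
# produce large changes in the fitted coefficients.
cond_collinear = np.linalg.cond(X_collinear)
cond_independent = np.linalg.cond(X_independent)

print("rank with p > n:", rank_wide, "of", p_wide, "predictors")
print("condition number, correlated predictors:", cond_collinear)
print("condition number, independent predictors:", cond_independent)
```

Reducing the predictors to a smaller set of uncorrelated components is one way to address both issues at once.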