16.6 Partial Least Squares (PLS)

Supervised: basically PCA, but makes use of the outcome variable
Tries to maximise variation in predictors, while also maximising the relationship between these components and the outcome

bean_rec_trained %>%
  step_pls(all_numeric_predictors(), outcome = "class", num_comp = 4) %>%
  plot_validation_results() + 
  ggtitle("Partial Least Squares")

The first two components are very similar to the first two PCA components, but the remaining components are different. Let’s look at the top features for each component:

bean_rec_trained %>%
  step_pls(all_numeric_predictors(), outcome = "class", num_comp = 4) %>%
  prep() %>% 
  plot_top_loadings(component_number <= 4, n = 5, type = "pls") + 
  scale_fill_brewer(palette = "Paired") +
  ggtitle("Partial Least Squares")

Solidity and roundness are the features behind PLS3.