8.56 Example
Here, we apply bagging and random forests to the Carseats
data set. We will be using the randomForest package as the engine. A bagging model is the same as a random forest where mtry is equal to the number of predictors. We can specify the mtry
to be .cols() which means that the number of columns in the predictor matrix is used. This is useful if you want to make the specification more general and usable to many different data sets. .cols() is one of many descriptors
in the parsnip
package. We also set importance = TRUE
in set_engine() to tell the engine to save the information regarding variable importance. This is needed for this engine if we want to use the vip
package later.
For a more detailed explanation of bagging
and its counterpart boosting
, read this link
<- rand_forest(mtry = .cols()) %>%
bagging_spec set_engine("randomForest", importance = TRUE) %>%
set_mode("regression")
Fit the model.
<- fit(bagging_spec, Sales ~ ., data = carseats_train) bagging_fit