22 Arrow

Learning objectives:

  • Using the arrow package to load in large data files efficiently
  • Partitioning large data files into parquet files for quicker access, less memory usage, and quicker wrangling
  • Wrangling with data in the arrow data format or parquet format using existing dplyr() operations