Seattle library CSV to Parquet

  • dplyr::group_by() to define partitions
  • arrow::write_dataset() to save as Parquet, one directory per partition
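
library(dplyr)
library(arrow)
# seattle_csv is assumed to be defined earlier (for example via arrow::open_dataset() on the CSV)
pq_path <- "data/seattle-library-checkouts"
seattle_csv |>
  group_by(CheckoutYear) |>                          # grouping variable becomes the partition key
  write_dataset(path = pq_path, format = "parquet")  # writes the partitioned Parquet files under pq_path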
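
To see what the grouping does on disk, here is a minimal sketch on toy data rather than the real Seattle file. The data frame `toy`, the path `toy_path`, and the column values are hypothetical stand-ins; only `group_by()`, `write_dataset()`, and `open_dataset()` come from the steps above.

```r
library(dplyr)
library(arrow)

# Hypothetical toy table standing in for the Seattle checkouts data
toy <- data.frame(
  CheckoutYear = c(2020L, 2020L, 2021L, 2022L),
  Title        = c("A", "B", "C", "D"),
  Checkouts    = c(3L, 1L, 5L, 2L)
)

# Write to a temporary directory so nothing in the project is touched
toy_path <- file.path(tempdir(), "toy-checkouts")

toy |>
  group_by(CheckoutYear) |>
  write_dataset(path = toy_path, format = "parquet")

# Each distinct CheckoutYear should get its own key=value subdirectory
list.files(toy_path, recursive = TRUE)

# Reopen the partitioned dataset and pull a single year back into R
checkouts_2021 <- open_dataset(toy_path) |>
  filter(CheckoutYear == 2021) |>
  collect()
```

Because the year is encoded in the directory name, a filter on `CheckoutYear` lets Arrow skip the other partitions entirely, which is the main payoff of grouping before writing.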