Binary formats (2)

Parquet and ‘Apache Arrow IPC format’ (formerly called the Feather format) are column-based.

From https://arrow.apache.org/blog/2019/08/08/r-package-on-cran/:

Note that both Feather and Parquet are columnar data formats that allow sharing data frames across R, Pandas, and other tools. When should you use Feather and when should you use Parquet? Parquet balances space-efficiency with deserialization costs, making it an ideal choice for remote storage systems like HDFS or Amazon S3. Feather is designed for fast local reads, particularly with solid-state drives, and is not intended for use with remote storage systems.