Storing Large Data Efficiently
A full season of Statcast data contains over 700k rows and nearly 118 variables.
sc2023 |>
object.size() |>
print(units = "MB")
## 643.2 Mb
The total memory size is around 643MB. The CSVs occupy around 72% of the data stored into memory.