Chapter 12 Working with Large Data

Learning objectives:

  • Retrieving Statcast (Baseball Savant) multiple seasons data
  • Using Apache Arrow and Parquet format
  • Using DuckDB
  • Using MySQL (PostgreSQL)
  • Launch Angles and Velocities, Revisited
suppressMessages(library(tidyverse))
# library(RPostgres) # using PostgreSQL instead of MariaDB
library(abdwr3edata)
library(baseballr)
library(fs)
theme_set(theme_classic())

crcblue <- "#2905a1"

crc_fc <- c("#2905a1", "#e41a1c", "#4daf4a", "#984ea3")

options(digits = 3)

options(timeout = max(600, getOption("timeout")))