24.3 Key commands to extract data from HTML

  • Key package: rvest
  • Load the HTML page to scrape: read_html()
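  • Extract data through CSS selectors (or XPath)
    • html_elements() returns all matches; html_element() returns the first match
    • html_attr() returns the value of an attribute (e.g. href)
    • html_text2() returns the text of an element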
  • Extract data from HTML tables: html_table()
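
The commands above can be tried together on a small example. The following is a minimal sketch, not a definitive workflow: the HTML snippet, its contents, and the variable names (page_source, page, title, notes, link, prices) are made up for illustration, and the page is parsed from a string rather than a real URL so that it runs offline.

```r
library(rvest)

# Hypothetical HTML snippet standing in for a real web page
page_source <- '<html><body>
  <h1 id="title">Fruit prices</h1>
  <p class="note">Prices in <a href="https://example.com/usd">USD</a></p>
  <p class="note">Updated weekly</p>
  <table>
    <tr><th>fruit</th><th>price</th></tr>
    <tr><td>apple</td><td>1.5</td></tr>
    <tr><td>pear</td><td>2</td></tr>
  </table>
</body></html>'

# Load the HTML; read_html() also accepts a URL or a file path
page <- read_html(page_source)

# html_element(): first match for the selector, here the element with id "title"
title <- page %>% html_element("#title") %>% html_text2()

# html_elements(): all matches, here every <p> with class "note"
notes <- page %>% html_elements("p.note") %>% html_text2()

# html_attr(): value of an attribute, here the link target
link <- page %>% html_element("a") %>% html_attr("href")

# html_table(): convert a <table> element into a data frame (tibble)
prices <- page %>% html_element("table") %>% html_table()
```

With a real site, only the argument to read_html() and the selectors would change; the selectors have to be found by inspecting the page source.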