Chapter 2 Data
2.1 Data from R packages
Good, clean data packages, ideal for teaching beginner to intermediate learners:
- fivethirtyeight
- gapminder
- babynames
- not a package but https://github.com/rfordatascience/tidytuesday
- For tidying:
  - https://github.com/jennybc/lotr
  - Most raw data from gapminder: https://www.gapminder.org/data/
  - https://www.jvcasillas.com/untidydata/
- https://simplystatistics.org/2018/01/22/the-dslabs-package-provides-datasets-for-teaching-data-science/
- https://github.com/rstudio-education/dsbox
- ISLR data
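As a minimal sketch of how a learner might explore what one of these packages offers, the helper below lists the datasets a package ships. The name list_package_datasets is made up for illustration; it relies only on base R's data(). It is demonstrated on the built-in datasets package so that it runs without installing anything, and the gapminder part assumes that package is installed.

```r
# Hypothetical helper (not part of any package): list the datasets a package ships.
list_package_datasets <- function(pkg) {
  # data(package = ...) returns an object whose $results matrix has an "Item" column
  info <- data(package = pkg)
  unname(info$results[, "Item"])
}

# The built-in "datasets" package is always available
print(head(list_package_datasets("datasets")))

# The same call for a teaching package, only if it is installed
if (requireNamespace("gapminder", quietly = TRUE)) {
  print(list_package_datasets("gapminder"))
  print(head(gapminder::gapminder))
}
```

The same pattern should work for fivethirtyeight and babynames once they are installed.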
2.2 Data stored on GitHub
Tip: make a short link to the “raw” version of the data you uploaded to GitHub, so learners can read it directly from a URL.
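One way to get at the “raw” version is to build its URL from the repository details. The sketch below assumes a public repository and uses placeholder values throughout (your-user, your-repo, main, data/example.csv); the helper github_raw_url is hypothetical and not part of any package.

```r
# Hypothetical helper: build the "raw" URL for a file in a public GitHub repo.
github_raw_url <- function(owner, repo, branch, path) {
  paste("https://raw.githubusercontent.com", owner, repo, branch, path, sep = "/")
}

# Placeholder values; replace them with your own repository details.
data_url <- github_raw_url("your-user", "your-repo", "main", "data/example.csv")
print(data_url)

# Once the URL points at a real CSV file, learners can read it directly:
# example <- read.csv(data_url)
```

The resulting URL is the one to shorten with whichever link shortener you prefer.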
2.3 Data from other sources
Tips and tricks for logistics (use_course(), etc.).
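A minimal sketch of the use_course() workflow, assuming it refers to usethis::use_course() and that the usethis package is installed. The repository spec below is a placeholder, not a real course.

```r
# Placeholder: a link to a ZIP file of course materials,
# or a GitHub "owner/repo" spec.
course_materials <- "your-user/your-course-repo"

# Run only in an interactive session, since use_course() asks the learner
# questions and downloads and unpacks files.
if (interactive() && requireNamespace("usethis", quietly = TRUE)) {
  usethis::use_course(course_materials)
}
```

Learners then only need to run a single line to get a local copy of the materials.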
See data packages for sharing your own data.