This workshop provides an introduction to data science using R. Participants learn an entire pipeline of data analysis, from importing data to creating reproducible reports. Along the way we introduce several core
tidyverse packages, which work together to make the process of data analysis seamless. These include
dplyr for data transformation,
tidyr for data tidying, and
ggplot2 for creating data visualizations. We then introduce core
tidymodels packages for creating modeling pipelines including,
parsnip for creating model specifications,
recipes for feature engineering, and
yardstick for evaluating model performance. Finally, we discuss how to use
rmarkdown for creating reproducible reports.
See the workshop websites for more, including links to the GitHub repositories for all workshop materials.