Workflow management with drake

An example drake dependency graph

In fall 2020 the Center for Environmental Research, Education and Outreach at Washington State University hosted a workshop covering reproducible research techniques in R for graduate students. We wanted to cover drake workflows as one day of the workshop to show students what R-specific options there are for managing workflows. We expected that workflow management and to some extent R functions would be unfamiliar topics to many students, so the workshop day included discussion of why one would use workflow management software and some basic examples of building realistic functions. At the time that we ran the workshop I wasn’t aware of targets, so in the future we may repurpose this example and replace drake with that package.

I put together an R Markdown document here where I’ve combined the contents from the intro presentation I used along with the lesson material in a single document. The (mostly) empty folder structure I provided for the walkthrough is here and an example of the finished product is in a repo here. We used Fanaee-T and Gama’s (2013) dataset on bike sharing in DC. More info here.

Link to workshop website.

Matthew Brousil
Matthew Brousil
Data Scientist