Before I Sleep: Benefits of a function-based diet (The {drake} post) #10

utterances-bot · 2020-12-23T04:45:58Z

Before I Sleep: Benefits of a function-based diet (The {drake} post)

https://milesmcbain.com/posts/the-drake-post/

akgandhi · 2020-12-23T04:45:59Z

wonderful! thanks for sharing.

dewoller · 2021-03-18T01:12:12Z

I can't believe that Benefits of a function-based diet was published less than a year ago. It completely took over my drake practice, for the better. I'm a fan boy!

I am about to evangelise my department to this practice, and I note that targets has 'superceded' drake. I can certainly see the advantages of targets package, but I am interested in your take . I note that William Landau has taken on board much of your thinking, to his advantage.

My questions are: have you moved to /targets/ for your new projects? How do the conflicted and dotenv packages fit into your workflow? What is your advice for a new drake/targets shop, and for the evangelisation task in general?

MilesMcBain · 2021-03-23T11:39:52Z

Thanks @dewoller, I am glad the approach has proven fruitful!

I use {targets} now for all my new projects. I've ported some of our legacy projects over - since it turns out this is quite painless to do.

Thanks to {tarchetypes} an almost identical workflow with {targets} to what I had with {dflow} is possible. See {tflow} for my latest project template. I am still using {conflicted} and {dotenv} as before.

There are 3 big advantages I have seen with {targets}

The debugging workflow with saved workspaces is quite nice.
The way the dynamic branching stuff works feels much more straight forward, and is now possible to debug thanks to 1.
The annoying 'repacking large object' issue that sometimes arose when caching large objects and caused major plan slowness is gone. {targets} uses a different serialisation format.

On the flip side, one thing I have found slightly annoying is the way input file dependencies are supposed to work. You're expected to declare them all up front in the plan, whereas with {drake} you could call file_in in nested functions. It was a handy way of creating stubs or placeholder functions that used temporary data to be properly plumbed in later.

The evangelism task wasn't too hard for me because I had a very keen 'first follower' who had been in the team a long time and had some clout.

Like I said in the post we were having problems with reproducing work 5 minutes & 5 meters away - the dreaded works on my machine syndrome. I was able to show how some of this was being created by people accumulating stale state in their R sessions and that was leading to weird stuff. The explicit package dependencies and workflow dependency graph to be run in a fresh session every time offered a robust solution to this issue.

Your team might not have that issue, but I can offer some general advice: When introducing {targets} or {drake} go with a really basic set of features. Simple static dependency graphs offer nice productivity and reproducibility gains. Bring in parallelism, dynamic branching, custom triggers, and other advanced features later once people are sold on the workflow and have some comfort with the tooling.

Good luck! 👍

MilesMcBain added the comment_thread label Feb 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Before I Sleep: Benefits of a function-based diet (The {drake} post) #10

Before I Sleep: Benefits of a function-based diet (The {drake} post) #10

utterances-bot commented Dec 23, 2020

akgandhi commented Dec 23, 2020

dewoller commented Mar 18, 2021 •

edited

MilesMcBain commented Mar 23, 2021

Before I Sleep: Benefits of a function-based diet (The {drake} post) #10

Before I Sleep: Benefits of a function-based diet (The {drake} post) #10

Comments

utterances-bot commented Dec 23, 2020

Before I Sleep: Benefits of a function-based diet (The {drake} post)

akgandhi commented Dec 23, 2020

dewoller commented Mar 18, 2021 • edited

MilesMcBain commented Mar 23, 2021

dewoller commented Mar 18, 2021 •

edited