• Stars
    star
    220
  • Rank 180,422 (Top 4 %)
  • Language
  • Created almost 6 years ago
  • Updated almost 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Resources from my Rstudio::conf 2019 talk

How To Make Your Data Analysis Notebooks More Reproducible

rstudio_talk_slides

Slide deck | Slide deck as PDF

πŸŽ₯ Video of talk at rstudio::conf(2019)

Resources

I have included a handful of links to papers, software packages and tutorials/manuals about some tools I mention in my talk. Pull requests or issues on additional ones to include are welcome.

Research Compendia

Examples of Research Compendia on GitHub Below are a few links to real world examples of research compendia in R. To have a minimal compendium, all you really need is a valid DESCRIPTION file containing a handful of fields such as type, name, version and dependencies. See Marwick et al 2017 for a detailed description of the different types of compendia.

Small

Medium

Large

Software packages related to research compendia

  • πŸ“¦ rrtools by Ben Marwick (also the author of the packaging data analysis paper mentioned above) extends functions in devtools and provides instructions, templates, and functions to make a basic compendium suitable for doing reproducible research with R.
    • Also see πŸ“¦ workflowr by John Blischak and the task view on R-based data analysis projects maintained by John Blischak, Anna Krystalli, Ben Marwick, Daniel NΓΌst.
  • πŸ“¦ usethis Many of the major function in rrtools are imported from usethis. A savvy user can get by setting up and maintaining a compendium purely with usethis functions.
  • πŸ“¦ goodpractice - Designed to help you build more robust packages, the package does a deep dive on your package contents and provide advice on syntax pitfalls to avoid, code formatting suggestions, and helps you improve overall package structure.
  • The πŸ“¦ rticles package by JJ has numerous journal templates and together with Rstudio addins like word countaddin and citr + knitcitations.

πŸ“ˆ Data management

  • πŸ“¦ piggyback, [docs]: This clever R package allows you to attach arbitrary data (or other) files (upto 2gb each) to a GitHub release. Given GitHub's fast CDN, this would be an easy way to quickly attach large files to a compendium and read them back in a local/collaborator/remote environment very easily. As always be sure to archive a long-term copy on Zenodo.
  • πŸ“¦ arkdb [docs]: This package allows you to archive and unarchive databases as flat text files.
  • πŸŽ₯ For more on setting up data packages, see this excellent talk by Noam Ross at New York R.

Computational environments: Binder and friends

Other hosted Binder hubs

Setting up Binder for your analysis

I have captured all the various ways to set up mybinder with a R project in a separate document.

Are you interested in setting up or hosting a binderhub for the R community? Get in touch via the issues.

Also see

Software packages related to setting up computational environments

  • πŸ“¦ Containerit. Detailed blog post This sweet package will generate a Dockerfile for you by examining the code inside a folder or just from your session info. This is analogous to repo2docker but is very R centric
  • stevedore Although there are a few docker clients (docker, harbor), this is my recommendation for managing docker containers from inside R.

πŸ”¨ Workflows: drake and friends

  • πŸ“¦ drake - An R-focused pipeline toolkit for reproducibility and high-performance computing. Install the package from here or CRAN.
  • The prequel to the drake R package A blog post by the creator of drake describing his motivation for the package.
  • drake manual A detailed bookdown guide on how to setup and use drake for projects of varying levels of complexity.
  • Presentation on drake Slides from a talk by Will Landau (who is here at the conference so go pick his brain if you want to learn more!)

Real world drake examples

Miscellaneous


Acknowledgments

Many thanks to Chris Holdgraf, Carl Boettiger, Will Landau, and Ben Marwick for various discussions on these topics. Also thanks to Ciera Martinez, Kara Woo, and Nick Tierney for comments on the presentation.

More Repositories

1

wesanderson

A Wes Anderson color palette for R
R
1,766
star
2

markdown_science

Learn how to use markdown for science
TeX
300
star
3

holepunch

Make your R project Binder ready
R
249
star
4

rdrop2

Dropbox Interface from R
R
248
star
5

testdat

A package to run unit tests on tabular data
R
141
star
6

life-hacks

Some life hacks documented
97
star
7

smb_git

A review paper describing how git can be used to improve reproducibility in science
Shell
85
star
8

rDrop

Programmatic interface to Dropbox
R
64
star
9

dashboard

A R package dashboard generator
CSS
44
star
10

rdat

An R interface to data Dat
R
43
star
11

ddd

data, data, data paper
TeX
43
star
12

ggplot-lecture

My lecture on ggplot at Cal (spring 2013)
TeX
41
star
13

Rtools

Sublime text package for R
Python
33
star
14

zenodo

Deposit data (or any research object) into Zenodo
R
17
star
15

woRstfigures

woRst R figures
12
star
16

fujifilm

Just a collection of my notes on the Fujifilm X-T2 and X-mount ecosystem
12
star
17

randNames

Random username (+other useful info) generator
R
11
star
18

binder-test

R
11
star
19

git_intro

A quick introduction to git
CSS
10
star
20

sheetseeR

A quick and easy interface to Google Docs + SheetSee.js
R
9
star
21

dcTemplate

Data Carpentry R lesson template
HTML
7
star
22

mozfest-data-lessons

Repo for the data carpentry session at Mozfest
5
star
23

dlab-advanced-r

Repo with material on learning testing, documentation, and package development in R.
5
star
24

mapboxer

Mapbox from R
HTML
5
star
25

ggit

Git visualization tools for git2r
R
5
star
26

software-mapping-workshop

Slides, notes, and resources for the CZI Essentials of Open Source grantee meeting, December 9th, 2020
4
star
27

colorPalette

A colorPalette API for R
4
star
28

webnativesci

OKFestival session on tools and skills for web native science
JavaScript
4
star
29

sevilla15

Repo for workshop at Sevilla, February 2015
CSS
3
star
30

coyote

Functions from my .rprofile now moved to a package
R
3
star
31

esa_data_viz

ESA 2013, Data Visualization in R workshop
R
3
star
32

Rtools3

R tools for Sublime text 3
Python
3
star
33

stat290

R
2
star
34

bmc_post

Set of files for BMC blog post
2
star
35

rProvenance

A prototype for a provenance library implementation in R
R
2
star
36

gigadb

R
2
star
37

RRR

The reproducibility lexicon project at BIDS
Python
2
star
38

commcall

R
2
star
39

rthings

Provides short lists of fake data for use in examples and demos
R
2
star
40

tradeoff

revisiting classic life-history tradeoffs with realistic assumptions.
R
2
star
41

api-best-practices

plos comp bio paper draft
2
star
42

almviz

ALM Visualization from the PLOS Hackathon
CSS
2
star
43

badge-test

1
star
44

ecotools

Find out what tools other scientists use to stay productive
1
star
45

roweb-test

HTML
1
star
46

distill

HTML
1
star
47

2014-02-13-UNSW

Repository for the UNSW Software Carpentry Bootcamp
Python
1
star
48

karthikram.github.com

My Jekyll-Bootstrap github page
JavaScript
1
star
49

binder-test-docker-tidyverse

R
1
star
50

misc-scripts

Python
1
star
51

nick_dates

R
1
star
52

dygraph-test

R
1
star
53

themegray4lyfe

HTML
1
star
54

2014-10-31-nw

Repository for the NorthWestern University bootcamp
Perl
1
star
55

urssi_figures

Repository accompanying "A Survey of the State of the Practice for Research Software in the United States"
HTML
1
star
56

species_interactions

1
star
57

eeguide

A guide to using the ecoengine
CSS
1
star
58

R-Icon

An alternative icon for R.
1
star
59

demo

R
1
star
60

antweb_paper

Paper on AntWeb
TeX
1
star
61

sparse

TeX
1
star
62

write-with-make

A quick repo with an example of how to write with Makefiles
TeX
1
star
63

ucsf19

Notes and slides from my workshop on open code at UCSF
1
star
64

shiny

Short Shiny tutorial
R
1
star
65

DataCabin

1
star