• Stars
    star
    305
  • Rank 136,879 (Top 3 %)
  • Language
    R
  • License
    Other
  • Created over 7 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

R Package ๐Ÿ“ฆ Containing the Datasaurus Dozen datasets ๐Ÿ“Š

datasauRus

Lifecycle: stable CRAN status R-CMD-check

This package wraps the awesome Datasaurus Dozen datasets. The Datasaurus Dozen show us why visualisation is important โ€“ summary statistics can be the same but distributions can be very different. In short, this package gives a fun alternative to Anscombeโ€™s Quartet, available in R as anscombe.

The original Datasaurus was created by Alberto Cairo. The other Dozen were generated using simulated annealing and the process is described in the paper โ€œSame Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealingโ€ by Justin Matejka and George Fitzmaurice (open access materials including manuscript and code, official paper).

In the paper, Justin and George simulate a variety of datasets that the same summary statistics to the Datasaurus but have very different distributions.

Sequential dinosaur gif

Install

The latest stable version is available on CRAN

install.packages("datasauRus")

You can get the latest development version from GitHub, so use {devtools} to install the package

devtools::install_github("jumpingrivers/datasauRus")

Usage

You can use the package to produce Anscombe plots and more.

library("ggplot2")
library("datasauRus")
ggplot(datasaurus_dozen, aes(x = x, y = y, colour = dataset))+
  geom_point() +
  theme_void() +
  theme(legend.position = "none")+
  facet_wrap(~dataset, ncol = 3)

Code of Conduct

Please note that the datasauRus project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms

More Repositories

1

meetingsR

A list of R conferences and user groups
R
102
star
2

namer

R package ๐Ÿ“ฆ for labelling chunks of RMarkdown files! ๐Ÿ’ฅ
R
94
star
3

prettyB

Prettified base graphics
R
63
star
4

podcasts

A collection of Data Science and Statistics podcasts
CSS
24
star
5

diffify

21
star
6

inteRgrate

Continuous integration
R
18
star
7

headR

Adding meta cards to Rmd files
R
17
star
8

optiRum

R Package ๐Ÿ“ฆ of useful helper functions ๐Ÿ’ฐ
R
13
star
9

HIBPwned

R Package ๐Ÿ“ฆ for using the HaveIBeenPwned.com API ๐Ÿ˜ฑ
R
13
star
10

foundational-data-science

Foundational data science
CSS
12
star
11

nhs-ggplot2

Workshop for the NHS R conference
HTML
8
star
12

2022-pydata-global-seaborn

Materials for the Data Visualisation with Seaborn tutorial at PyData Global 2022
8
star
13

meetingsDS

A list of data science events and groups
CSS
7
star
14

ibmsunburst

R ๐Ÿ“ฆ for the IBM โ˜€๐Ÿ“ˆ for the Personality Insights data
R
7
star
15

blog

Code and other content related to Jumping Rivers blogs (https://www.jumpingrivers.com/blog/)
R
6
star
16

2022-nhs-r-quarto

Materials for the Reproducible data reports with Quarto Workshop at NHS R Conference 2022
HTML
6
star
17

neds-meetup-2022

Slides from North East Data Science meetup events (https://www.meetup.com/newcastle-upon-tyne-data-science-meetup/)
6
star
18

2023-nhs-r-seaborn

Materials for 2023 NHS-R Community Conference workshop on Seaborn
Jupyter Notebook
5
star
19

RSS-reproducible-data-reports-with-Quarto

Materials for the Reproducible data reports with Quarto Workshop at RSS International Conference 2022
HTML
5
star
20

SIP-reproducible-data-reports-with-Quarto

CSS
3
star
21

australian-weather

A simple data science project exploring the weather in Australia. Used in our "Git for Me" training course.
R
3
star
22

gaugeR

R
3
star
23

admissionsApp

An R package, created with the {golem} paradigm, to house the admissions data exploration application.
R
2
star
24

2023-pyconuk-seaborn

Materials for the Data Visualisation with Seaborn Clasroom at Pycon UK 2023
Jupyter Notebook
1
star
25

2018-whyr

CSS
1
star
26

hugo-rmd

A small R utility package to help with Hugo sites
R
1
star
27

warranties

A collection of warranties for statistical software
1
star
28

serverheaders

R package for extracting and assessing server headers.
R
1
star
29

zxcvbnR

Password Strength Estimator
R
1
star
30

retentionsApp

An R package, created with the {golem} paradigm, to house the retentions data exploration application.
R
1
star
31

2023-ANNE-version-control

Technical masterclass at Analysts Network North East Spring Conference
CSS
1
star
32

rss-making-maps

Making Maps! workshop for RSS Conference 2023
HTML
1
star