• Stars
    star
    242
  • Rank 161,117 (Top 4 %)
  • Language
    HTML
  • License
    Other
  • Created almost 4 years ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A collection of datasets originally distributed in R packages

What is this?

Rdatasets is a collection of 2264 datasets which were originally distributed alongside the statistical software environment R and some of its add-on packages. The goal is to make these data more broadly accessible for teaching and statistical software development.

What is included?

The list of available datasets (csv and docs) is available here:

On the github repository you will also find the scripts I use to scrape data and update the website.

Adding data

Rdatasets only includes data from packages published on the CRAN repository. Please open an issue on the Github repository if you would like me to add data from a new package.

License

The code in this repository is licensed under GPL-3.

I believe that the R documentation which I copied to the Rdatasets html folder is licensed under GPL. You will find a copy of the GPL in the Rdatasets github repository.

I made a good faith effort to determine the license under which the actual data (i.e. rows/columns of numbers) were distributed, but I was unable to find a definitive answer. My understanding is that these datasets are free to re-distribute. However, if you own the rights to data that are included here and you object to their inclusion in Rdatasets, send me an email at [email protected]. I will promptly remove the data in question and will make sure that all traces are erased from the git revision history.

More Repositories

1

modelsummary

Beautiful and customizable model summaries in R.
R
778
star
2

countrycode

R package: Convert country names and country codes. Assigns region descriptors.
R
330
star
3

marginaleffects

R package to compute and plot predictions, slopes, marginal means, and comparisons (contrasts, risk ratios, odds, etc.) for over 80 classes of statistical models. Conduct linear and non-linear hypothesis tests, or equivalence tests. Calculate uncertainty estimates using the delta method, bootstrapping, or simulation-based inference.
R
290
star
4

WDI

R package to download World Bank data
R
198
star
5

rethinking2

HTML
69
star
6

pymarginaleffects

Python
43
star
7

Reinhart-Rogoff

Reinhart Rogoff replication files: Python stats with IPython notebook
Python
28
star
8

softbib

Software Bibliographies for R Projects
R
24
star
9

marginsplot

plot marginal effects and predicted values using the `margins` and `ggplot2` libraries for `R`
R
9
star
10

tinysnapshot

Snapshots for unit tests using the tinytest framework for `R`. Includes expectations to test base `R` and `ggplot2` plots as well as console output from `print()`.
R
9
star
11

ACMQ

Analyse Causale et Méthodes Quantitatives
CSS
8
star
12

violets

Violets are BLUE. OLS is too. (R package)
R
8
star
13

opic

Overseas Private Investment Corporation data on projects and insurance claims
R
7
star
14

SpatialHelper

A collection of helper functions for network analysis (TERGM) and spatial econometrics in R
R
4
star
15

modelarchive

Archive of `R` models used to test `{modelsummary}` and `{marginaleffects}`
2
star
16

pycountrycode

Python
2
star
17

regrets

R package to print facts and pictures of egrets
R
1
star
18

vincentarelbundock.github.io

Vincent's projects
CSS
1
star