Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

C#

Jupyter Notebook

Shell

Groovy

Swift

C++

Ruby

CSS

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

C#

PowerShell

Shell

Elm

JavaScript

Zig

R

Jupyter Notebook

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇮🇸 Iceland

🇸🇾 Syria

🇧🇯 Benin

🇰🇪 Kenya

🇬🇾 Guyana

🇹🇴 Tonga

🇨🇻 Cabo Verde

🇦🇫 Afghanistan

All Countries Compare Countries

hadley/plyr

High Performance

Stars
493
Rank 89,306 (Top 2 %)
Language
R
License
Other
Created about 16 years ago
Updated about 2 years ago

hadley/plyr

hadley

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

A R package for splitting, applying and combining large problems into simpler problems

plyr

plyr is a set of tools for a common set of problems: you need to split up a big data structure into homogeneous pieces, apply a function to each piece and then combine all the results back together. For example, you might want to:

fit the same model each patient subsets of a data frame
quickly calculate summary statistics for each group
perform group-wise transformations like scaling or standardising

It's already possible to do this with base R functions (like split and the apply family of functions), but plyr makes it all a bit easier with:

totally consistent names, arguments and outputs
convenient parallelisation through the foreach package
input from and output to data.frames, matrices and lists
progress bars to keep track of long running operations
built-in error recovery, and informative error messages
labels that are maintained across all transformations

Considerable effort has been put into making plyr fast and memory efficient, and in many cases plyr is as fast as, or faster than, the built-in equivalents.

A detailed introduction to plyr has been published in JSS: "The Split-Apply-Combine Strategy for Data Analysis", http://www.jstatsoft.org/v40/i01/. You can find out more at https://had.co.nz/plyr/, or track development at https://github.com/hadley/plyr. You can ask questions about plyr (and data manipulation in general) on the plyr mailing list. Sign up at https://groups.google.com/group/manipulatr.

Status

plyr is retired: this means only changes necessary to keep it on CRAN will be made. We recommend using dplyr (for data frames) or purrr (for lists) instead.

r4ds

R for data science: a book

adv-r

Advanced R: a book

stats337

Readings in applied data science

ggplot2-book

ggplot2: elegant graphics for data analysis

mastering-shiny

Mastering Shiny: a book

r-pkgs

Building R packages

tidy-data

A paper on data tidying

emo

Easily insert emoji into R and RMarkdown

r-internals

Documentation for R's internal C API

bigvis

Exploratory data analysis for large datasets (10-100 million observations)

strict

Make R a little bit stricter

data-baby-names

Distribution of US baby names, 1880-2008

reshape

An R package to flexible rearrange, reshape and aggregate data

data-movies

Download data from IMDB movies and parse into useful form

pryr

Pry open the covers of R

assertthat

User friendly assertions for R

r2d3

ggplot2 + d3 = r2d3

babynames

An R package containing US baby names from the SSA

lazyeval

Lazy evaluation: an alternative to non-standard evaluation (NSE) for R

secure

Secure private R data in public packages

purrrlyr

Tools at the intersection of purrr and dplyr

lineprof

Visualise line profiling results in R

requirements

Find packages required for code to run

elmer

Call LLM APIs from R

ggstat

Statistical computations for visualisation

r-python

Exploring data related to relative usage of R vs. python

gg2v

Render ggplot2 graphics using vega

building-permits

Code & data accompanying "whole-game" youtube video

stringb

A dependency-free version of stringr

precis

Succintly Summarise Data Frames

r-on-github

An exploration of R code and package on github, using the github search and repo apis

data-housing-crisis

Clean data related to the housing crisis

tidy-tools

Building tidy tools in R, a workshop

decumar

An alternative to sweave

neiss

Data from National Electronic Injury Surveillance System

monads

Work with Monads in R

joy-of-fp

Supplemental materials for "The joy of functional programming"

crantastic

Source code for crantastic.org: a community site for R

recipes

Wickham family recipes

oldbookdown

cubelyr

A data cube dplyr backend

data-fuel-economy

Fuel economy data, 1978-2008

table-shapes

lvplot

Letter value boxplots for R

usdanutrients

USDA nutrient database as an R data package

reactive-docs

An introduction to reactive documents in R (for teaching stats)

vis-eda

Visualisation for EDA

rsmith

A static site generator for R inspired by metalsmith.io

sfhousing

Code to download and process SF housing sales data

helpr

An alternative html help system for R

profr

An alternative profiling package for R

cocktails

Hadley's cocktail book

productplots

Product graphics for categorical data

shinySignals

data-counties

County boundaries in csv for all US counties

l1tf

L1 trend filtering

ggplot1

Before there was ggplot2

roxygen3

15-state-of-the-union

minby

Compute minimum of one variable grouped by another

mylittlepony

A package for learning about the basics of package development

tidyverse-booster

hadley.github.com

boxplots-paper

mturkr

Tools to make MTurk tasks easy to run from R

monthApp

An example of a Shiny app-package

docker

My personal dockerfiles

fueleconomy

EPA fuel economy data in an R package

meifly

An R package for exploring ensembles of (generalised) linear models

clusterfly

An R package for visualising high-dimensional clustering algorithms

rminds

Sample R code for visualising models (especially models in data space)

sinartra

beautiful-data

Book chapter for beautiful data

eggnogr

Shiny app for scaling eggnog

15-student-papers

Graphics & computing student paper winners @ JSM 2015

fec-dplyr

Exploration of FEC contributions data with dplyr

mexico-mortality

Mortality data for Mexico, along with useful extra data

grouperise

Explore the idea of "grouperised" functions

mutatr

Prototype-based mutable objects for R, based on io and javascript

lvplot-paper

yrbss

Youth Risk Behaviour Surveillance System Data

tanglekit

R bindings for Brett Victor's tangle.js

nasaweather

Data from the 2006 ASA data expo

ggplot2-bayarea

Data, code and slides for ggplot2 talk given to Bay Area useR group, 17 Sep 2009

htmlbook

Convert a Quarto book to O'Reilly's html book format

vita

classifly

An R package to visualise high-dimensional classification boundaries with GGobi

ideas

proto

Prototype Object-Based Programming

cran-logs-dplyr

An case study using dplyr on a large dataset: all package downloads from the Rstudio cran mirror.

scagnostics

An R package to calculate graph theoretic scagnostics

ggplot2movies

What the package does (one paragraph).

tidycore

Core tidyverse packages

densityvis

R package for cutting and binning data

fortify

Convert any R object to a data frame, suitable for visualisation

hadladdin

RStudio add-ins by Hadley

hadcol

Hadley's utilities for adding columns

talk-httr2

localmds

Local multidimensional scaling, an R package

layers

Layers code extracted out of ggplot2