Parsing, Manipulation, and Visualization of Metabarcoding/Taxonomic data

An R package for metabarcoding research planning and analysis

Metacoder is an R package for reading, plotting, and manipulating large taxonomic data sets, such as those generated by modern high-throughput sequencing methods like metabarcoding (i.e., amplification metagenomics, 16S metagenomics, etc.). It provides a tree-based visualization called a “heat tree”, which uses color and size to depict statistics for every taxon in a taxonomy. It also provides functions for common tasks in microbiome bioinformatics on data in the taxmap format defined by the taxa package, such as:

  • Summing read counts/abundance per taxon
  • Converting counts to proportions and rarefying counts using vegan
  • Comparing the abundance (or other characteristics) of groups of samples (e.g., experimental treatments) per taxon
  • Combining data for groups of samples
  • Simulated PCR, via EMBOSS primersearch, for testing primer specificity and coverage of taxonomic groups
  • Converting common microbiome formats for data and reference databases into the objects defined by the taxa package
  • Converting between the phyloseq format and the taxa format
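
To make the first two tasks in the list concrete, here is a minimal base R sketch of converting counts to per-sample proportions and summing read counts per taxon, where each taxon's total includes the reads of all its descendants. It is not metacoder's implementation: metacoder works on taxmap objects and provides its own functions for these steps (for example `calc_obs_props` and `calc_taxon_abund`; see the documentation for their arguments). The toy table, its column names, and the helper `expand_lineage` are all assumptions made for illustration.

```r
# Toy OTU table (hypothetical data): a lineage string plus read counts per sample.
otus <- data.frame(
  otu_id   = c("otu_1", "otu_2", "otu_3"),
  lineage  = c("Bacteria;Firmicutes;Bacilli",
               "Bacteria;Firmicutes;Clostridia",
               "Bacteria;Proteobacteria"),
  sample_a = c(10, 30, 60),
  sample_b = c(5, 5, 40),
  stringsAsFactors = FALSE
)
samples <- c("sample_a", "sample_b")

# Convert raw counts to proportions within each sample.
props <- otus
props[samples] <- lapply(otus[samples], function(x) x / sum(x))

# Hypothetical helper: return a lineage together with all of its ancestors,
# e.g. "A;B;C" gives "A", "A;B", "A;B;C".
expand_lineage <- function(lineage) {
  ranks <- strsplit(lineage, ";", fixed = TRUE)[[1]]
  vapply(seq_along(ranks),
         function(i) paste(ranks[seq_len(i)], collapse = ";"),
         character(1))
}

# Build one row per (OTU, taxon) pair so each OTU counts toward every ancestor.
taxa_per_otu <- lapply(otus$lineage, expand_lineage)
long <- data.frame(
  taxon = unlist(taxa_per_otu),
  otus[rep(seq_len(nrow(otus)), lengths(taxa_per_otu)), samples],
  row.names = NULL
)

# Sum read counts per taxon and sample.
taxon_abund <- aggregate(long[samples], by = list(taxon = long$taxon), FUN = sum)
print(taxon_abund)
```

The point of the sketch is that an OTU assigned to a class also contributes to its phylum and kingdom, which is the kind of hierarchical bookkeeping the taxmap format is designed to handle for you.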

Installation

This project is available on CRAN and can be installed like so:

install.packages("metacoder")

You can also install the development version for the newest features, bugs, and bug fixes:

install.packages("devtools")
devtools::install_github("grunwaldlab/metacoder")

Documentation

All the documentation for metacoder can be found on our website here:

https://grunwaldlab.github.io/metacoder_documentation/

Dependencies

The function that simulates PCR requires primersearch from the EMBOSS tool kit to be installed. This is not an R package, so it is not automatically installed. Type ?primersearch after installing and loading metacoder for installation instructions.
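
Before running the simulated PCR, it can help to confirm that R can see the external program. The following is a small sketch using only base R; it assumes the EMBOSS executable is named `primersearch` and is expected to be on your `PATH`, which may differ on your system.

```r
# Look up the primersearch executable on the PATH.
# Sys.which() returns an empty string when the program is not found.
primersearch_path <- Sys.which("primersearch")

if (nzchar(primersearch_path)) {
  message("Found primersearch at: ", primersearch_path)
} else {
  message("primersearch was not found on the PATH; ",
          "install EMBOSS before using the simulated PCR function.")
}
```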

Relationship with other packages

Many of these operations can be done using other packages like phyloseq, which also provides tools for diversity analysis. The main strength of metacoder is that its functions use the flexible data types defined by taxa, which has powerful parsing and subsetting abilities that take into account the hierarchical relationship between taxa and user-defined data. In general, metacoder and taxa are more of an abstracted tool kit, whereas phyloseq has more specialized functions for community diversity data, but they both can do similar things. I encourage you to try both to see which fits your needs and style best. You can also combine the two in a single analysis by converting between the two data types when needed.

Citation

If you use metacoder in a publication, please cite our article in PLOS Computational Biology:

Foster ZSL, Sharpton TJ, Grünwald NJ (2017) Metacoder: An R package for visualization and manipulation of community taxonomic diversity data. PLOS Computational Biology 13(2): e1005404. https://doi.org/10.1371/journal.pcbi.1005404

Future development

Metacoder is under active development and many new features are planned. Some improvements that are being explored include:

  • Barcoding gap analysis and associated plotting functions
  • A function to aid in retrieving appropriate sequence data from NCBI for in silico PCR from whole genome sequences
  • Graphing of different node shapes in heat trees, possibly including pie graphs or PhyloPics.
  • Adding the ability to plot specific edge lengths in the heat trees so they can be used for phylogenetic trees.
  • Adding more data import and export functions to make parsing and writing common formats easier.

To see the details of what is being worked on, check out the issues tab of the Metacoder GitHub site.

License

This work is subject to the MIT License.

Acknowledgements

Metacoder’s major dependencies are taxa, taxize, vegan, igraph, dplyr, and ggplot2.

This package includes code from the R package ggrepel to handle label overlap avoidance, used with permission from the author of ggrepel, Kamil Slowikowski. We included the code instead of depending on ggrepel because we use functions internal to ggrepel that might change in the future. We thank Kamil Slowikowski for letting us use his code and would like to acknowledge his implementation of the label overlap avoidance used in metacoder.

Feedback and contributions

We would like to hear users’ thoughts on the package and about any errors they run into. Please report errors, questions, or suggestions on the issues tab of the Metacoder GitHub site. We also welcome contributions via a GitHub pull request. You can also talk with us using our Google Groups site.
