• Stars
    star
    781
  • Rank 58,232 (Top 2 %)
  • Language
    Python
  • License
    BSD 3-Clause "New...
  • Created almost 11 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

scikit-bio is an open-source, BSD-licensed, Python package providing data structures, algorithms, and educational resources for bioinformatics.

Important project update (April 2022): scikit-bio is currently in maintenance mode. Due to limited developer bandwidth, we are focusing on keeping scikit-bio up-to-date with Python and Python scientific computing libraries. We plan to do this through two annual releases of scikit-bio. At this time, we have less availability for reviewing or adding new features. We realize that scikit-bio is an important tool for the bioinformatics community, and we hope to transition back to more active development in the future. If you're interested in helping by taking a leadership role in the project, please reach out.

scikit-bio logo

Build Status Coverage Status ASV Benchmarks Join the chat at https://gitter.im/biocore/scikit-bio Depsy Badge Anaconda Build Platforms Anaconda Build Version License Downloads Install

scikit-bio is an open-source, BSD-licensed Python 3 package providing data structures, algorithms and educational resources for bioinformatics.

To view scikit-bio's documentation, visit scikit-bio.org.

Note: scikit-bio is no longer compatible with Python 2. scikit-bio is compatible with Python 3.8 and later.

scikit-bio is currently in beta. We are very actively developing it, and backward-incompatible interface changes can and will arise. To avoid these types of changes being a surprise to our users, our public APIs are decorated to make it clear to users when an API can be relied upon (stable) and when it may be subject to change (experimental). See the API stability docs for more details, including what we mean by stable and experimental in this context.

Installing

The recommended way to install scikit-bio is via the conda package manager available in Anaconda or miniconda.

To install the latest release of scikit-bio:

conda install -c conda-forge scikit-bio

Alternatively, you can install scikit-bio using pip:

pip install scikit-bio

You can verify your installation by running the scikit-bio unit tests:

python -m skbio.test

For users of Debian, skbio is in the Debian software distribution and may be installed using:

sudo apt-get install python3-skbio python-skbio-doc

Getting help

To get help with scikit-bio, you should use the skbio tag on StackOverflow (SO). Before posting a question, check out SO's guide on how to ask a question. The scikit-bio developers regularly monitor the skbio SO tag.

Projects using scikit-bio

Some of the projects that we know of that are using scikit-bio are:

If you're using scikit-bio in your own projects, feel free to issue a pull request to add them to this list.

scikit-bio development

If you're interested in getting involved in scikit-bio development, see CONTRIBUTING.md.

See the list of scikit-bio's contributors.

Licensing

scikit-bio is available under the new BSD license. See LICENSE.txt for scikit-bio's license, and the licenses directory for the licenses of third-party software that is (either partially or entirely) distributed with scikit-bio.

The pre-history of scikit-bio

scikit-bio began from code derived from PyCogent and QIIME, and the contributors and/or copyright holders have agreed to make the code they wrote for PyCogent and/or QIIME available under the BSD license. The contributors to PyCogent and/or QIIME modules that have been ported to scikit-bio are: Rob Knight (@rob-knight), Gavin Huttley (@gavinhuttley), Daniel McDonald (@wasade), Micah Hamady, Antonio Gonzalez (@antgonza), Sandra Smit, Greg Caporaso (@gregcaporaso), Jai Ram Rideout (@jairideout), Cathy Lozupone (@clozupone), Mike Robeson (@mikerobeson), Marcin Cieslik, Peter Maxwell, Jeremy Widmann, Zongzhi Liu, Michael Dwan, Logan Knecht (@loganknecht), Andrew Cochran, Jose Carlos Clemente (@cleme), Damien Coy, Levi McCracken, Andrew Butterfield, Will Van Treuren (@wdwvt1), Justin Kuczynski (@justin212k), Jose Antonio Navas Molina (@josenavas), Matthew Wakefield (@genomematt) and Jens Reeder (@jensreeder).

Logo

scikit-bio's logo was created by Alina Prassas.

More Repositories

1

qiime

Official QIIME 1 software repository. QIIME 2 (https://qiime2.org) has succeeded QIIME 1 as of January 2018.
Python
285
star
2

sortmerna

SortMeRNA: next-generation sequence filtering and alignment tool
C++
169
star
3

emp

Code repository of the Earth Microbiome Project.
Jupyter Notebook
154
star
4

mmvec

Neural networks for microbe-metabolite interaction analysis
Python
117
star
5

American-Gut

American Gut open-access data and IPython notebooks
Jupyter Notebook
107
star
6

biom-format

The Biological Observation Matrix (BIOM) Format Project
Python
92
star
7

deblur

Deblur is a greedy deconvolution algorithm based on known read error profiles.
Python
91
star
8

tcga

Microbial analysis in TCGA data
Jupyter Notebook
88
star
9

gemelli

Gemelli is a tool box for running Robust Aitchison PCA (RPCA), Joint Robust Aitchison PCA (Joint-RPCA), TEMPoral TEnsor Decomposition (TEMPTED), and Compositional Tensor Factorization (CTF) on sparse compositional omics datasets.
Python
67
star
10

songbird

Vanilla regression methods for microbiome differential abundance analysis
Python
56
star
11

gneiss

compositional data analysis toolbox
Jupyter Notebook
55
star
12

emperor

Emperor a tool for the analysis and visualization of large microbial ecology datasets
JavaScript
52
star
13

empress

A fast and scalable phylogenetic tree viewer for microbiome data analysis
JavaScript
45
star
14

redbiom

Sample search by metadata and features
Python
44
star
15

unifrac

Python
37
star
16

scikit-bio-cookbook

Recipes for bioinformatics analyses with scikit-bio
Jupyter Notebook
36
star
17

DEICODE

Robust Aitchison PCA from sparse count data
JavaScript
33
star
18

q2-qemistree

Hierarchical orderings for mass spectrometry data. Canonically pronounced "chemis-tree".
Python
31
star
19

qurro

Visualize differentially ranked features (taxa, metabolites, ...) and their log-ratios across samples
JavaScript
31
star
20

calour

exploratory and interactive microbiome analyses based on heatmaps
Python
27
star
21

q2-greengenes2

A QIIME 2 plugin for interaction with the Greengenes2 database
Python
26
star
22

wol

Reference Phylogeny for Bacterial and Archaeal Genomes
Jupyter Notebook
24
star
23

BIRDMAn

Bayesian Inferential Regression for Differential Microbiome Analysis
Python
22
star
24

Platypus-Conquistador

Confirming specific taxonomic groups within your samples.
Python
19
star
25

micronota

annotation pipeline for microbial genomes and metagenomes
Python
18
star
26

tax2tree

Automated taxonomy decoration onto a tree
Python
14
star
27

evident

Python
14
star
28

qadabra

Snakemake workflow for comparison of differential abundance ranks
Python
13
star
29

oecophylla

shotgun pipeline
Python
11
star
30

horizomer

Workflow for detecting genome-wide horizontal gene transfers
Python
11
star
31

greengenes2

Processing support for Greengenes2
Python
11
star
32

pyqi

Tools for developing and testing command line interfaces in Python.
Python
9
star
33

burrito

Python framework for controlling command-line applications.
Python
8
star
34

pynast

Python Nearest Alignment Space Termination tool (PyNAST): Official repository for software and unit tests
Python
8
star
35

metagenomics_pooling_notebook

Jupyter notebooks to assist with sample processing
Python
8
star
36

my-microbes

A set of tools for delivering personal microbiome results to individuals participating in microbiome sequencing studies.
Python
7
star
37

zebra_filter

Filtering out false taxonomic hits from shotgun sequencing based on genome coverage
Python
7
star
38

burrito-fillings

Application controllers for command line bioinformatics applications
Python
7
star
39

Evident-initial-demo

Elucidating sampling effort for microbial analysis studies
JavaScript
7
star
40

mds-approximations

Multidimensional scaling algorithms for microbiology-ecology datasets.
Python
6
star
41

microsetta-private-api

A private microservice to support The Microsetta Initiative
Python
6
star
42

conda-recipes

conda recipes for bioinformatic tools like blast+, infernal, etc.
Python
6
star
43

american-gut-web

The website for the American Gut Project participant portal
Python
5
star
44

qiime-default-reference

Default reference data files for use with QIIME.
Python
4
star
45

scikit-bio-rfcs

Request For Comments (RFCs) for scikit-bio.
4
star
46

labadmin

Administration website for the Knight Lab
Python
4
star
47

q2-umap

Applying umap to microbiome data via QIIME2
Python
4
star
48

improved-octo-waddle

Balanced parentheses succinct data structure in Python
Jupyter Notebook
4
star
49

dsFDR

descrete False Discovery Rate method
Python
3
star
50

SitePainter

A tool for exploring biogeographical patterns
JavaScript
3
star
51

bayestime

Jupyter Notebook
3
star
52

genome-subsampler

Statistical and empirical subsampling of reference genomes
Jupyter Notebook
3
star
53

micov

Aggregate genome coverage
Python
3
star
54

cmi-workshops

2
star
55

taxster

taxster: assigning taxonomy to organisms you've never even heard of
Python
2
star
56

PipeClust

MPI-based sequence clusterer
C
2
star
57

microsetta-public-api

A public microservice to support The Microsetta Initiative
Python
2
star
58

LabControl

lab manager for plate maps and sequence flows
Python
2
star
59

american-gut-rest

RESTful interface into the American Gut data
Python
2
star
60

unifrac-binaries

C++
1
star
61

biocore.github.io

CSS
1
star
62

q2-ili

QIIME2 plugin for `ili
Python
1
star
63

q2-katharoseq

Python
1
star
64

microsetta-interface

The Microsetta participant facing user interface
Jinja
1
star
65

qiime-workshops

Materials for biocore organized workshops
Jupyter Notebook
1
star
66

microprot

structural annotation pipeline for microbial genomes and metagenomes
Python
1
star
67

mg-scripts

Knight Lab internal Metagenomic processing scripts for demultiplexing, QC and host removal
Python
1
star
68

sage-emperor

Emperor implementation in the SAGE2 framework
JavaScript
1
star
69

q2-mislabeled

A QIIME 2 plugin for assessing sample mislabeling and contamination
Python
1
star
70

q2-american-gut

A QIIME2 plugin for working with and processing American Gut data
Python
1
star
71

basespace-qiime

QIIME's BaseSpace App
HTML
1
star