• Stars
    star
    582
  • Rank 76,801 (Top 2 %)
  • Language
    Python
  • Created almost 16 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Incubator for useful bioinformatics code, primarily in Python and R

Collection of useful code related to biological analysis. Much of this is discussed with examples at Blue collar bioinformatics.

All code, images and documents in this repository are freely available for all uses. Code is available under the MIT license and images, documentations and talks under the Creative Commons No Rights Reserved (CC0) license.

Some projects which may be especially interesting:

  • CloudBioLinux -- An automated environment to install useful biological software and libraries. This is used to bootstrap blank machines, such as those you'd find on Cloud providers like Amazon, to ready to go analysis workstations. See the CloudBioLinux effort for more details. This project moved to its own repository at https://github.com/chapmanb/cloudbiolinux.
  • gff -- A GFF parsing library in Python, aimed for inclusion into Biopython.
  • nextgen -- A python toolkit providing best-practice pipelines for fully automated high throughput sequencing analysis. This project has moved into its own repository: https://github.com/chapmanb/bcbio-nextgen
  • distblast -- A distributed BLAST analysis running for identifying best hits in a wide variety of organisms for downstream phylogenetic analyses. The code is generalized to run on local multi-processor and distributed Hadoop clusters.

More Repositories

1

cloudbiolinux

CloudBioLinux: configure virtual (or real) machines with tools for biological analyses
Python
257
star
2

bcbio.variation

Toolkit to analyze genomic variation data, built on the GATK with Clojure
Clojure
67
star
3

homebrew-cbl

Homebrew repository for CloudBioLinux: incubator for formulas to end up in homebrew-science
Ruby
19
star
4

biosqlweb

BioSQL web
Python
13
star
5

clj-blend

Clojure library for interacting with Galaxy, CloudMan, and BioCloudCentral, built on blend4j
Clojure
10
star
6

bcbio.prioritize

Prioritize small variants, structural variants and coverage based on biological inputs
Clojure
7
star
7

r-var

Exploring our genomic variability
Clojure
7
star
8

bcbio-conda

Deprecated conda recipes for bcbio python code and dependencies -- migrated to bioconda
Python
7
star
9

dotfiles

~chapmanb dotfile organization for backup and sychronization
Shell
4
star
10

bcbio.pipeline

Next-generation sequencing analysis pipelines built on Hadoop and Cascalog
Java
4
star
11

clj-gcon

Genome Connector: Clojure API to access multiple genomic resources
Clojure
3
star
12

bcbio.run

Idempotent, transactional runs of external command line programs
Clojure
3
star
13

bcbio.variation.plus

Extended functionality for analyzing genomic variability, built on bcbio.variation and GATK
Clojure
3
star
14

zmk-34key-split

3
star
15

bcbio.adam

Experiment: Clojure interface to ADAM distributed file formats for variants and aligned reads
Clojure
2
star
16

kwd-doc-find

Clojure web server providing full-text document searching via Lucene
Clojure
2
star
17

mgh_projects

In-progress code for various research projects
Python
2
star
18

chapmanb.github.com

1
star
19

bcbio.coverage

Investigate coverage metrics for variant calling experiments
Clojure
1
star