• Stars
    star
    8
  • Rank 2,099,232 (Top 42 %)
  • Language
    C++
  • License
    MIT License
  • Created over 5 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A simple yet powerful sequence clustering tool.

More Repositories

1

khmer

In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Python
749
star
2

dammit

just annotate it, dammit!
Python
89
star
3

screed

a simple read-only sequence database, designed for short reads
Python
56
star
4

charcoal

Remove contaminated contigs from genomes using k-mers and taxonomies.
Python
52
star
5

genome-grist

map Illumina metagenomes to genomes!
Jupyter Notebook
36
star
6

elvers

(formerly eelpond) an automated RNA-Seq workflow system
Python
28
star
7

2018-ncbi-lineages

Extract lineage CSVs from NCBI for use with sourmash lca.
Python
26
star
8

dib-MMETSP

Code to generate re-assemblies of the Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP)
Python
21
star
9

khmer-protocols

Python
19
star
10

pybbhash

A Python wrapper for the bbhash library for Minimal Perfect Hashing
C++
18
star
11

2016-pycon-tutorial

Materials for "Creating, building, testing, and documenting a Python project - a hands-on HOWTO - PyCon 2016"
17
star
12

farm-notes

notes on the farm cluster
16
star
13

MQF

C++
12
star
14

kProcessor

kProcessor: kmers processing framework.
C++
11
star
15

literate-resting

Scripts and code for development of literate testing with reST-based shell tutorials.
Python
10
star
16

kmerDecoder

Extract kmers, skipmers, minimizers.
C++
10
star
17

2013-khmer-counting

Paper code: "These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure"
Jupyter Notebook
10
star
18

rcgrep

Search text files for DNA sequences and their reverse complements
Python
10
star
19

khmer-recipes

Python
8
star
20

dib-training

Web site for Data Intensive Training at UC Davis
Jupyter Notebook
8
star
21

2020-paper-sourmash-gather

Here we describe an extension of MinHash that permits accurate compositional analysis of metagenomes with low memory and disk requirements.
HTML
8
star
22

TransAnn-Review

7
star
23

buoy

A project that will change everything. Or something. Or maybe nothing at all.
7
star
24

syrah

Output trusted regions from raw sequencing data.
Python
7
star
25

2020-workflows-paper

Strategies for leveraging workflow systems to streamline large-scale biological analyses
HTML
6
star
26

dib-lab

Issue tracker and Web site for the Lab for Data Intensive Biology at UC Davis.
Python
6
star
27

workflows-book

Issues and draft text for "The Workflow Enabled Bioinformatician"
5
star
28

2014-paper-wssspe14

TeX
5
star
29

2016-pycon-tutorial-project

Python
5
star
30

ged-docs

Holding repository for GED internal documentation of commonly-used tools, with a focus on power-users.
Python
5
star
31

distillerycats

Find disease associations across metagenomes with k-mers using sourmash, and then recover pangenomic accessory elements using spacegraphcats.
Python
5
star
32

2021-panmers

Exploring amino acid kmers as a substitute for genes in microbial pangenome analysis
Jupyter Notebook
5
star
33

2020-ibd

Analysis of publicly available metagenomic sequencing data from humans with IBD.
Jupyter Notebook
4
star
34

2015-paper-10-simple-rules-releasing

4
star
35

q2-sourmash

Qiime2 Sourmash Plugin
Python
4
star
36

2021-paper-ibd

Meta-analysis of metagenomes via machine learning and assembly graphs reveals strain switches in Crohn’s disease
HTML
4
star
37

2012-paper-kmer-percolation

TeX
4
star
38

bioinf_algorithms

Repository for reading group to keep track of work in Bioinformatics Algorithms: an active learning approach
Jupyter Notebook
4
star
39

2013-caltech-workshop

Python
3
star
40

dibsi

Data Intensive Biology Summer Institute
Python
3
star
41

2017-tule-elk

Shell
3
star
42

2021-metapangenome-example

Jupyter Notebook
3
star
43

sourmash-oddify

Tools and workflows for evaluating bacterial and archaeal genome contamination with sourmash and GTDB.
Python
3
star
44

2022-paper-genomic-tax-redundancy

Paper: Comprehensive evaluation of genomic redundancy and taxonomic coherence in large biological databases
HTML
3
star
45

dib_rotation

Metagenomics DIB-lab rotation project
Python
3
star
46

TheGreatGenotyper

C++
2
star
47

workshop_12Jan_2023

Jupyter Notebook
2
star
48

2019-nih-hgrc-grant

Our application to be the NIH Human Genome Reference Center.
2
star
49

2016-illo-screed

Jupyter Notebook
2
star
50

khmer-batlab

batlab.org build scripts
Shell
2
star
51

2012-assembly-methods

2
star
52

ONT_Illumina_genome_assembly

Genome assemblies of Fundulus killifish species with ONT and Illumina sequence data
HTML
2
star
53

2021-orpheum-sim

Simulate short sequencing reads from coding and non-coding portions of bacterial and archaeal genomes to evaluate orpheum ORF prediction
Jupyter Notebook
2
star
54

2016-binder-proposal-to-sloan

2
star
55

2022-branchwater-benchmarking

Benchmarking etc code for https://dib-lab.github.io/2022-paper-branchwater-software/
Jupyter Notebook
2
star
56

CarpentryConWest18

The GitHub repo for CarpentryConnect Davis (formerly known as CarpentryConWest)
2
star
57

2016-sourmash-tax

Taxonomic breakdown of metagenomes using sourmash.
Standard ML
2
star
58

2017-ncbi-taxdump

Standard ML
2
star
59

2013-srop-summer

R
2
star
60

soursigs

Python
2
star
61

2015-09-10-scihack

Inaugural Group Science Coding Session
Python
2
star
62

2019-sourmash-gtdb

Tools and scripts for studying the GTDB taxonomy with sourmash
Python
2
star
63

2018-06-27-DIBSI-Genomics

DIBSI Data Carpentry Genomics
Python
2
star
64

eel-pond

mRNAseq assembly, evaluation, and expression analysis
Python
2
star
65

2016-metagenome-assembly-eval

Standard ML
2
star
66

2022-dominating-set-differential-abundance-example

A reproducible example demonstrating an example workflow for performing dominating set differential abundance analysis
R
2
star
67

2018-mcintyre-output

Jupyter Notebook
1
star
68

oxli

k-mers and the like
Rust
1
star
69

sourmash-slainte

Project template for sourmash-based characterization of genomes and metagenomes
Python
1
star
70

2022-pipeline-antarctic-biogeography

Shell
1
star
71

khmer-citation

The paper that is the citation handle for khmer software.
TeX
1
star
72

2021-paper-metapangenomes

HTML
1
star
73

Haemonchus-pipeline

Automated pipeline to process genomic and RNASeq reads from Haemonchus contortus into a draft genome assembly and draft transcriptomic assembly.
1
star
74

2018-paper-reanalysis-op-ed

TeX
1
star
75

portable-snakemake-workflows

A small, fast, robust workflows using snakemake!
Python
1
star
76

2020-NSURP

Python
1
star
77

mkdocs-material-dib

Mkdocs theme for documentation for DIB Lab software.
CSS
1
star
78

2013-msu-zero-entry

A short bootcamp for MSU: zero-entry computational science.
Python
1
star
79

2022-donut-ibd-classified-sra-search

Standard ML
1
star
80

2022-database-covers

Python
1
star
81

2020-paper-mqf-benchmarks

C++
1
star
82

2017-paper-gather

Python
1
star
83

training-by-practice

Shell
1
star
84

sourmash-utils

Scripts that are useful for sourmash, but not necessarily part of the base command line.
Python
1
star
85

2014-mrnaseq-cloud

TeX
1
star
86

sourmash_plugin_pangenomics

tools for sourmash-based pangenome analyses
Python
1
star
87

2022-import-ranger-hashes

Convert ranger random forest models into sourmash signatures
Python
1
star
88

2019-12-12-sourmash_viz

Python
1
star
89

sourmash_plugin_colorcompare

Add colors representing categories in sourmash compare plots.
Python
1
star