• Stars
    star
    26
  • Rank 930,752 (Top 19 %)
  • Language
    Python
  • License
    BSD 3-Clause "New...
  • Created almost 7 years ago
  • Updated about 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Extract lineage CSVs from NCBI for use with sourmash lca.

More Repositories

1

khmer

In-memory nucleotide sequence k-mer counting, filtering, graph traversal and more
Python
749
star
2

dammit

just annotate it, dammit!
Python
89
star
3

screed

a simple read-only sequence database, designed for short reads
Python
56
star
4

charcoal

Remove contaminated contigs from genomes using k-mers and taxonomies.
Python
52
star
5

genome-grist

map Illumina metagenomes to genomes!
Jupyter Notebook
36
star
6

elvers

(formerly eelpond) an automated RNA-Seq workflow system
Python
28
star
7

dib-MMETSP

Code to generate re-assemblies of the Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP)
Python
21
star
8

khmer-protocols

Python
19
star
9

pybbhash

A Python wrapper for the bbhash library for Minimal Perfect Hashing
C++
18
star
10

2016-pycon-tutorial

Materials for "Creating, building, testing, and documenting a Python project - a hands-on HOWTO - PyCon 2016"
17
star
11

farm-notes

notes on the farm cluster
16
star
12

MQF

C++
12
star
13

kProcessor

kProcessor: kmers processing framework.
C++
11
star
14

literate-resting

Scripts and code for development of literate testing with reST-based shell tutorials.
Python
10
star
15

kmerDecoder

Extract kmers, skipmers, minimizers.
C++
10
star
16

2013-khmer-counting

Paper code: "These are not the k-mers you are looking for: efficient online k-mer counting using a probabilistic data structure"
Jupyter Notebook
10
star
17

rcgrep

Search text files for DNA sequences and their reverse complements
Python
10
star
18

khmer-recipes

Python
8
star
19

kSpider

A simple yet powerful sequence clustering tool.
C++
8
star
20

dib-training

Web site for Data Intensive Training at UC Davis
Jupyter Notebook
8
star
21

2020-paper-sourmash-gather

Here we describe an extension of MinHash that permits accurate compositional analysis of metagenomes with low memory and disk requirements.
HTML
8
star
22

TransAnn-Review

7
star
23

buoy

A project that will change everything. Or something. Or maybe nothing at all.
7
star
24

syrah

Output trusted regions from raw sequencing data.
Python
7
star
25

2020-workflows-paper

Strategies for leveraging workflow systems to streamline large-scale biological analyses
HTML
6
star
26

dib-lab

Issue tracker and Web site for the Lab for Data Intensive Biology at UC Davis.
Python
6
star
27

workflows-book

Issues and draft text for "The Workflow Enabled Bioinformatician"
5
star
28

2014-paper-wssspe14

TeX
5
star
29

2016-pycon-tutorial-project

Python
5
star
30

ged-docs

Holding repository for GED internal documentation of commonly-used tools, with a focus on power-users.
Python
5
star
31

distillerycats

Find disease associations across metagenomes with k-mers using sourmash, and then recover pangenomic accessory elements using spacegraphcats.
Python
5
star
32

2021-panmers

Exploring amino acid kmers as a substitute for genes in microbial pangenome analysis
Jupyter Notebook
5
star
33

2020-ibd

Analysis of publicly available metagenomic sequencing data from humans with IBD.
Jupyter Notebook
4
star
34

2015-paper-10-simple-rules-releasing

4
star
35

q2-sourmash

Qiime2 Sourmash Plugin
Python
4
star
36

2021-paper-ibd

Meta-analysis of metagenomes via machine learning and assembly graphs reveals strain switches in Crohn’s disease
HTML
4
star
37

2012-paper-kmer-percolation

TeX
4
star
38

bioinf_algorithms

Repository for reading group to keep track of work in Bioinformatics Algorithms: an active learning approach
Jupyter Notebook
4
star
39

2013-caltech-workshop

Python
3
star
40

dibsi

Data Intensive Biology Summer Institute
Python
3
star
41

2017-tule-elk

Shell
3
star
42

2021-metapangenome-example

Jupyter Notebook
3
star
43

sourmash-oddify

Tools and workflows for evaluating bacterial and archaeal genome contamination with sourmash and GTDB.
Python
3
star
44

2022-paper-genomic-tax-redundancy

Paper: Comprehensive evaluation of genomic redundancy and taxonomic coherence in large biological databases
HTML
3
star
45

dib_rotation

Metagenomics DIB-lab rotation project
Python
3
star
46

TheGreatGenotyper

C++
2
star
47

workshop_12Jan_2023

Jupyter Notebook
2
star
48

2019-nih-hgrc-grant

Our application to be the NIH Human Genome Reference Center.
2
star
49

2016-illo-screed

Jupyter Notebook
2
star
50

khmer-batlab

batlab.org build scripts
Shell
2
star
51

2012-assembly-methods

2
star
52

ONT_Illumina_genome_assembly

Genome assemblies of Fundulus killifish species with ONT and Illumina sequence data
HTML
2
star
53

2021-orpheum-sim

Simulate short sequencing reads from coding and non-coding portions of bacterial and archaeal genomes to evaluate orpheum ORF prediction
Jupyter Notebook
2
star
54

2016-binder-proposal-to-sloan

2
star
55

2022-branchwater-benchmarking

Benchmarking etc code for https://dib-lab.github.io/2022-paper-branchwater-software/
Jupyter Notebook
2
star
56

CarpentryConWest18

The GitHub repo for CarpentryConnect Davis (formerly known as CarpentryConWest)
2
star
57

2016-sourmash-tax

Taxonomic breakdown of metagenomes using sourmash.
Standard ML
2
star
58

2017-ncbi-taxdump

Standard ML
2
star
59

2013-srop-summer

R
2
star
60

soursigs

Python
2
star
61

2015-09-10-scihack

Inaugural Group Science Coding Session
Python
2
star
62

2019-sourmash-gtdb

Tools and scripts for studying the GTDB taxonomy with sourmash
Python
2
star
63

2018-06-27-DIBSI-Genomics

DIBSI Data Carpentry Genomics
Python
2
star
64

eel-pond

mRNAseq assembly, evaluation, and expression analysis
Python
2
star
65

2016-metagenome-assembly-eval

Standard ML
2
star
66

2022-dominating-set-differential-abundance-example

A reproducible example demonstrating an example workflow for performing dominating set differential abundance analysis
R
2
star
67

2018-mcintyre-output

Jupyter Notebook
1
star
68

oxli

k-mers and the like
Rust
1
star
69

sourmash-slainte

Project template for sourmash-based characterization of genomes and metagenomes
Python
1
star
70

2022-pipeline-antarctic-biogeography

Shell
1
star
71

khmer-citation

The paper that is the citation handle for khmer software.
TeX
1
star
72

2021-paper-metapangenomes

HTML
1
star
73

Haemonchus-pipeline

Automated pipeline to process genomic and RNASeq reads from Haemonchus contortus into a draft genome assembly and draft transcriptomic assembly.
1
star
74

2018-paper-reanalysis-op-ed

TeX
1
star
75

portable-snakemake-workflows

A small, fast, robust workflows using snakemake!
Python
1
star
76

2020-NSURP

Python
1
star
77

mkdocs-material-dib

Mkdocs theme for documentation for DIB Lab software.
CSS
1
star
78

2013-msu-zero-entry

A short bootcamp for MSU: zero-entry computational science.
Python
1
star
79

2022-donut-ibd-classified-sra-search

Standard ML
1
star
80

2022-database-covers

Python
1
star
81

2020-paper-mqf-benchmarks

C++
1
star
82

2017-paper-gather

Python
1
star
83

training-by-practice

Shell
1
star
84

sourmash-utils

Scripts that are useful for sourmash, but not necessarily part of the base command line.
Python
1
star
85

2014-mrnaseq-cloud

TeX
1
star
86

sourmash_plugin_pangenomics

tools for sourmash-based pangenome analyses
Python
1
star
87

2022-import-ranger-hashes

Convert ranger random forest models into sourmash signatures
Python
1
star
88

2019-12-12-sourmash_viz

Python
1
star
89

sourmash_plugin_colorcompare

Add colors representing categories in sourmash compare plots.
Python
1
star