• Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
  • Created about 10 years ago
  • Updated about 10 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

it's kind've like make, but not really. and it's for streams!

More Repositories

1

intervaltree

a minimal C++ interval tree implementation
C++
208
star
2

alignment-and-variant-calling-tutorial

basic walk-throughs for alignment and variant calling from NGS sequencing data
191
star
3

fraction.js

A fraction math library in javascript.
JavaScript
163
star
4

seqwish

alignment to variation graph inducer
C++
144
star
5

fastahack

utilities for indexing and sequence extraction from FASTA files
C++
57
star
6

bamaddrg

adds sample names and read-group (RG) tags to BAM alignments
C++
48
star
7

glia

a string to graph aligner
C++
40
star
8

thesis

my PhD thesis
TeX
34
star
9

edyeet

base-accurate DNA sequence alignments using edlib and mashmap2
C++
33
star
10

mmmulti

memory mapped multimap, multiset, and implicit interval tree based on an in-place parallel sort
C++
27
star
11

split

split and join for C++ strings
C++
26
star
12

gimbricate

recompute GFA link overlaps
C++
25
star
13

pafplot

base-level dotplots from PAF alignments
Rust
23
star
14

multichoose

generate multiset combinations (n multichoose k)
C++
23
star
15

guix-genomics

guix packages for bioinformatics software
Scheme
22
star
16

HLA-zoo

genome variation graphs constructed from HLA GRCh38 ALTs
21
star
17

gafpack

convert variation graph alignments to coverage maps over nodes
Rust
20
star
18

hhga

haplotypes genotypes and alleles example decision synthesizer
C++
20
star
19

multipermute

efficient multiset permutations
TypeScript
20
star
20

1000G-integration

variant integration methods for the 1000 Genomes Project
Shell
20
star
21

ogap

gap opening realigner for BAM data streams
C++
18
star
22

viral-assembly

exploring viral genome assembly with variation graph tools
Shell
18
star
23

shuntingyard

implementations of Dijkstra's shunting-yard algorithm for infix notation parsing
Python
18
star
24

smithwaterman

smith-waterman-gotoh alignment algorithm
C++
16
star
25

drawbwt

vector illustrations of BWT based searching on small strings
C++
16
star
26

openconnect-sso-docker

connect to a cisco anyconnect server with 2FA via web SSO on a headless server
Shell
16
star
27

mutatrix

genome simulation across a population with zeta-distributed allele frequency, snps, insertions, deletions, and multi-nucleotide polymorphisms
C++
14
star
28

pca

PCA in rust
Rust
14
star
29

fastix

Prefix-renaming FASTA records really fast.
Rust
13
star
30

yllm

Yet another LLM command line interface
Shell
12
star
31

ssh.py

synchronous, serial, and parallel ssh, scp, and ping methods in python
Python
12
star
32

bitcoin-fs

encode arbitrary data in bitcoin transactions
JavaScript
11
star
33

wflign

the we-flyin WFA-guided ultralong tiling sequence aligner
C++
11
star
34

ACAD18

Day 2 of ACAD's 2018 Advanced Bioinformatics Workshop
11
star
35

dozyg

sequence to graph mapper
C++
11
star
36

succinct-graph

compressed, queryable variation graphs
C++
11
star
37

hapviz

indel haplotype visualization on the command line from BAM files
C++
11
star
38

pafcheck

PAF (pairwise alignment format) validator based on extended CIGAR strings
Rust
11
star
39

USDA-SR22-importer

assists in importing the USDA's SR22 nutritional database from MS Access to MySQL
Shell
10
star
40

splitfa

split a FASTA sequence file into shorter sequences
Rust
10
star
41

foxsamply

algorenderer / a script to convert FoxDot code to recorded samples
Shell
10
star
42

yeast-pangenome

yeast pangenome
Shell
9
star
43

loq

talk to type
Shell
9
star
44

wflambda

WFλ - wavefront alignment with callback match function
C++
8
star
45

bkmers

binary kmers written numerically
Rust
7
star
46

hamming-fasta

brute force online sequence similarity search
Rust
7
star
47

gaffy

GAF (graph alignment format) command line utility
Rust
7
star
48

dirtyzipf

zipfian int distributions using a fast approximation for pow
C++
7
star
49

interleave-fastq

interleaves fastq files (also gzipped ones!)
Python
7
star
50

beats

snippets of music in foxdot
Python
7
star
51

rsvd

rSVD in rust
Rust
6
star
52

arfer

annotation of variants using genotype likelihoods
C++
6
star
53

kmers

generate kmer frequency information from text streams
C++
6
star
54

xbzipLib

XBzip/XBWT implementation from "Compressing and indexing labeled trees, with applications," P Ferragina et al.
C
6
star
55

jvcf

Joint Variant Call Format -- JSON notation for genetic variant annotation
6
star
56

bamprofile

profiles alignment mismatches and gaps
C++
5
star
57

atomicbitvector

atomic bitset/bitvector with size determined at runtime
C++
5
star
58

sgd2

header-only large graph layout via stochastic gradient descent
C++
4
star
59

3q29

3q29 pangenome build from HPRC year 1 samples
4
star
60

pafcov

pairwise alignment file coverage metric
Rust
4
star
61

pafnet

PAF alignment to network format converter
Rust
4
star
62

poa-motifs

exploring variation graph models for motifs based on partial order alignment
4
star
63

paryfor

parallel_for based on atomic queues and C++11 threads
C++
4
star
64

phred.py

phred quality conversion to and from float and log space
Python
4
star
65

intervalstab

interval stabbing (pointwise range lookup) using the FastStabbing algorithm
C++
4
star
66

edlign

exploring identity-based alignment chaining
C++
4
star
67

pafciggy

use the cigar string in the PAF to correct target and query endpoint issues
Rust
4
star
68

graphappy

a variation graph-oriented fork of WhatsHap
C++
4
star
69

lru_cache

A thread-safe LRU cache implementation in C++
C++
4
star
70

aptertree

Apter Trees in rust
Rust
3
star
71

kmerj

kmer counting comparisons in low memory
C++
3
star
72

entropy

shannon entropy
Python
3
star
73

ssw

striped smith waterman implementation
C
3
star
74

simgenie

diplotype caller simulation on pangenie and freebayes
Shell
3
star
75

drosophila

drosophila genome assembly and analysis
Shell
3
star
76

tsvtools

tools for manipulating tab-separated-values files, mostly in C++
C++
3
star
77

endian

c++ header library for manipulating endianness
C
3
star
78

uniqprimers

determines if the primers required to assay VCF records are unique (provided BAM alignments of the primers and a VCF file of the variants)
C++
3
star
79

shastaGFA

scripts for interpreting GFAs made by the shasta assembler
Shell
2
star
80

wikiq

an xml to .tsv converter for wikimedia XML data dumps
C++
2
star
81

leveldb-snappy

demonstrates how to build leveldb with snappy support using the static libraries for each and git submodules
C++
2
star
82

gprof-compare

compare call counts in two gprof profile outputs
Python
2
star
83

vatfilter

extensions for manipulating VCF files annotated with the Variant Annotation Tool (VAT)
Python
2
star
84

qllm

query local LLM openai-API-compatible endpoints
Rust
2
star
85

tsvsplit

split tsv files with headers
2
star
86

nuller

null genotype caller which generates depth of coverage VCF
C++
2
star
87

genmusic

Python
2
star
88

subarch-select

pick an executable based on CPU capabilities
C++
2
star
89

bad-python-list

equal lists should serialize equally, but don't
Python
2
star
90

phage

phage assembly and analysis scripts
R
2
star
91

gatk-swe

Perl
2
star
92

factorial-ln

compute log(n!) of huge numbers
JavaScript
2
star
93

bamquality

read quality distributions for BAM and FASTQ files
C++
2
star
94

strswap

command line tool to swap string patterns in text files
Rust
2
star
95

diatoms

contig ordering using linkage disequilibrium
R
2
star
96

embeddna

test of DNA embeddings using a transformer
Python
2
star
97

vgp

analysis sketches for vertebrate genomes project assemblies
R
2
star
98

protobufstream

protocol buffers streams
C++
1
star
99

tidalcycles-stream

stream a multiuser tidalcycles session
Shell
1
star
100

fasta-utilities

utility scripts and programs for manipulating fasta files
Python
1
star