• Stars
    star
    18
  • Rank 1,167,036 (Top 24 %)
  • Language
    C++
  • License
    MIT License
  • Created about 13 years ago
  • Updated over 11 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

gap opening realigner for BAM data streams

More Repositories

1

intervaltree

a minimal C++ interval tree implementation
C++
208
star
2

alignment-and-variant-calling-tutorial

basic walk-throughs for alignment and variant calling from NGS sequencing data
191
star
3

fraction.js

A fraction math library in javascript.
JavaScript
163
star
4

seqwish

alignment to variation graph inducer
C++
141
star
5

fastahack

utilities for indexing and sequence extraction from FASTA files
C++
57
star
6

bamaddrg

adds sample names and read-group (RG) tags to BAM alignments
C++
48
star
7

glia

a string to graph aligner
C++
40
star
8

thesis

my PhD thesis
TeX
34
star
9

edyeet

base-accurate DNA sequence alignments using edlib and mashmap2
C++
32
star
10

split

split and join for C++ strings
C++
26
star
11

gimbricate

recompute GFA link overlaps
C++
25
star
12

mmmulti

memory mapped multimap, multiset, and implicit interval tree based on an in-place parallel sort
C++
25
star
13

multichoose

generate multiset combinations (n multichoose k)
C++
23
star
14

impg

implicit pangenome graph
Rust
23
star
15

pafplot

base-level dotplots from PAF alignments
Rust
22
star
16

HLA-zoo

genome variation graphs constructed from HLA GRCh38 ALTs
21
star
17

guix-genomics

guix packages for bioinformatics software
Scheme
21
star
18

multipermute

efficient multiset permutations
TypeScript
20
star
19

1000G-integration

variant integration methods for the 1000 Genomes Project
Shell
20
star
20

hhga

haplotypes genotypes and alleles example decision synthesizer
C++
19
star
21

gafpack

convert variation graph alignments to coverage maps over nodes
Rust
18
star
22

viral-assembly

exploring viral genome assembly with variation graph tools
Shell
18
star
23

shuntingyard

implementations of Dijkstra's shunting-yard algorithm for infix notation parsing
Python
18
star
24

smithwaterman

smith-waterman-gotoh alignment algorithm
C++
16
star
25

drawbwt

vector illustrations of BWT based searching on small strings
C++
16
star
26

mutatrix

genome simulation across a population with zeta-distributed allele frequency, snps, insertions, deletions, and multi-nucleotide polymorphisms
C++
14
star
27

pca

PCA in rust
Rust
13
star
28

ssh.py

synchronous, serial, and parallel ssh, scp, and ping methods in python
Python
12
star
29

bitcoin-fs

encode arbitrary data in bitcoin transactions
JavaScript
11
star
30

wflign

the we-flyin WFA-guided ultralong tiling sequence aligner
C++
11
star
31

ACAD18

Day 2 of ACAD's 2018 Advanced Bioinformatics Workshop
11
star
32

succinct-graph

compressed, queryable variation graphs
C++
11
star
33

openconnect-sso-docker

connect to a cisco anyconnect server with 2FA via web SSO on a headless server
Shell
11
star
34

fastix

Prefix-renaming FASTA records really fast.
Rust
11
star
35

dozyg

sequence to graph mapper
C++
11
star
36

hapviz

indel haplotype visualization on the command line from BAM files
C++
11
star
37

yllm

Yet another LLM command line interface
Shell
10
star
38

USDA-SR22-importer

assists in importing the USDA's SR22 nutritional database from MS Access to MySQL
Shell
10
star
39

foxsamply

algorenderer / a script to convert FoxDot code to recorded samples
Shell
10
star
40

yeast-pangenome

yeast pangenome
Shell
9
star
41

splitfa

split a FASTA sequence file into shorter sequences
Rust
9
star
42

wflambda

WFλ - wavefront alignment with callback match function
C++
8
star
43

bkmers

binary kmers written numerically
Rust
7
star
44

hamming-fasta

brute force online sequence similarity search
Rust
7
star
45

gaffy

GAF (graph alignment format) command line utility
Rust
7
star
46

interleave-fastq

interleaves fastq files (also gzipped ones!)
Python
7
star
47

beats

snippets of music in foxdot
Python
7
star
48

arfer

annotation of variants using genotype likelihoods
C++
6
star
49

kmers

generate kmer frequency information from text streams
C++
6
star
50

xbzipLib

XBzip/XBWT implementation from "Compressing and indexing labeled trees, with applications," P Ferragina et al.
C
6
star
51

jvcf

Joint Variant Call Format -- JSON notation for genetic variant annotation
6
star
52

rsvd

rSVD in rust
Rust
5
star
53

bamprofile

profiles alignment mismatches and gaps
C++
5
star
54

dirtyzipf

zipfian int distributions using a fast approximation for pow
C++
5
star
55

atomicbitvector

atomic bitset/bitvector with size determined at runtime
C++
5
star
56

3q29

3q29 pangenome build from HPRC year 1 samples
4
star
57

sgd2

header-only large graph layout via stochastic gradient descent
C++
4
star
58

pafcov

pairwise alignment file coverage metric
Rust
4
star
59

pafnet

PAF alignment to network format converter
Rust
4
star
60

paryfor

parallel_for based on atomic queues and C++11 threads
C++
4
star
61

phred.py

phred quality conversion to and from float and log space
Python
4
star
62

intervalstab

interval stabbing (pointwise range lookup) using the FastStabbing algorithm
C++
4
star
63

edlign

exploring identity-based alignment chaining
C++
4
star
64

graphappy

a variation graph-oriented fork of WhatsHap
C++
4
star
65

lru_cache

A thread-safe LRU cache implementation in C++
C++
4
star
66

bamtools

C++ API & command-line toolkit for working with BAM data
C++
4
star
67

aptertree

Apter Trees in rust
Rust
3
star
68

poa-motifs

exploring variation graph models for motifs based on partial order alignment
3
star
69

kmerj

kmer counting comparisons in low memory
C++
3
star
70

entropy

shannon entropy
Python
3
star
71

ssw

striped smith waterman implementation
C
3
star
72

datscriptish

it's kind've like make, but not really. and it's for streams!
3
star
73

simgenie

diplotype caller simulation on pangenie and freebayes
Shell
3
star
74

drosophila

drosophila genome assembly and analysis
Shell
3
star
75

tsvtools

tools for manipulating tab-separated-values files, mostly in C++
C++
3
star
76

endian

c++ header library for manipulating endianness
C
3
star
77

uniqprimers

determines if the primers required to assay VCF records are unique (provided BAM alignments of the primers and a VCF file of the variants)
C++
3
star
78

shastaGFA

scripts for interpreting GFAs made by the shasta assembler
Shell
2
star
79

wikiq

an xml to .tsv converter for wikimedia XML data dumps
C++
2
star
80

leveldb-snappy

demonstrates how to build leveldb with snappy support using the static libraries for each and git submodules
C++
2
star
81

vatfilter

extensions for manipulating VCF files annotated with the Variant Annotation Tool (VAT)
Python
2
star
82

gprof-compare

compare call counts in two gprof profile outputs
Python
2
star
83

qllm

query local LLM openai-API-compatible endpoints
Rust
2
star
84

tsvsplit

split tsv files with headers
2
star
85

nuller

null genotype caller which generates depth of coverage VCF
C++
2
star
86

genmusic

Python
2
star
87

subarch-select

pick an executable based on CPU capabilities
C++
2
star
88

bad-python-list

equal lists should serialize equally, but don't
Python
2
star
89

phage

phage assembly and analysis scripts
R
2
star
90

gatk-swe

Perl
2
star
91

factorial-ln

compute log(n!) of huge numbers
JavaScript
2
star
92

bamquality

read quality distributions for BAM and FASTQ files
C++
2
star
93

strswap

command line tool to swap string patterns in text files
Rust
2
star
94

diatoms

contig ordering using linkage disequilibrium
R
2
star
95

embeddna

test of DNA embeddings using a transformer
Python
2
star
96

protobufstream

protocol buffers streams
C++
1
star
97

tidalcycles-stream

stream a multiuser tidalcycles session
Shell
1
star
98

sautocorr

estimating autocorrelation in sequences with the goal of finding repeat subunits in DNA
C++
1
star
99

fasta-utilities

utility scripts and programs for manipulating fasta files
Python
1
star
100

hexygraph

a hexastore in C++ using leveldb
1
star