• Stars
    star
    13
  • Rank 1,512,713 (Top 30 %)
  • Language
    C
  • Created almost 14 years ago
  • Updated about 12 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Subset of the kent source libraries (perhaps out of date) that are easily built and installed on OSX and Linux. These libraries provide usefull utilities for bioinformatics programming in C. This may contain some of my own libraries for bioinformatics utilities as well as long as they install easily on both my mac and linux box.

More Repositories

1

SeqPrep

Tool for stripping adaptors and/or merging paired reads with overlap into single reads.
C
140
star
2

IntroToStatisticalLearningR-

Work on Introduction to Statistical Learning
121
star
3

SimSeq

An illumina paired-end and mate-pair short read simulator. This project attempts to model as many of the quirks that exist in Illumina data as possible. Some of these quirks include the potential for chimeric reads, and non-biotinylated fragment pull down in mate-pair libraries . Additionally the program provides the ability to model both site and base specific error, and scripts are provided to train this error model on real datasets. My hope in creating this program is to generate as realistic data as possible to assist in assessing the accuracy of genome assembly tools.
C
65
star
4

GrNMF

An R/Rcpp/RcppArmadillo implementation of Deng Cai's Non-negative Matrix Factorization on Manifold, sometimes called GNMF or GrNMF.
R
5
star
5

Jellyfish

Fork of the jellyfish kmer counter. Here is the description copied from their site: JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers using an order of magnitude less memory and an order of magnitude faster than other k-mer counting packages by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism. JELLYFISH is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the "jellyfish stats" command. See the documentation below for more details.
C++
5
star
6

re-pair

Program to re-do the pairing of fastq reads. This program is modified from http://code.google.com/p/ngopt/source/browse/trunk/tools/pair_reads/repair.cpp?r=85
C++
5
star
7

swift-c-htslib

Ultra-light HTSLIB wrapper for swift, currently just a modulemap giving you access to the C functions.
Swift
3
star
8

cscripts

Miscalanious C and C++ scripts.
C
2
star
9

swift-c-htslib-example

Example swift package built against CHTSLib
Swift
2
star
10

mia

MIA is an reference guided assembler for DNA reads as generated by recent sequencing technologies. It is designed to support unusual short reads like they come from ancient, fragmented DNA.
C
2
star
11

KinectOrbit

A fork of http://www.arduinoandkinectprojects.com/kinectorbit with an attempt at upgrading to Processing v2.
Java
1
star
12

quora_querstion_pairs_kaggle

Python
1
star
13

signalstepdetector

Automatically exported from code.google.com/p/signalstepdetector
MATLAB
1
star