• Stars
    star
    114
  • Rank 308,031 (Top 7 %)
  • Language
    C
  • License
    MIT License
  • Created over 13 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Small utilities for working with fastq sequence files.

fastq-tools

This package provides a number of small and efficient programs to perform common tasks with high throughput sequencing data in the FASTQ format. All of the programs work with typical FASTQ files as well as gzipped FASTQ files.

index

The following programs are provided. See the individual man pages for more information.

  • fastq-sort : sort fastq entries by various keys

  • fastq-grep : match sequences against regular expressions

  • fastq-kmers : count k-mer occurrences

  • fastq-match : (smith-waterman) local sequence alignment

  • fastq-qual : tabulate quality scores

  • fastq-sample : randomly sample reads, with or without replacement

  • fastq-uniq : count duplicate reads

  • fastq-qualadj : adjust quality scores by a fixed offset

install

On most systems, installation is as simple as ./configure && make install.

If the source was obtained from the git repository, the included ./autogen.sh script must be run first to generate the configure script.

The only external dependencies are PCRE (http://www.pcre.org/) and zlib (http://zlib.net/).

contribute

If you have written any small but useful programs to deal with FASTQ files, please consider submitting them for inclusion in fastq-tools. Check out the Github page (https://github.com/dcjones/fastq-tools) or send mail to the author ([email protected]).

copying

This package is provided under a permissive MIT-style license. In particular:

Copyright (C) 2011 by Daniel C. Jones [email protected]

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

More Repositories

1

hat-trie

An efficient trie implementation.
C
252
star
2

mk

make remade
Go
170
star
3

coitrees

A very fast interval tree data structure
Rust
111
star
4

quip

Compressing next-generation sequencing data with extreme prejudice.
C
78
star
5

Judo.jl

A Julia document generator
Julia
57
star
6

proseg

Probabilistic cell segmentation for in situ spatial transcriptomics
Rust
44
star
7

isolator

Rapid and robust analysis of RNA-Seq experiments.
C
32
star
8

colormake

colorize GNU make output
Python
30
star
9

polee

Analyzing RNA-Seq with approximate likelihood
Julia
26
star
10

subsample

Randomly sample lines from massive text files efficiently
C
17
star
11

RecursiveSparseBlocks.jl

julia interface to the librsb sparse matrix library
Julia
16
star
12

Skia.jl

Experimental skia bindings
Julia
11
star
13

seqbias

An R package to correct for sequence bias in RNA-Seq experiments.
C++
10
star
14

fastlog

A faster, reduced-precision logarithm function.
C
8
star
15

Switch.jl

A C-style switch statement for Julia
Julia
8
star
16

vanity

scRNA-Seq normalization for the vain
Python
7
star
17

cbgb

computational biology grab-bag: messy one-off scripts for a variety of tasks
C
6
star
18

Zlib.jl

zlib bindings for Julia
Julia
6
star
19

maxspin

Quantifying spatial information in spatial transcriptomics
Jupyter Notebook
5
star
20

PoleeClassifier.jl

Julia
5
star
21

color.jl

Pretty colors for Julia
Julia
5
star
22

sls

Stochastic L-Systems in Python
Python
5
star
23

Showoff.jl

Nicely format an array of n things for tables and plots
Julia
5
star
24

LazySequences.jl

Lazy sequences.
Julia
4
star
25

ragel-julia

A Julia backend for Ragel
C++
4
star
26

SpuriousCoexpression.jl

Benchmarking tool for spatial transcriptomics segmentation
Julia
3
star
27

ulam-death-spiral

Exposing the depravity of the natural numbers.
CoffeeScript
3
star
28

isolator-paper

The Isolator Paper
Jupyter Notebook
3
star
29

gtf-parse-off

Experiments with parsing gene transfer format
Ragel in Ruby Host
3
star
30

seqsim

A simplistic RNA-Seq simulator.
C++
2
star
31

vim-mk

Vim syntax for mk
Vim Script
2
star
32

AnnDatas.jl

Julia compatibility for the anndata python module
Julia
2
star
33

quip-paper

A paper describing and evaluating the quip compression algorithm.
C
2
star
34

HATTries.jl

julia hat-trie bindings
Julia
1
star
35

GatedLinearNetworks.jl

A gaussian gated linear networks implementation
Julia
1
star
36

gadfly-static-html

Static HTML documentation for Gadfly
JavaScript
1
star
37

dotfiles

Vim Script
1
star
38

peakolator

World's fastest genomic segmentation algorithm.
C
1
star
39

Stepwell.jl

Julia
1
star
40

quip-website

A very simple website for quip.
JavaScript
1
star
41

peakolator-legacy

Some old code.
C++
1
star
42

julia-minimalist-vim

Alternate vim syntax definitions for Julia
Vim Script
1
star