  • Stars: 436
  • Rank: 96,411 (Top 2 %)
  • Language: Rust
  • License: BSD 3-Clause "New...
  • Created almost 8 years ago
  • Updated 4 months ago

Repository Details

Snappy compression implemented in Rust (including the Snappy frame format).

snap

A pure Rust implementation of the Snappy compression algorithm. Includes streaming compression and decompression using the Snappy frame format. This implementation is ported from both the reference C++ implementation and the Go implementation.

Licensed under the BSD 3-Clause.

Documentation

https://docs.rs/snap

Usage

Add this to your Cargo.toml:

[dependencies]
snap = "1"

Example: compress data on stdin

This program reads data from stdin, compresses it and emits it to stdout. This example can be found in examples/compress.rs:

use std::io;

fn main() {
    let stdin = io::stdin();
    let stdout = io::stdout();

    let mut rdr = stdin.lock();
    // Wrap the stdout writer in a Snappy writer.
    let mut wtr = snap::write::FrameEncoder::new(stdout.lock());
    io::copy(&mut rdr, &mut wtr).expect("I/O operation failed");
}

Example: decompress data on stdin

This program reads data from stdin, decompresses it and emits it to stdout. This example can be found in examples/decompress.rs:

use std::io;

fn main() {
    let stdin = io::stdin();
    let stdout = io::stdout();

    // Wrap the stdin reader in a Snappy reader.
    let mut rdr = snap::read::FrameDecoder::new(stdin.lock());
    let mut wtr = stdout.lock();
    io::copy(&mut rdr, &mut wtr).expect("I/O operation failed");
}

Example: the szip tool

szip is a tool that behaves much like gzip, except that it uses Snappy compression. It can be installed with Cargo:

$ cargo install szip

To compress a file, run szip file. To decompress a file, run szip -d file.sz. See szip --help for more details.

Testing

This crate is tested against the reference C++ implementation of Snappy. Currently, compression is byte-for-byte equivalent with the C++ implementation. This seems like a reasonable starting point, although it is not necessarily a goal to always maintain byte-for-byte equivalence.

Tests against the reference C++ implementation can be run with cargo test --features cpp. Note that you will need to have the C++ Snappy library in your LD_LIBRARY_PATH (or equivalent).

To run tests, you'll need to explicitly run the test crate:

$ cargo test --manifest-path test/Cargo.toml

To test that this library matches the output of the reference C++ library, use:

$ cargo test --manifest-path test/Cargo.toml --features cpp

Tests are in a separate crate because of the dependency on the C++ reference library. Namely, Cargo does not yet permit optional dev dependencies.

Minimum Rust version policy

This crate's minimum supported rustc version is 1.39.0.

The current policy is that the minimum Rust version required to use this crate can be increased in minor version updates. For example, if crate 1.0 requires Rust 1.20.0, then crate 1.0.z for all values of z will also require Rust 1.20.0 or newer. However, crate 1.y for y > 0 may require a newer minimum version of Rust.

In general, this crate will be conservative with respect to the minimum supported version of Rust.

Performance

The performance of this implementation should roughly match the performance of the C++ implementation on x86_64. Below are the results of the microbenchmarks (as defined in the C++ library):

group                         snappy/cpp/                            snappy/snap/
-----                         -----------                            ------------
compress/zflat00_html         1.00     94.5±0.62µs  1033.1 MB/sec    1.02     96.1±0.74µs  1016.2 MB/sec
compress/zflat01_urls         1.00   1182.3±8.89µs   566.3 MB/sec    1.04  1235.3±11.99µs   542.0 MB/sec
compress/zflat02_jpg          1.00      7.2±0.11µs    15.9 GB/sec    1.01      7.3±0.06µs    15.8 GB/sec
compress/zflat03_jpg_200      1.10    262.4±1.84ns   727.0 MB/sec    1.00    237.5±2.95ns   803.2 MB/sec
compress/zflat04_pdf          1.02     10.3±0.18µs     9.2 GB/sec    1.00     10.1±0.16µs     9.4 GB/sec
compress/zflat05_html4        1.00    399.2±5.36µs   978.4 MB/sec    1.01    404.0±2.46µs   966.8 MB/sec
compress/zflat06_txt1         1.00    397.3±2.61µs   365.1 MB/sec    1.00    398.5±3.06µs   364.0 MB/sec
compress/zflat07_txt2         1.00    352.8±3.20µs   338.4 MB/sec    1.01    355.2±5.01µs   336.1 MB/sec
compress/zflat08_txt3         1.01   1058.8±6.85µs   384.4 MB/sec    1.00   1051.8±6.74µs   386.9 MB/sec
compress/zflat09_txt4         1.00   1444.1±8.10µs   318.2 MB/sec    1.00  1450.0±13.36µs   316.9 MB/sec
compress/zflat10_pb           1.00     85.1±0.58µs  1328.6 MB/sec    1.02     87.0±0.90µs  1300.2 MB/sec
compress/zflat11_gaviota      1.07    311.9±4.27µs   563.5 MB/sec    1.00    291.9±1.86µs   602.3 MB/sec
decompress/uflat00_html       1.03     36.9±0.28µs     2.6 GB/sec    1.00     36.0±0.25µs     2.7 GB/sec
decompress/uflat01_urls       1.04    437.4±2.89µs  1530.7 MB/sec    1.00    419.9±3.10µs  1594.6 MB/sec
decompress/uflat02_jpg        1.00      4.6±0.05µs    24.9 GB/sec    1.00      4.6±0.03µs    25.0 GB/sec
decompress/uflat03_jpg_200    1.08    122.4±1.06ns  1558.6 MB/sec    1.00    112.8±1.35ns  1690.8 MB/sec
decompress/uflat04_pdf        1.00      5.7±0.05µs    16.8 GB/sec    1.10      6.2±0.07µs    15.3 GB/sec
decompress/uflat05_html4      1.01    164.1±1.71µs     2.3 GB/sec    1.00    162.6±2.16µs     2.3 GB/sec
decompress/uflat06_txt1       1.08    146.6±1.01µs   989.5 MB/sec    1.00    135.3±1.11µs  1072.0 MB/sec
decompress/uflat07_txt2       1.09    130.2±0.93µs   916.6 MB/sec    1.00    119.2±0.96µs  1001.8 MB/sec
decompress/uflat08_txt3       1.07    387.2±2.30µs  1051.0 MB/sec    1.00    361.9±6.29µs  1124.7 MB/sec
decompress/uflat09_txt4       1.09    536.1±3.47µs   857.2 MB/sec    1.00    494.0±5.05µs   930.2 MB/sec
decompress/uflat10_pb         1.00     32.5±0.19µs     3.4 GB/sec    1.05     34.0±0.48µs     3.2 GB/sec
decompress/uflat11_gaviota    1.00    142.1±2.05µs  1236.7 MB/sec    1.00    141.5±0.92µs  1242.3 MB/sec

Notes: These benchmarks were run with Snappy/C++ 1.1.8. Both the C++ and Rust benchmarks were run with the same benchmark harness. Benchmarks were run on an Intel i7-6900K.
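
To help read the table, here is a small sketch of how the throughput and relative-time columns appear to relate to the measured times. Two things are assumptions rather than facts stated above: that the html input is 102,400 bytes, and that "MB" means 2^20 bytes. The function names are hypothetical.

```rust
/// Throughput in MB/sec, taking 1 MB to be 2^20 bytes (an assumption).
fn throughput_mb_per_sec(bytes: u64, micros: f64) -> f64 {
    let secs = micros / 1_000_000.0;
    bytes as f64 / secs / (1024.0 * 1024.0)
}

/// Relative time: a measurement divided by the fastest one in its row.
fn relative_time(time: f64, fastest: f64) -> f64 {
    time / fastest
}

fn main() {
    // Assumed size of the html input in bytes; not stated in the table.
    let html_bytes = 102_400;

    // compress/zflat00_html: 94.5 µs for C++ and 96.1 µs for snap.
    let cpp = throughput_mb_per_sec(html_bytes, 94.5);
    let snap = throughput_mb_per_sec(html_bytes, 96.1);
    println!("cpp:  {:.1} MB/sec", cpp);
    println!("snap: {:.1} MB/sec", snap);

    // The first column of each group is the time relative to the
    // faster of the two implementations for that row.
    println!("snap relative time: {:.2}", relative_time(96.1, 94.5));
}
```

Under these assumptions the computed values land close to the table's 1033.1 MB/sec, 1016.2 MB/sec and 1.02. The small differences are consistent with the times in the table being rounded.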

Additionally, here are benchmarks of the Go implementation of Snappy (which has a hand-rolled implementation in assembly), run on the same machine. Note that these were run using Go's microbenchmark tool, so the numbers may not be directly comparable, but they should serve as a useful signpost:

Benchmark_UFlat0           25040             45180 ns/op        2266.49 MB/s
Benchmark_UFlat1            2648            451475 ns/op        1555.10 MB/s
Benchmark_UFlat2          229965              4788 ns/op        25709.01 MB/s
Benchmark_UFlat3        11355555               101 ns/op        1973.65 MB/s
Benchmark_UFlat4          196551              6055 ns/op        16912.64 MB/s
Benchmark_UFlat5            6016            189219 ns/op        2164.68 MB/s
Benchmark_UFlat6            6914            166371 ns/op         914.16 MB/s
Benchmark_UFlat7            8173            142506 ns/op         878.41 MB/s
Benchmark_UFlat8            2744            436424 ns/op         977.84 MB/s
Benchmark_UFlat9            1999            591141 ns/op         815.14 MB/s
Benchmark_UFlat10          28885             37291 ns/op        3180.04 MB/s
Benchmark_UFlat11           7308            163366 ns/op        1128.26 MB/s
Benchmark_ZFlat0           12902             91231 ns/op        1122.43 MB/s
Benchmark_ZFlat1             997           1200579 ns/op         584.79 MB/s
Benchmark_ZFlat2          136762              7832 ns/op        15716.53 MB/s
Benchmark_ZFlat3         4896124               245 ns/op         817.27 MB/s
Benchmark_ZFlat4          117643             10129 ns/op        10109.44 MB/s
Benchmark_ZFlat5            2934            394742 ns/op        1037.64 MB/s
Benchmark_ZFlat6            3008            382877 ns/op         397.23 MB/s
Benchmark_ZFlat7            3411            344916 ns/op         362.93 MB/s
Benchmark_ZFlat8             966           1057985 ns/op         403.36 MB/s
Benchmark_ZFlat9             854           1429024 ns/op         337.20 MB/s
Benchmark_ZFlat10          13861             83040 ns/op        1428.08 MB/s
Benchmark_ZFlat11           4070            293952 ns/op         627.04 MB/s

To run benchmarks, including the reference C++ implementation, do the following:

$ cd bench
$ cargo bench --features cpp -- --save-baseline snappy

To compare them, as shown above, install critcmp and run (assuming you saved the baseline above under the name snappy):

$ critcmp snappy -g '.*?/(.*$)'

Finally, the Go benchmarks were run with the following command on commit ff6b7dc8:

$ go test -cpu 1 -bench Flat -download

Comparison with other Snappy crates

  • snappy - These are bindings to the C++ library. No support for the Snappy frame format.
  • snappy_framed - Implements the Snappy frame format on top of the snappy crate.
  • rsnappy - Written in pure Rust, but lacks documentation and the Snappy frame format. Performance is unclear and tests appear incomplete.
  • snzip - Was created and immediately yanked from crates.io.
