Daniel Lemire (@lemire)

Top repositories

1

FastPFor

The FastPFOR C++ library: Fast integer compression
C++
836
star
2

Code-used-on-Daniel-Lemire-s-blog

This is a repository for the code posted on my blog
C
720
star
3

fast_double_parser

Fast function to parse strings into double (binary64) floating-point values, enforces the RFC 7159 (JSON standard) grammar: 4x faster than strtod
C++
574
star
4

javaewah

A compressed alternative to the Java BitSet class
Java
556
star
5

JavaFastPFOR

A simple integer compression library in Java
Java
505
star
6

simdcomp

A simple C library for compressing lists of integers using binary packing
C
463
star
7

EWAHBoolArray

A compressed bitmap class in C++.
C++
430
star
8

SIMDCompressionAndIntersection

A C++ library to compress and intersect sorted lists of integers using SIMD instructions
C++
410
star
9

fastbase64

SIMD-accelerated base64 codecs
C
388
star
10

streamvbyte

Fast integer compression in C using the StreamVByte codec
C
357
star
11

FastPriorityQueue.js

a fast heap-based priority queue in JavaScript
JavaScript
328
star
12

fastvalidate-utf-8

header-only library to validate utf-8 strings at high speeds (using SIMD instructions)
C
284
star
13

fastrange

A fast alternative to the modulo reduction
C
272
star
14

fastmod

A C/C++ header file for fast 32-bit division remainders (and divisibility tests) on 64-bit hardware.
C++
264
star
15

clhash

C library implementing the ridiculously fast CLHash hashing function
C
256
star
16

externalsortinginjava

External-Memory Sorting in Java
Java
248
star
17

rollinghashcpp

Rolling Hash C++ Library
C++
179
star
18

FastBitSet.js

Speed-optimized BitSet implementation for modern browsers and JavaScript engines
JavaScript
150
star
19

docker_programming_station

A simple script to help programmers who want to work within Docker
Shell
142
star
20

despacer

C library to remove white space from strings as fast as possible
C
133
star
21

StronglyUniversalStringHashing

Benchmark showing the we can randomly hash strings very quickly with good universality
C++
131
star
22

testingRNG

Testing common random-number generators (RNG)
C++
131
star
23

MaskedVByte

Fast decoder for VByte-compressed integers
C
114
star
24

cbitset

A simple bitset library in C
C
110
star
25

lbimproved

Dynamic Time Warping (DTW) library implementing lower bounds (LB_Keogh, LB_Improved...)
C++
105
star
26

dictionary

High-performance dictionary coding
C++
99
star
27

SIMDxorshift

Fast random number generators: Vectorized (SIMD) version of xorshift128+
C
99
star
28

csharpewah

Compressed bitmaps in C#
C#
82
star
29

fastrand

Fast random number generation in an interval in Python: Up to 10x faster than random.randint.
C
77
star
30

bloofi

Bloofi: A java implementation of multidimensional Bloom filters
Java
75
star
31

rollinghashjava

Rolling hash functions in Java
Java
75
star
32

LittleIntPacker

C library to pack and unpack short arrays of integers as fast as possible
C
70
star
33

simdpcg

Vectorized version of the PCG random number generator
C
68
star
34

TypedFastBitSet.js

Speed-optimized BitSet implementation for modern browsers and JavaScript engines, uses typed arrays
TypeScript
67
star
35

FastIntegerCompression.js

Fast integer compression library in JavaScript
JavaScript
54
star
36

simdprune

Pruning elements in SIMD vectors (i.e., packing left elements)
C
50
star
37

testingmlp

Testing memory-level parallelism
C
49
star
38

FastDifferentialCoding

Fast differential coding functions (using SIMD instructions)
C
49
star
39

multiplatform_simd_recipes

SIMD recipes, for various platforms (collection of code snippets)
C
43
star
40

runningmaxmin

Fast maximum-minimum filters implemented in C++
C++
43
star
41

fasthashing

A Variable-Length String Hashing Library in C++
C++
40
star
42

fast_division

Simple C++ code to benchmark fast division algorithms
C++
40
star
43

FrameOfReference

C++ library to pack and unpack vectors of integers having a small range of values using a technique called Frame of Reference
C
40
star
44

validateutf8-experiments

Reproducible experimeents on UTF-8 validation using SIMD instructions
C++
39
star
45

SwiftBitset

A fast Bitset class in Swift
Swift
36
star
46

FastShuffleExperiments

How fast can we shuffle values?
C++
33
star
47

programmingishard

Long-term book project
32
star
48

StablePriorityQueue.js

A stable heap-based priority queue in JavaScript
JavaScript
27
star
49

fastscancount

Fast implementations of the scancount algorithm: C++ header-only library
C++
24
star
50

SIMDgameoflife

Vectorized (AVX) version of the game of life
C
23
star
51

CMemoryUsage

Measuring memory usage in C and C++
C
23
star
52

CRoaringUnityBuild

Dumps of CRoaring unity builds (for convenience)
C
23
star
53

SwiftWyhash

Fast random number generator in pure Swift
Swift
23
star
54

kodakimagecollection

Set of images made available royalty-free by Kodak
22
star
55

sparsebitmap

A simple sparse bitmap implementation in java
Java
21
star
56

IndexWikipedia

A simple utility to index wikipedia dumps using Lucene.
Java
21
star
57

PiecewiseLinearTimeSeriesApproximation

code from Daniel Lemire, A Better Alternative to Piecewise Linear Time Series Segmentation, SIAM Data Mining 2007, 2007.
C++
21
star
58

Concise

C++ implementation of Concise and WAH compressed bitsets
C++
19
star
59

MemoryLanes

iOS app to test memory-level parallelism
C++
18
star
60

talks

Random material having to do with Daniel Lemire's talks
C++
18
star
61

fastrandom

Go
18
star
62

crackingxoroshiro128plus

How to "crack" xoroshiro128+: from two outputs, derive a possible seed
Python
18
star
63

RealisticTabularDataSets

Some realistic tabular datasets for testing (CSV)
17
star
64

StarSchemaBenchmark

O'Neil et al.'s Star Schema Benchmark: curated code
C
16
star
65

simplebitmapbenchmark

Simple benchmark between compressed bitmap libraries in Java
Java
15
star
66

vectorclass

Random number generator for large applications using vector instructions
C++
15
star
67

simple_fastfloat_benchmark

C
15
star
68

zobristhashing

Zobrist hashing in C
C
14
star
69

BitSliceIndex

Experiments on bit-slice indexing
Java
14
star
70

SIMDIntersections

Vectorized intersections (research code)
C++
14
star
71

microbenchmarks

Private microbenchmarks
Java
13
star
72

backward_multiplication

Multiplying... backward?
C++
12
star
73

pospopcnt_avx512

benchmarking positional population count
Assembly
11
star
74

costofsafety

Quick experiment to see how expensive safety is in C, for research
C
11
star
75

constantdivisionbenchmarks

Benchmarks for constant-division problems (not a library! for research only!)
C++
11
star
76

createfasthash

Code from article http://locklessinc.com/articles/fast_hash/
C
9
star
77

SwiftCallingCHeader

Calling a C header from Swift (example)
Swift
8
star
78

arraylayout

Pat Morin's arraylayout
TeX
8
star
79

WebAssemblyVSJavaScript

Project to compare the performance of WebAssembly with JavaScript
JavaScript
7
star
80

ComputerLanguageBenchmark

Captured as http://benchmarksgame.alioth.debian.org is closing
HTML
7
star
81

rowreorderingcpplibrary

This is a set of row-reordering algorithms and data compression compression schemes implemented in C++.
C++
6
star
82

notesdecours

Notes de cours
Java
6
star
83

gobitmapbenchmark

A Go project to benchmark various bitmap implementations (this is not a library!)
Go
6
star
84

gutenberg-headers

Removing Manually-Generated Boilerplate from Project Gutenberg e-Books
Java
6
star
85

iosbitmapdecoding

Experimenting with bitmap decoding on ios
C++
6
star
86

simple_simdjson_python_wrapper

Simple use of simdjson from python (for research purposes)
C++
6
star
87

citationdata

Influential references dataset
6
star
88

fastheap

Fast heap data structures in Go (golang)
Go
6
star
89

pythonmaxmin

Fast minimum-maximal filter in Python
Python
6
star
90

jstypes

Doing C-like arithmetic and logical operations in JavaScript (full 64-bit support)
JavaScript
5
star
91

fast_float_supplemental_tests

Supplemental tests for the fast_float library (credit: Nigel Tao)
C++
5
star
92

HashVSTree

Do hash table use more memory than trees? A case study in Java.
Java
5
star
93

hierarchicalbinbuffering

Hierarchical Bin Buffering C++ Library
C++
4
star
94

RoaringVSConciseBenchmark

A benchmark comparing roaring and concise
Java
4
star
95

DivisionBenchmark

A not-so-exciting benchmark to measure the performance of some division functions
C++
4
star
96

exercices_lucene

Exercises pour s'approprier lucene, le moteur de recherche
Java
4
star
97

jackson-json-bench

A silly benchmark for Jackson (JSON parser)
Java
4
star
98

roaring_rust_benchmark

Basic benchmark to compare different Roaring bitmap implementations in Rust
Rust
4
star
99

datasciencebook

Python
4
star
100

viewsizeestimation

Unassuming hashing-based view-size estimation techniques
C++
4
star