

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

This repository is receiving very limited maintenance. We will only review the following types of changes.

  • Fixes for critical bugs, such as data loss or memory corruption
  • Changes absolutely needed by internally supported leveldb clients. These typically fix breakage introduced by a language/standard library/OS update

Authors: Sanjay Ghemawat ([email protected]) and Jeff Dean ([email protected])

Features

  • Keys and values are arbitrary byte arrays.
  • Data is stored sorted by key.
  • Callers can provide a custom comparison function to override the sort order.
  • The basic operations are Put(key,value), Get(key), Delete(key).
  • Multiple changes can be made in one atomic batch.
  • Users can create a transient snapshot to get a consistent view of data.
  • Forward and backward iteration is supported over the data.
  • Data is automatically compressed using the Snappy compression library, but Zstd compression is also supported.
  • External activity (file system operations etc.) is relayed through a virtual interface so users can customize the operating system interactions.
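
To illustrate what an ordered mapping with a caller-supplied sort order means in practice, here is a small sketch that uses the standard library's `std::map` as a stand-in. It does not use the LevelDB API; `ShorterFirst`, `KeysForward`, and `KeysBackward` are hypothetical names chosen for this illustration only.

```cpp
#include <map>
#include <string>
#include <vector>

// Hypothetical custom comparator: shorter keys sort first, and keys of equal
// length fall back to bytewise order.
struct ShorterFirst {
  bool operator()(const std::string& a, const std::string& b) const {
    if (a.size() != b.size()) return a.size() < b.size();
    return a < b;
  }
};

// Collects keys in forward iteration order.
template <typename Map>
std::vector<std::string> KeysForward(const Map& m) {
  std::vector<std::string> keys;
  for (const auto& kv : m) keys.push_back(kv.first);
  return keys;
}

// Collects keys in backward iteration order.
template <typename Map>
std::vector<std::string> KeysBackward(const Map& m) {
  std::vector<std::string> keys;
  for (auto it = m.rbegin(); it != m.rend(); ++it) keys.push_back(it->first);
  return keys;
}
```

With the keys "b", "aa", and "c", the default bytewise order iterates as aa, b, c, while a map ordered by `ShorterFirst` iterates as b, c, aa. Backward iteration reverses each sequence.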

Documentation

LevelDB library documentation is online and bundled with the source code.

Limitations

  • This is not a SQL database. It does not have a relational data model, it does not support SQL queries, and it has no support for indexes.
  • Only a single process (possibly multi-threaded) can access a particular database at a time.
  • There is no client-server support built into the library. An application that needs such support will have to wrap its own server around the library.

Getting the Source

git clone --recurse-submodules https://github.com/google/leveldb.git

Building

This project supports CMake out of the box.

Build for POSIX

Quick start:

mkdir -p build && cd build
cmake -DCMAKE_BUILD_TYPE=Release .. && cmake --build .

Building for Windows

First generate the Visual Studio 2017 project/solution files:

mkdir build
cd build
cmake -G "Visual Studio 15" ..

The default will build for x86. For 64-bit, run:

cmake -G "Visual Studio 15 Win64" ..

To compile the Windows solution from the command-line:

devenv /build Debug leveldb.sln

or open leveldb.sln in Visual Studio and build from within.

Please see the CMake documentation and CMakeLists.txt for more advanced usage.

Contributing to the leveldb Project

This repository is receiving very limited maintenance. We will only review the following types of changes.

  • Bug fixes
  • Changes absolutely needed by internally supported leveldb clients. These typically fix breakage introduced by a language/standard library/OS update

The leveldb project welcomes contributions. leveldb's primary goal is to be a reliable and fast key/value store. Changes that are in line with the features/limitations outlined above, and meet the requirements below, will be considered.

Contribution requirements:

  1. Tested platforms only. We generally will only accept changes for platforms that are compiled and tested. This means POSIX (for Linux and macOS) or Windows. Very small changes will sometimes be accepted, but consider that more of an exception than the rule.

  2. Stable API. We strive very hard to maintain a stable API. Changes that require changes for projects using leveldb might be rejected without sufficient benefit to the project.

  3. Tests: All changes must be accompanied by a new (or changed) test, or a sufficient explanation as to why a new (or changed) test is not required.

  4. Consistent Style: This project conforms to the Google C++ Style Guide. To ensure your changes are properly formatted please run:

    clang-format -i --style=file <file>
    

We are unlikely to accept contributions to the build configuration files, such as CMakeLists.txt. We are focused on maintaining a build configuration that allows us to test that the project works in a few supported configurations inside Google. We are not currently interested in supporting other requirements, such as different operating systems, compilers, or build systems.

Submitting a Pull Request

Before any pull request will be accepted the author must first sign a Contributor License Agreement (CLA) at https://cla.developers.google.com/.

To keep the commit timeline linear, squash your changes down to a single commit and rebase on google/leveldb/main. A linear timeline is also easier to sync with the internal repository at Google. More information is available on GitHub's About Git rebase page.

Performance

Here is a performance report (with explanations) from the run of the included db_bench program. The results are somewhat noisy, but should be enough to get a ballpark performance estimate.

Setup

We use a database with a million entries. Each entry has a 16-byte key and a 100-byte value. Values used by the benchmark compress to about half their original size.

LevelDB:    version 1.1
Date:       Sun May  1 12:11:26 2011
CPU:        4 x Intel(R) Core(TM)2 Quad CPU    Q6600  @ 2.40GHz
CPUCache:   4096 KB
Keys:       16 bytes each
Values:     100 bytes each (50 bytes after compression)
Entries:    1000000
Raw Size:   110.6 MB (estimated)
File Size:  62.9 MB (estimated)
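
The raw and file size estimates above appear consistent with simple arithmetic over the setup parameters, if "MB" is read as 2^20 bytes. That reading and the helper name `EstimateMiB` are assumptions made for this sketch; the code is not taken from db_bench.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of leveldb or db_bench): estimates the total
// size, in units of 2^20 bytes, of `entries` records that each hold
// `key_bytes` of key and `value_bytes` of value.
double EstimateMiB(std::int64_t entries, int key_bytes, int value_bytes) {
  assert(entries >= 0 && key_bytes >= 0 && value_bytes >= 0);
  const double total_bytes =
      static_cast<double>(entries) * (key_bytes + value_bytes);
  return total_bytes / (1024.0 * 1024.0);
}
```

With the setup above, `EstimateMiB(1000000, 16, 100)` gives about 110.6 and `EstimateMiB(1000000, 16, 50)` gives about 62.9, which lines up with the reported raw and file sizes.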

Write performance

The "fill" benchmarks create a brand new database, in either sequential or random order. The "fillsync" benchmark flushes data from the operating system to the disk after every operation; the other write operations leave the data sitting in the operating system buffer cache for a while. The "overwrite" benchmark does random writes that update existing keys in the database.

fillseq      :       1.765 micros/op;   62.7 MB/s
fillsync     :     268.409 micros/op;    0.4 MB/s (10000 ops)
fillrandom   :       2.460 micros/op;   45.0 MB/s
overwrite    :       2.380 micros/op;   46.5 MB/s

Each "op" above corresponds to a write of a single key/value pair. That is, the random write benchmark runs at approximately 400,000 writes per second.
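
The conversion from micros/op to operations per second and MB/s can be sketched as follows. The function names are hypothetical. The 116-byte entry size (16-byte key plus 100-byte uncompressed value) and the reading of "MB" as 2^20 bytes are assumptions that happen to reproduce the write figures above.

```cpp
#include <cassert>

// Operations per second for a benchmark that reports micros/op.
double OpsPerSecond(double micros_per_op) {
  assert(micros_per_op > 0.0);
  return 1e6 / micros_per_op;
}

// Throughput in units of 2^20 bytes per second, given micros/op and the
// number of bytes handled by each op.
double MiBPerSecond(double micros_per_op, double bytes_per_op) {
  return OpsPerSecond(micros_per_op) * bytes_per_op / (1024.0 * 1024.0);
}
```

For fillrandom, `OpsPerSecond(2.460)` is roughly 406,500, which is the "approximately 400,000 writes per second" quoted above, and `MiBPerSecond(2.460, 116)` is roughly 45.0.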

Each "fillsync" operation costs much less (0.3 milliseconds) than a disk seek (typically 10 milliseconds). We suspect that this is because the hard disk itself is buffering the update in its memory and responding before the data has been written to the platter. Whether this is safe depends on whether the hard disk has enough power to save its memory in the event of a power failure.

Read performance

We list the performance of reading sequentially in both the forward and reverse direction, and also the performance of a random lookup. Note that the database created by the benchmark is quite small. Therefore the report characterizes the performance of leveldb when the working set fits in memory. The cost of reading a piece of data that is not present in the operating system buffer cache will be dominated by the one or two disk seeks needed to fetch the data from disk. Write performance will be mostly unaffected by whether or not the working set fits in memory.

readrandom  : 16.677 micros/op;  (approximately 60,000 reads per second)
readseq     :  0.476 micros/op;  232.3 MB/s
readreverse :  0.724 micros/op;  152.9 MB/s

LevelDB compacts its underlying storage data in the background to improve read performance. The results listed above were measured immediately after a lot of random writes. The results after compactions (which are usually triggered automatically) are better.

readrandom  : 11.602 micros/op;  (approximately 85,000 reads per second)
readseq     :  0.423 micros/op;  261.8 MB/s
readreverse :  0.663 micros/op;  166.9 MB/s

Some of the high cost of reads comes from repeated decompression of blocks read from disk. If we supply enough cache to leveldb so it can hold the uncompressed blocks in memory, the read performance improves again:

readrandom  : 9.775 micros/op;  (approximately 100,000 reads per second before compaction)
readrandom  : 5.215 micros/op;  (approximately 190,000 reads per second after compaction)

Repository contents

See doc/index.md for more explanation. See doc/impl.md for a brief overview of the implementation.

The public interface is in include/leveldb/*.h. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Guide to header files:

  • include/leveldb/db.h: Main interface to the DB: Start here.

  • include/leveldb/options.h: Control over the behavior of an entire database, and also control over the behavior of individual reads and writes.

  • include/leveldb/comparator.h: Abstraction for user-specified comparison function. If you want just bytewise comparison of keys, you can use the default comparator, but clients can write their own comparator implementations if they want custom ordering (e.g. to handle different character encodings, etc.).

  • include/leveldb/iterator.h: Interface for iterating over data. You can get an iterator from a DB object.

  • include/leveldb/write_batch.h: Interface for atomically applying multiple updates to a database.

  • include/leveldb/slice.h: A simple module for maintaining a pointer and a length into some other byte array.

  • include/leveldb/status.h: Status is returned from many of the public interfaces and is used to report success and various kinds of errors.

  • include/leveldb/env.h: Abstraction of the OS environment. A posix implementation of this interface is in util/env_posix.cc.

  • include/leveldb/table.h, include/leveldb/table_builder.h: Lower-level modules that most clients probably won't use directly.
