  • Language: C++
  • License: BSD 3-Clause "New...
  • Created almost 10 years ago
  • Updated almost 2 years ago

Repository Details

A fork of Google's LevelDB with added zlib compression and windows support

LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

Authors: Sanjay Ghemawat and Jeff Dean

Features

  • Keys and values are arbitrary byte arrays.
  • Data is stored sorted by key.
  • Callers can provide a custom comparison function to override the sort order.
  • The basic operations are Put(key,value), Get(key), Delete(key).
  • Multiple changes can be made in one atomic batch.
  • Users can create a transient snapshot to get a consistent view of data.
  • Forward and backward iteration is supported over the data.
  • Data is automatically compressed using the Snappy compression library.
  • External activity (file system operations etc.) is relayed through a virtual interface so users can customize the operating system interactions.
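
The ordering behaviour described in the list above can be modelled without LevelDB itself. The following is a minimal sketch, assuming an in-memory `std::map` as a stand-in for the on-disk store. `ReverseBytewise`, `OrderedStore`, `KeysForward`, and `KeysBackward` are hypothetical names used for illustration only and are not part of the LevelDB API.

```cpp
#include <functional>
#include <map>
#include <string>
#include <vector>

// Hypothetical comparator: sorts keys in descending bytewise order,
// standing in for a caller-supplied comparison function.
struct ReverseBytewise {
  bool operator()(const std::string& a, const std::string& b) const {
    return a > b;
  }
};

// In-memory stand-in for an ordered key-value store (this is not LevelDB).
// The default ordering is ascending bytewise order of the keys.
template <typename Compare = std::less<std::string>>
using OrderedStore = std::map<std::string, std::string, Compare>;

// Collects the keys by forward iteration, in the store's sort order.
template <typename Store>
std::vector<std::string> KeysForward(const Store& store) {
  std::vector<std::string> keys;
  for (auto it = store.begin(); it != store.end(); ++it) {
    keys.push_back(it->first);
  }
  return keys;
}

// Collects the keys by backward iteration, in reverse sort order.
template <typename Store>
std::vector<std::string> KeysBackward(const Store& store) {
  std::vector<std::string> keys;
  for (auto it = store.rbegin(); it != store.rend(); ++it) {
    keys.push_back(it->first);
  }
  return keys;
}
```

With `OrderedStore<>` the keys come back in ascending order regardless of insertion order. Swapping in `OrderedStore<ReverseBytewise>` changes the order seen by both iteration directions, which is the effect a custom comparison function has on a sorted store.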

Documentation

LevelDB library documentation is online and bundled with the source code.

Limitations

  • This is not a SQL database. It does not have a relational data model, it does not support SQL queries, and it has no support for indexes.
  • Only a single process (possibly multi-threaded) can access a particular database at a time.
  • There is no client-server support built into the library. An application that needs such support will have to wrap its own server around the library.

Contributing to the leveldb Project

The leveldb project welcomes contributions. leveldb's primary goal is to be a reliable and fast key/value store. Changes that are in line with the features/limitations outlined above, and meet the requirements below, will be considered.

Contribution requirements:

  1. POSIX only. We will generally only accept changes that are both compiled and tested on a POSIX platform, usually Linux. Very small changes will sometimes be accepted, but consider that the exception rather than the rule.

  2. Stable API. We strive very hard to maintain a stable API. Changes that would require projects using leveldb to change their code might be rejected unless they bring sufficient benefit to the project.

  3. Tests: All changes must be accompanied by a new (or changed) test, or a sufficient explanation as to why a new (or changed) test is not required.

Submitting a Pull Request

Before any pull request will be accepted the author must first sign a Contributor License Agreement (CLA) at https://cla.developers.google.com/.

Squash your changes down to a single commit and rebase on google/leveldb/master. This keeps the commit timeline linear and more easily synced with the internal repository at Google. More information is available on GitHub's About Git rebase page.

Performance

Here is a performance report (with explanations) from the run of the included db_bench program. The results are somewhat noisy, but should be enough to get a ballpark performance estimate.

Setup

We use a database with a million entries. Each entry has a 16 byte key, and a 100 byte value. Values used by the benchmark compress to about half their original size.

LevelDB:    version 1.1
Date:       Sun May  1 12:11:26 2011
CPU:        4 x Intel(R) Core(TM)2 Quad CPU    Q6600  @ 2.40GHz
CPUCache:   4096 KB
Keys:       16 bytes each
Values:     100 bytes each (50 bytes after compression)
Entries:    1000000
Raw Size:   110.6 MB (estimated)
File Size:  62.9 MB (estimated)

Write performance

The "fill" benchmarks create a brand new database, in either sequential, or random order. The "fillsync" benchmark flushes data from the operating system to the disk after every operation; the other write operations leave the data sitting in the operating system buffer cache for a while. The "overwrite" benchmark does random writes that update existing keys in the database.

fillseq      :       1.765 micros/op;   62.7 MB/s
fillsync     :     268.409 micros/op;    0.4 MB/s (10000 ops)
fillrandom   :       2.460 micros/op;   45.0 MB/s
overwrite    :       2.380 micros/op;   46.5 MB/s

Each "op" above corresponds to a write of a single key/value pair. For example, the random write benchmark (fillrandom) runs at approximately 400,000 writes per second.

Each "fillsync" operation costs much less (0.3 millisecond) than a disk seek (typically 10 milliseconds). We suspect that this is because the hard disk itself is buffering the update in its memory and responding before the data has been written to the platter. Whether this is safe depends on whether the hard disk has enough power to save its memory in the event of a power failure.

Read performance

We list the performance of reading sequentially in both the forward and reverse direction, and also the performance of a random lookup. Note that the database created by the benchmark is quite small. Therefore the report characterizes the performance of leveldb when the working set fits in memory. The cost of reading a piece of data that is not present in the operating system buffer cache will be dominated by the one or two disk seeks needed to fetch the data from disk. Write performance will be mostly unaffected by whether or not the working set fits in memory.

readrandom  : 16.677 micros/op;  (approximately 60,000 reads per second)
readseq     :  0.476 micros/op;  232.3 MB/s
readreverse :  0.724 micros/op;  152.9 MB/s

LevelDB compacts its underlying storage data in the background to improve read performance. The results listed above were measured immediately after a lot of random writes. The results after compactions (which are usually triggered automatically) are better.

readrandom  : 11.602 micros/op;  (approximately 85,000 reads per second)
readseq     :  0.423 micros/op;  261.8 MB/s
readreverse :  0.663 micros/op;  166.9 MB/s

Some of the high cost of reads comes from repeated decompression of blocks read from disk. If we supply enough cache to leveldb so it can hold the uncompressed blocks in memory, the read performance improves again:

readrandom  : 9.775 micros/op;  (approximately 100,000 reads per second before compaction)
readrandom  : 5.215 micros/op;  (approximately 190,000 reads per second after compaction)

Repository contents

See doc/index.md for more explanation. See doc/impl.md for a brief overview of the implementation.

The public interface is in include/*.h. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Guide to header files:

  • include/db.h: Main interface to the DB: Start here

  • include/options.h: Control over the behavior of an entire database, and also control over the behavior of individual reads and writes.

  • include/comparator.h: Abstraction for user-specified comparison function. If you want just bytewise comparison of keys, you can use the default comparator, but clients can write their own comparator implementations if they want custom ordering (e.g. to handle different character encodings, etc.)

  • include/iterator.h: Interface for iterating over data. You can get an iterator from a DB object.

  • include/write_batch.h: Interface for atomically applying multiple updates to a database.

  • include/slice.h: A simple module for maintaining a pointer and a length into some other byte array.

  • include/status.h: Status is returned from many of the public interfaces and is used to report success and various kinds of errors.

  • include/env.h: Abstraction of the OS environment. A POSIX implementation of this interface is in util/env_posix.cc.

  • include/table.h, include/table_builder.h: Lower-level modules that most clients probably won't use directly
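
Two ideas from the header guide above, a pointer-plus-length view into bytes owned elsewhere and plain bytewise key comparison, can be sketched with the standard library alone. `ByteView`, `ViewOf`, and `CompareBytewise` are hypothetical names for this illustration. They are not the declarations found in include/slice.h or include/comparator.h.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstring>
#include <string>

// Hypothetical non-owning view: a pointer and a length into bytes that are
// owned by some other object. The owner must outlive the view.
struct ByteView {
  const char* data;
  std::size_t size;
};

// Builds a view over the bytes of an existing string (no copy is made).
inline ByteView ViewOf(const std::string& s) {
  return ByteView{s.data(), s.size()};
}

// Three-way bytewise comparison: returns a negative value, zero, or a
// positive value when `a` sorts before, equal to, or after `b`.
// A view that is a strict prefix of another sorts first.
inline int CompareBytewise(ByteView a, ByteView b) {
  const std::size_t min_len = std::min(a.size, b.size);
  int result = (min_len == 0) ? 0 : std::memcmp(a.data, b.data, min_len);
  if (result == 0) {
    if (a.size < b.size) {
      result = -1;
    } else if (a.size > b.size) {
      result = 1;
    }
  }
  return result;
}
```

Because the length is carried explicitly, keys may contain embedded zero bytes, which fits the "arbitrary byte arrays" feature. The view should only be built from a named object that stays alive, never from a temporary string.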
