• Stars
    star
    206
  • Rank 190,504 (Top 4 %)
  • Language
    C++
  • License
    MIT License
  • Created over 1 year ago
  • Updated 7 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)

BtrBlocks - Efficient Columnar Compression for Data Lakes

Paper Build

  • Paper (two-column version)
  • Video (SIGMOD 2023 presentation)

Usage

After building the library, follow one of the examples to get started.

Components

  • btrblocks/: the compression library, schemes, utilities, ...
  • btrfiles/: helper library for binary files and yaml schema information
  • tools/: various conversion, measurement and benchmarking tools
  • test/: rudimentary tests for the library

Dependency Graph

Building

We currently require an x86 platform. This library was built and tested on linux only.

mkdir build 
cd build
cmake ..

Then, depending on your usecase, build only the library or any of the tools:

  • build everything: make
  • install static library and headers on your system: sudo make install
  • build the compression library only: make btrblocks
  • build the tests make tester
  • build the in-memory decompression speed benchmark: make decompression_speed
  • ...

For a list of all valid targets, run make help.

Contributors

Adnan Alhomssi David Sauerwein Maximilian Kuschewski

License

MIT - See License File