HPI Data Engineering Systems (@hpides)
  • Stars
    star
    166
  • Global Org. Rank 44,693 (Top 15 %)
  • Registered over 5 years ago
  • Most used languages
    C++
    41.2 %
    Python
    17.6 %
    Java
    11.8 %
    TeX
    11.8 %
    Cuda
    11.8 %
    Shell
    5.9 %
  • Location 🇩🇪 Germany
  • Country Total Rank 7,125
  • Country Ranking
    Cuda
    28
    TeX
    426
    C++
    586
    Java
    3,068
    Shell
    7,631
    Python
    7,689

Top repositories

1

viper

Viper: A hybrid PMem-DRAM Key-Value Store for Persistent Memory (VLDB '21)
C++
73
star
2

vectorized-hash-tables

Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.
C++
22
star
3

pmem-olap

This repository contains the source code for our ACM SIGMOD '21 paper (Maximizing Persistent Memory Bandwidth Utilization for OLAP Workloads)
C++
19
star
4

perma-bench

A benchmarking suite to evaluate the performance of persistent memory access (PerMA-Bench @ VLDB '22)
C++
17
star
5

pmem-nvme-dropin

This repository contains the code for our DaMoN '21 paper.
C++
11
star
6

autovec-db

Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"
C++
11
star
7

thesis-proposal-template

This is a Latex template for thesis proposals at the Data Engineering Systems Group.
TeX
8
star
8

disco

Stream processing engine for distributed window aggregation (EDBT '20)
Java
7
star
9

multi-gpu-sorting

This repository contains the source code for our ACM SIGMOD '22 paper (Evaluating Multi-GPU Sorting with Modern Interconnects)
Cuda
5
star
10

mexico-flink-tutorial

Java
5
star
11

End-to-end-ML-System-Benchmark

A modular suite for benchmarking all stages of Machine Learning pipelines. To find bottlenecks in such pipelines and compare different ML tools, this framework can calculate and visualize several metrics in the data preparation, model training, model validation and inference stages.
Python
5
star
12

rmg-sort

RMG Sort: Radix-Partitioning-Based Multi-GPU Sorting (BTW '23)
Cuda
4
star
13

mp-ddsp-ws20

C++
2
star
14

mmlib

Efficiently Managing Deep Learning Models in a Distributed Environment (Awarded as best paper @ EDBT 2022)
Python
2
star
15

inferdb

Code for "InferDB: In-Database Machine Learning Inference Using Indexes"
Python
2
star
16

BDL

Code and Tutorials for Big Data Lab
Shell
1
star
17

thesis-template

This is a Latex template for theses at the Data Engineering Systems Group.
TeX
1
star