Delft's Data Management Group (@delftdata)

Top repositories

1

valentine

A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching methods.
Python
80
star
2

flink-statefun-transactions

Transactions for Stateful Functions as a Service. This repository implements an API and associated underpinnings for two-phase commit and SAGAs on Apache Flink's StateFun.
Java
24
star
3

stateflow

Prototype which extracts stateful dataflows by analysing Python code.
Python
17
star
4

styx

Styx: Transactional Stateful Functions on Streaming Dataflows
Python
14
star
5

wdm-project-benchmark

Benchmarking suite for the Web-Scale Data Management course using Locust
Python
12
star
6

valentine-system

Valentine scalable deployment for VLDB demo
Python
8
star
7

master-thesis-kit

A kit that all master's thesis students need to write a thesis with the Delta team.
TeX
7
star
8

checkmate

CheckMate: Evaluating Checkpointing Protocols for Streaming Dataflows
Python
6
star
9

FERDiS

C#
5
star
10

reinforcement_learning_augmentation

Modified code and experiments from the "Feature augmentation with reinforcement learning" paper
Python
4
star
11

wdm-project-template

Template project for TU Delft's Web-scale Data Management course
Python
4
star
12

s-query

S-Query, a novel system for querying stateful stream processors. Implemented on top of Hazelcast Jet.
Java
3
star
13

autofeat

Source code for augmenting relational datasets through join paths
Jupyter Notebook
3
star
14

valentine-data-fabricator

The data generator used to produce the datasets in the paper "The Valentine Experiment Suite for Schema Matching"
Python
2
star
15

valentine-paper-results

The output produced by the Valentine Experiment Suite included in the paper "The Valentine Experiment Suite for Schema Matching"
Jupyter Notebook
2
star
16

bsc_research_project_q4_2023

Code base for BSc Research Project Q4/2023 - Group 19
Python
2
star
17

espa-autoscaling

Python
2
star
18

hci-auto-feat

Human-in-the-loop Feature Discovery with AutoFeat
Jupyter Notebook
1
star
19

paper-template-one-pager

TeX
1
star
20

clonos-web

Ruby
1
star
21

stateflow-evaluation

Jupyter Notebook
1
star
22

SiMa

Jupyter Notebook
1
star
23

flink-test-scripts

A set of Bash scripts that repeatedly execute a native example from Flink's code base, kill a task manager, and check whether the job resumes successfully.
Python
1
star
24

COMA

HTML
1
star
25

s3j-adaptive-similarity-joins

Code repository for "Adaptive Distributed Streaming Similarity Joins", published at DEBS 2023.
Java
1
star