MIT DB Group (@mitdbg)

Top repositories

1

treeline

An update-in-place key-value store for modern storage.
C++
130
star
2

fastdeepnets

TeX
112
star
3

deneva

Deneva is a distributed in-memory database framework that supports the evaluation of various concurrency control algorithms.
C++
110
star
4

aurum-datadiscovery

Python
74
star
5

palimpzest

A Declarative System for Optimizing AI Workloads
Python
48
star
6

asciiclass

Notes and Labs for Advanced Topics in Data Processing
JavaScript
38
star
7

ml-class-iap2017

20
star
8

AdaptDB

Java
16
star
9

amoeba

Java
16
star
10

bigdata

MIT Big Data Challenge
JavaScript
14
star
11

twitinfo

A timeline-based visualization of events as they are discussed on Twitter
Python
14
star
12

lazo

Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
Java
13
star
13

imputedb

A database with automatic dynamic imputation of missing values.
Java
10
star
14

datascienceclass

Software Systems for Data Science Main Repo
Jupyter Notebook
9
star
15

genbase

Code for GenBase: complex analytics based genomics benchmark
R
6
star
16

XSystem

XSystem: Extracting Syntactical Patterns from Databases
Scala
6
star
17

modeldb-notebooks

Scala
5
star
18

ycsbr

Customizable synthetic workload generator and runner.
C++
4
star
19

brad

A virtualization layer for cloud data infrastructures.
Python
4
star
20

iap-class

Resources for the "Programming with Data" IAP class.
Jupyter Notebook
3
star
21

forecache-code

ForeCache codebase.
JavaScript
3
star
22

wifivis

Visualize some mit wifi access point data
Python
3
star
23

logos

Human-in-the-Loop Causal Analysis of Log Files
Python
3
star
24

learnedsystems-www

JavaScript
2
star
25

confo

Python
2
star
26

purk

hit layer infrastructure
1
star