• Stars
    star
    1
  • Language
    Python
  • License
    Apache License 2.0
  • Created about 3 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Benchmarking query engines with pre-computation on cloud

More Repositories

1

marlin

A Distributed Matrix Operations Library Built on Top of Spark
Scala
104
star
2

MR-Course-Assignments

Assignments for courses of MapReduce
Shell
31
star
3

dolphin

Dolphin - a Deep Learning on MIC architecture Project.
C++
24
star
4

SmartFD

SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms
Scala
17
star
5

tachyon-perf

A General Performance Test Framework for Tachyon
Java
16
star
6

dfs-perf

A general performance test framework for Distributed File System
Java
14
star
7

NAS-CTR

Python
12
star
8

DGST

DGST: Efficient and Scalable Generalized Suffix Tree Construction on Apache Spark
Scala
11
star
9

Liquid

Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters
Python
10
star
10

DIFER

Python
10
star
11

forestlayer

ForestLayer: Efficient and scalable deep forest learning library based on Ray
Python
10
star
12

AdaMCL

PyTorch implementation of AdaCML
Python
10
star
13

BigSpa

A framework for large-sacle static program analysis.
Java
8
star
14

seal

Training Large Scale Statistical Machine Translation Models on Spark
Scala
4
star
15

Octopus-DF

A cross-platfrom pandas-like Dataframe based on Pandas, Spark and Dask.
Python
4
star
16

HAGNN

Hybrid Aggregation for Heterogeneous Graph Neural Networks
Python
4
star
17

PSP

Python
4
star
18

cichlid

Cichlid is a distributed RDFS & OWL reasoning system based on Spark.
Scala
4
star
19

EAAFE

The codes for paper "Evolutionary Automated Feature Engineering"
Python
3
star
20

Coral

Coral: Federated Query Join Order Optimization Based on Deep Reinforcement Learning
Java
2
star
21

Magpie

Efficient Big Data Query System Parameter Optimization based on Pre-selection and Search Pruning Approach
Java
2
star
22

trasa

Implementation for Transition Relation Aware Self-Attention for Session-based Recommendation
Python
2
star
23

AutoAC

Python
2
star
24

SparkDQ

The code repository for SparkDQ, a big data quality management system.
Python
2
star
25

PMPAS

Python
1
star
26

TSSE

Topic model for short text
Python
1
star
27

PGA

partial attack for graph global attack
Jupyter Notebook
1
star
28

FSClientCache

code repo for file system client-side cache paper
Java
1
star
29

UniGPS

Unified Graph Programming Framework
1
star
30

GADAM

1
star