  • Stars
    4
  • Rank 3,304,323 (Top 66 %)
  • Language
    Scala
  • Created almost 7 years ago
  • Updated almost 7 years ago

Repository Details

Training Large Scale Statistical Machine Translation Models on Spark

More Repositories

1

marlin

A Distributed Matrix Operations Library Built on Top of Spark
Scala
105
star
2

MR-Course-Assignments

Assignments for MapReduce courses
Shell
31
star
3

dolphin

Dolphin - a Deep Learning on MIC architecture Project.
C++
24
star
4

SmartFD

SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms
Scala
17
star
5

tachyon-perf

A General Performance Test Framework for Tachyon
Java
16
star
6

dfs-perf

A general performance test framework for Distributed File System
Java
13
star
7

DGST

DGST: Efficient and Scalable Generalized Suffix Tree Construction on Apache Spark
Scala
12
star
8

NAS-CTR

Python
12
star
9

Liquid

Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters
Python
10
star
10

DIFER

Python
10
star
11

forestlayer

ForestLayer: Efficient and scalable deep forest learning library based on Ray
Python
10
star
12

AdaMCL

PyTorch implementation of AdaMCL
Python
10
star
13

BigSpa

A framework for large-scale static program analysis.
Java
9
star
14

HAGNN

Hybrid Aggregation for Heterogeneous Graph Neural Networks
Python
7
star
15

GADAM

5
star
16

Octopus-DF

A cross-platform pandas-like DataFrame based on Pandas, Spark and Dask.
Python
4
star
17

PSP

Python
4
star
18

cichlid

Cichlid is a distributed RDFS & OWL reasoning system based on Spark.
Scala
4
star
19

EAAFE

The code for the paper "Evolutionary Automated Feature Engineering"
Python
3
star
20

AutoAC

Python
3
star
21

AutoMTL

The official implementation of the paper *Automatic Multi-Task Learning Framework with Neural Architecture Search in Recommendations*
Python
3
star
22

Coral

Coral: Federated Query Join Order Optimization Based on Deep Reinforcement Learning
Java
2
star
23

Magpie

Efficient Big Data Query System Parameter Optimization based on Pre-selection and Search Pruning Approach
Java
2
star
24

trasa

Implementation for Transition Relation Aware Self-Attention for Session-based Recommendation
Python
2
star
25

PGA

Partial attack for graph global attack
Jupyter Notebook
2
star
26

SparkDQ

The code repository for SparkDQ, a big data quality management system.
Python
2
star
27

CasMLN

Python
2
star
28

LLM_Paper_Learning

LLM Papers We Recommend to Read
2
star
29

PMPAS

Python
1
star
30

TSSE

Topic model for short text
Python
1
star
31

Raven

Benchmarking query engines with pre-computation on cloud
Python
1
star
32

FSClientCache

code repo for file system client-side cache paper
Java
1
star
33

UniGPS

Unified Graph Programming Framework
1
star
34

SAGNAS

Python
1
star