dmlc/ps-lite

Stars
1,525
Rank 30,714 (Top 0.7 %)
Language
C++
License
Apache License 2.0
Created over 9 years ago
Updated almost 2 years ago

dmlc/ps-lite

dmlc

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

A lightweight parameter server interface

A light and efficient implementation of the parameter server framework. It provides clean yet powerful APIs. For example, a worker node can communicate with the server nodes by

Push(keys, values): push a list of (key, value) pairs to the server nodes
Pull(keys): pull the values from servers for a list of keys
Wait: wait untill a push or pull finished.

A simple example:

  std::vector<uint64_t> key = {1, 3, 5};
  std::vector<float> val = {1, 1, 1};
  std::vector<float> recv_val;
  ps::KVWorker<float> w;
  w.Wait(w.Push(key, val));
  w.Wait(w.Pull(key, &recv_val));

More features:

Flexible and high-performance communication: zero-copy push/pull, supporting dynamic length values, user-defined filters for communication compression
Server-side programming: supporting user-defined handles on server nodes

Build

ps-lite requires a C++11 compiler such as g++ >= 4.8. On Ubuntu >= 13.10, we can install it by

sudo apt-get update && sudo apt-get install -y build-essential git

Instructions for gcc 4.8 installation on other platforms:

Then clone and build

git clone https://github.com/dmlc/ps-lite
cd ps-lite && make -j4

How to use

ps-lite provides asynchronous communication for other projects:

Distributed deep neural networks: MXNet, CXXNET, Minverva, and BytePS
Distributed high dimensional inference, such as sparse logistic regression, factorization machines: DiFacto Wormhole

Research papers

Mu Li, Dave Andersen, Alex Smola, Junwoo Park, Amr Ahmed, Vanja Josifovski, James Long, Eugene Shekita, Bor-Yiing Su. Scaling Distributed Machine Learning with the Parameter Server. In Operating Systems Design and Implementation (OSDI), 2014
Mu Li, Dave Andersen, Alex Smola, and Kai Yu. Communication Efficient Distributed Machine Learning with the Parameter Server. In Neural Information Processing Systems (NIPS), 2014

xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

gluon-cv

Gluon CV Toolkit

gluon-nlp

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

nnvm

minpy

NumPy interface with mixed backend execution

mshadow

Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning

cxxnet

move forward to https://github.com/dmlc/mxnet

dlpack

common in-memory tensor structure

dmlc-core

A common bricks library for building scalable and portable distributed machine learning.

treelite

Universal model exchange and serialization format for decision tree forests

minerva

Minerva: a fast and flexible tool for deep learning on multi-GPU. It provides ndarray programming interface, just like Numpy. Python bindings and C++ bindings are both available. The resulting code can be run on CPU or GPU. Multi-GPU support is very easy.

parameter_server

moved to https://github.com/dmlc/ps-lite

mxnet-notebooks

Notebooks for MXNet

Jupyter Notebook

rabit

Reliable Allreduce and Broadcast Interface for distributed machine learning

mxnet.js

MXNetJS: Javascript Package for Deep Learning in Browser (without server)

MXNet.jl

MXNet Julia Package - flexible and efficient deep learning in Julia

tensorboard

Standalone TensorBoard for visualizing in deep learning

wormhole

mxnet-memonger

Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets

difacto

Distributed Factorization Machines

XGBoost.jl

XGBoost Julia Package

mxnet-model-gallery

Pre-trained Models of DMLC Project

GNNLens2

Visualization tool for Graph Neural Networks

HalideIR

Symbolic Expression and Statement Module for new DSLs

mxnet-gtc-tutorial

MXNet Tutorial for NVidia GTC 2016.

Jupyter Notebook

experimental-lda

MXNet.cpp

C++ interface for mxnet

experimental-mf

cache-friendly multithread matrix factorization

web-data

The repo to host all the web data including images for documents in dmlc projects.

Jupyter Notebook

nnvm-fusion

Kernel Fusion and Runtime Compilation Based on NNVM

dmlc.github.io

tl2cgen

TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models

cub

mxnet-deepmark

Benchmark speed and other issues internally, before push to deep-mark

mxnet-examples

xgboost-bench

drat

Drat Repository for DMLC R packages

nn-examples

gluon-nlp-notebooks

docs-redirect-for-mxnet

redirect mxnet.readthedocs.io to mxnet.io