uber-research/ape-x

Stars
188
Rank 205,563 (Top 5 %)
Language
Python
License
Apache License 2.0
Created over 6 years ago
Updated over 5 years ago

This repo replicates the results that Horgan et al. obtained in "Distributed Prioritized Experience Replay".

Replication of Ape-X (Distributed Prioritized Experience Replay)

This repo replicates the results that Horgan et al. obtained in:

[1] Distributed Prioritized Experience Replay

Our code is based on OpenAI Baselines; the original code and the related paper are available from OpenAI. Their DQN implementation was modified to use TensorFlow custom ops.

Although Ape-X was originally a distributed algorithm, this implementation is meant to maximize throughput on a single machine. It is optimized for 2 GPUs (one for data gathering, one for optimization) but could be modified to use only one. With 2 GPUs and 20 to 40 CPUs, you should be able to reach human median performance in about 2 hours.
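For readers unfamiliar with the mechanism named in the title, the following is a minimal sketch of proportional prioritized experience replay in plain Python. It is not this repo's implementation, which relies on TensorFlow custom ops. The class name `PrioritizedReplayBuffer`, its method names, and the default values of `alpha` and `beta` are illustrative assumptions, and the list-based sampling is O(n) per batch, where a practical buffer would use a sum tree.

```python
import random


class PrioritizedReplayBuffer:
    """Toy proportional prioritized replay buffer (hypothetical, list-based)."""

    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha      # assumed default: how strongly priorities skew sampling
        self.items = []         # stored transitions
        self.priorities = []    # one scaled priority per stored transition
        self.next_index = 0     # slot to overwrite once the buffer is full

    def add(self, transition, priority):
        """Store a transition with an initial priority (for example its |TD error|)."""
        scaled = max(priority, 1e-6) ** self.alpha
        if len(self.items) < self.capacity:
            self.items.append(transition)
            self.priorities.append(scaled)
        else:
            # Buffer is full: overwrite the oldest entry.
            self.items[self.next_index] = transition
            self.priorities[self.next_index] = scaled
        self.next_index = (self.next_index + 1) % self.capacity

    def sample(self, batch_size, beta=0.4, rng=random):
        """Sample with probability proportional to the scaled priority."""
        total = sum(self.priorities)
        probs = [p / total for p in self.priorities]
        n = len(self.items)
        indices = rng.choices(range(n), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias of non-uniform sampling;
        # they are normalized so the largest weight in the batch is 1.
        weights = [(n * probs[i]) ** (-beta) for i in indices]
        max_weight = max(weights)
        weights = [w / max_weight for w in weights]
        return indices, [self.items[i] for i in indices], weights

    def update_priorities(self, indices, new_priorities):
        """Replace priorities after the learner recomputes TD errors."""
        for i, p in zip(indices, new_priorities):
            self.priorities[i] = max(p, 1e-6) ** self.alpha
```

In a training loop, the data-gathering side would call `add`, and the optimizer side would alternate `sample` and `update_priorities`. Transitions with larger errors are then replayed more often, while the importance weights scale down their contribution to the loss.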

How to run

clone repo

git clone https://github.com/uber-research/ape-x.git

create python3 virtual env

python3 -m venv env
. env/bin/activate

install requirements

pip install tensorflow-gpu gym

Follow the setup under gym_tensorflow/README.md and run ./make to compile the custom ops.

launch experiment

python apex.py --env video_pinball --num-timesteps 1000000000 --logdir=/tmp/agent

monitor results with TensorBoard

tensorboard --logdir=/tmp/agent

visualize results

python demo.py --env video_pinball --logdir=/tmp/agent
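The launch, monitoring, and visualization steps above share the same environment name and log directory, so it can be convenient to build all three commands from one place. The helper below is a hypothetical convenience sketch, not part of the repo: `build_commands` is an assumed name, and it uses only the flags shown in the commands above.

```python
import shlex


def build_commands(env="video_pinball", logdir="/tmp/agent",
                   num_timesteps=1_000_000_000):
    """Return the train, monitor, and demo commands as argument lists.

    Hypothetical helper: it only reassembles the flags that appear in the
    run instructions (--env, --num-timesteps, --logdir).
    """
    train = ["python", "apex.py", "--env", env,
             "--num-timesteps", str(num_timesteps), f"--logdir={logdir}"]
    monitor = ["tensorboard", f"--logdir={logdir}"]
    demo = ["python", "demo.py", "--env", env, f"--logdir={logdir}"]
    return {"train": train, "monitor": monitor, "demo": demo}


if __name__ == "__main__":
    # Print the commands for copy-pasting; pass a list to subprocess.run
    # instead if you want to launch a step from Python.
    for name, argv in build_commands().items():
        print(f"{name}: {shlex.join(argv)}")
```

Keeping the log directory in a single argument avoids the easy mistake of training into one directory and pointing TensorBoard or the demo script at another.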

deep-neuroevolution

Deep Neuroevolution

PPLM

Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.

UPSNet

UPSNet: A Unified Panoptic Segmentation Network

go-explore

Code for Go-Explore: a New Approach for Hard-Exploration Problems

PyTorch-NEAT

LaneGCN

[ECCV2020 Oral] Learning Lane Graph Representations for Motion Forecasting

sbnet

Sparse Blocks Networks

differentiable-plasticity

Implementations of the algorithms described in Differentiable plasticity: training plastic networks with gradient descent, a research paper from Uber AI Labs.

DeepPruner

DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch (ICCV 2019)

parallax

Tool for interactive embeddings visualization

learning-to-reweight-examples

Code for paper "Learning to Reweight Examples for Robust Deep Learning"

jpeg2dct

poet

Paired Open-Ended Trailblazer (POET) and Enhanced POET

intrinsic-dimension

CoordConv

atari-model-zoo

A binary release of deep reinforcement learning models trained on the Atari machine learning benchmark, and a software release that enables easy visualization and analysis of models and comparison across training algorithms.

EvoGrad

TuRBO

safemutations

permute-quantize-finetune

Using ideas from product quantization for state-of-the-art neural network compression.

deconstructing-lottery-tickets

CRISP

metropolis-hastings-gans

GTN

backpropamine

Train self-modifying neural networks with neuromodulated plasticity

loss-change-allocation

MARVIN

Uber's Multi-Agent Routing Value Iteration Network

GOCC

Synthetic-Petri-Dish

RxThreadEffectChecker

Static checker for Rx Threading Effects, based on the Checker Framework

Map-Elites-Evolutionary

Map-Elites based on Evolution Strategies

D3G

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

java-dependency-validator

Dependency validator detects runtime compatibility issues at build time

vargp

Variational Auto-Regressive Gaussian Processes for Continual Learning

normative-uncertainty

Evolvability-ES

brezel

dispatch-optim

Constraint-based optimization

ga-world-models

FSDM

Code for the SIGDIAL 2019 paper "Flexibly-Structured Model for Task-Oriented Dialogues". It implements an end-to-end differentiable deep learning dialogue system model.

rl-controller-verification

Quadcopter Verification

go-context-propagate

last-diff-analyzer

A multi-language tool for checking semantic equivalence for code

presto-HDFS-read-data

A dump of some of our Presto logs, for use as part of ongoing Presto/HDFS research and presentations.

xplane-bazel-docker

tailr