PPO
A Proximal Policy Optimization (PPO) implementation with TensorFlow.
https://arxiv.org/pdf/1707.06347.pdf
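For reference, the heart of PPO is the clipped surrogate objective from the paper above. The snippet below is a minimal TensorFlow sketch of that loss; the function and variable names are illustrative and do not correspond to this repository's code.

import tensorflow as tf

def clipped_surrogate_loss(log_prob, old_log_prob, advantage, epsilon=0.2):
    # probability ratio r_t(theta) = pi_theta(a|s) / pi_theta_old(a|s)
    ratio = tf.exp(log_prob - old_log_prob)
    # clip the ratio to [1 - epsilon, 1 + epsilon] as in the paper
    clipped_ratio = tf.clip_by_value(ratio, 1.0 - epsilon, 1.0 + epsilon)
    # elementwise minimum of the clipped and unclipped objectives,
    # negated so it can be minimized
    surrogate = tf.minimum(ratio * advantage, clipped_ratio * advantage)
    return -tf.reduce_mean(surrogate)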
This repository has been significantly updated since commit a4fbd383f0f89ce2d881a8b78d6b8a03294e5c7c.
The new PPO requires an additional dependency, rlsaber, my utility library that is shared across different algorithms.
Some of the design follows OpenAI Baselines, but unlike Baselines, I use default TensorFlow packages as much as possible, which makes the code easier to read.
In addition, this PPO automatically switches between continuous and discrete action spaces depending on the environment. To change hyperparameters, check atari_constants.py or box_constants.py, which are also loaded depending on the environment (see the sketch below).
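As an illustration of that switch, an environment-dependent loading step might look like the following; the gym action-space check and the way the constants module is imported here are assumptions for illustration, not the repository's exact code.

import gym

def load_constants(env):
    # Box action spaces are continuous control tasks, so use box_constants;
    # everything else (e.g. Atari-style discrete actions) uses atari_constants
    if isinstance(env.action_space, gym.spaces.Box):
        import box_constants as constants
    else:
        import atari_constants as constants
    return constants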
requirements
- Python3
dependencies
- tensorflow
- gym[atari]
- opencv-python
- git+https://github.com/imai-laboratory/rlsaber
usage
training
$ python train.py [--env env-id] [--render] [--logdir log-name]
example
$ python train.py --env BreakoutNoFrameskip-v4 --logdir breakout
playing
$ python train.py --demo --load results/path-to-model [--env env-id] [--render]
example
$ python train.py --demo --load results/breakout/model.ckpt-xxxx --env BreakoutNoFrameskip-v4 --render
performance examples
Pendulum-v0
BreakoutNoFrameskip-v4
implementation
This implementation is inspired by the following projects.
License
This repository is MIT-licensed.