ChuaCheowHuan/PBT_MARL_watered_down

Stars: 6
Rank: 2,539,965 (Top 51%)
Language: Jupyter Notebook
License: MIT License
Created: over 4 years ago
Updated: over 1 year ago

ChuaCheowHuan


My attempt to reproduce a watered-down version of PBT (population-based training) for MARL (multi-agent reinforcement learning) using DDPPO (decentralized & distributed proximal policy optimization) from ray[rllib].

gym-continuousDoubleAuction

A custom MARL (multi-agent reinforcement learning) environment where multiple agents trade against one another (self-play) in a zero-sum continuous double auction. Ray [RLlib] is used for training.

Jupyter Notebook

121

reinforcement_learning

My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.

Jupyter Notebook

web_app_DPDTH

Sample CI/CD DevOps setup for a Django & Postgres web app with Docker, Travis & Heroku.

JavaScript

sagemaker_Ray_RLlib_custom_env

Sample setup for a custom reinforcement learning environment in SageMaker. This example uses Proximal Policy Optimization with Ray (RLlib).

Python

bayesian_ML

Bayesian machine learning implementations (GMM, VAE & conditional VAE).

Jupyter Notebook

ChuaCheowHuan/PBT_MARL_watered_down
