RchalYang/torchrl

Stars
206
Rank 189,437 (Top 4 %)
Language
Python
Created almost 6 years ago
Updated about 2 years ago

RchalYang/torchrl

RchalYang

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)

TorchRL

Pytorch Implementation for RL Methods

Environments with continuous & discrete action space are supported.

Environments with 1d & 3d observation space are supported.

Multi-Process Env is supported

Requirements

General Requirements

Pytorch 1.7
Gym(0.10.9)
Mujoco(1.50.1)
tabulate (for log)
tensorboardX (log file output)

Tensorboard Requirements

Tensorflow: to start tensorboard or read log in tf records

Installation

use use environment.yml to create virtual envrionment

    conda create -f environment.yml
    source activate py_off

Mannually install all requirements

Usage

specify parameters for algorithms in config file & specify log directory / seed / device in argument

    python examples/ppo_continuous_vec.py --config config/ppo_halfcheetah.json --seed 0 --device 0 --id ppo_halfcheetah

Checkout examples folder for detailed informations

Currently contains:

On-Policy Methods:
- Reinforce
- A2C(Actor Critic)
- PPO(Proximal Policy Optimization)
- TRPO
Off-Policy Methods:
- Soft Actor Critic: SAC(TwinSAC)
- Deep Deterministic Policy Gradient :DDPG
- TD3
- DQN:
  - Basic Double DQN
  - Bootstrapped DQN
  - QRDQN

Soft-Module

Code for "Multi-task Reinforcement Learning with Soft Modularization"

RLPaperList

Personal Repo to keep track of RL papers

offlinerl

Repo for offline reinforcement learning methods

sdfood_solo

pytorch_on_policy_agent

Pytorch Implementation for Policy Gradient Methods

Introduction-to-Algorithm

homework for class Introduction to Algorithm at NKU