• Stars
    star
    833
  • Rank 54,305 (Top 2 %)
  • Language
    Python
  • Created almost 3 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Clean, Robust, and Unified PyTorch implementation of popular DRL Algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Clean, Robust, and Unified implementation of classical Deep Reinforcement Learning Algorithms


Link of my code:

Python Pytorch DRL


Recommended Resources for DRL

Books:

Online Courses:

Blogs:

Simulation Environments:


Important Papers

DQN: Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. nature, 2015, 518(7540): 529-533.

Double DQN: Van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double q-learning[C]//Proceedings of the AAAI conference on artificial intelligence. 2016, 30(1).

PER: Schaul T, Quan J, Antonoglou I, et al. Prioritized experience replay[J]. arXiv preprint arXiv:1511.05952, 2015.

PPO: Schulman J, Wolski F, Dhariwal P, et al. Proximal policy optimization algorithms[J]. arXiv preprint arXiv:1707.06347, 2017.

DDPG: Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[J]. arXiv preprint arXiv:1509.02971, 2015.

TD3: Fujimoto S, Hoof H, Meger D. Addressing function approximation error in actor-critic methods[C]//International conference on machine learning. PMLR, 2018: 1587-1596.

SAC: Haarnoja T, Zhou A, Abbeel P, et al. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor[C]//International conference on machine learning. PMLR, 2018: 1861-1870.

ASL: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity


Training Curves of my Code:

Q-learning:

DQN/DDQN on Classic Control:

DQN/DDQN on Atari Game:

Pong Enduro

Prioritized DQN/DDQN on Classic Control:

CartPole LunarLander

PPO Discrete:

PPO Continuous:

DDPG:

Pendulum LunarLanderContinuous

TD3:

SAC Continuous:

SAC Discrete:

Actor-Sharer-Learner (ASL):

More Repositories

1

PPO-Continuous-Pytorch

A clean and robust Pytorch implementation of PPO on continuous action space.
Python
111
star
2

TD3-BipedalWalkerHardcore-v2

Solve BipedalWalkerHardcore-v2 with TD3
Python
79
star
3

PPO-Discrete-Pytorch

A clean and robust Pytorch implementation of PPO on Discrete action space
Python
52
star
4

SAC-Continuous-Pytorch

a clean and robust Pytorch implementation of SAC on continuous action space
Python
51
star
5

Duel-Double-DQN-Pytorch

A clean and robust implementation of Duel Double DQN
Python
27
star
6

OkayPlan

OkayPlan: A real-time global path palnning algorithm for dynamic environments
Python
27
star
7

SAC-Discrete-Pytorch

A clean and robust Pytorch implementation of SAC on discrete action space
Python
25
star
8

Actor-Sharer-Learner

Actor-Sharer-Learner training framework for off-policy DRL algorithms
Python
19
star
9

TD3-Pytorch

A clean and robust Pytorch implementation of TD3 on continuous action space
Python
18
star
10

Prioritized-Experience-Replay-DDQN-Pytorch

A clean and robust implementation of Prioritized DQN and Prioritized Double DQN
Python
17
star
11

Sparrow-V0

A Reinforcement Learning Friendly Simulator for Mobile Robot
Python
16
star
12

Real-time-Path-planning-with-SEPSO

Efficient Real-time Path Planning with SEPSO in Dynamic Scenarios
Python
10
star
13

Color

Color: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
10
star
14

DDPG-Pytorch

A clean Pytorch implementation of DDPG on continuous action space.
Python
8
star
15

Noisy-Duel-DDQN-Atari-Pytorch

A clean and robust implementation of Noisy-Duel-DDQN on Atari games
Python
8
star
16

okayplan_ros

Real-time global path planning algorithm for dynamic environments
Python
8
star
17

Q-learning

An implementation of Q-learning
Python
4
star
18

Sparrow-V1

A Reinforcement Learning Friendly Simulator for Mobile Robot
Python
3
star
19

C51-Categorical-DQN-Pytorch

A clean and robust Pytorch implementation of Categorical DQN (C51)
Python
2
star