• Stars
    star
    1,151
  • Rank 40,523 (Top 0.8 %)
  • Language
    Python
  • Created almost 3 years ago
  • Updated 4 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
Clean, Robust, and Unified implementation of classical Deep Reinforcement Learning Algorithms


Link of my code:

Python Pytorch DRL


Recommended Resources for DRL

Books:

Online Courses:

Blogs:

Simulation Environments:


Important Papers

DQN: Mnih V, Kavukcuoglu K, Silver D, et al. Human-level control through deep reinforcement learning[J]. nature, 2015, 518(7540): 529-533.

Double DQN: Van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double q-learning[C]//Proceedings of the AAAI conference on artificial intelligence. 2016, 30(1).

PER: Schaul T, Quan J, Antonoglou I, et al. Prioritized experience replay[J]. arXiv preprint arXiv:1511.05952, 2015.

PPO: Schulman J, Wolski F, Dhariwal P, et al. Proximal policy optimization algorithms[J]. arXiv preprint arXiv:1707.06347, 2017.

DDPG: Lillicrap T P, Hunt J J, Pritzel A, et al. Continuous control with deep reinforcement learning[J]. arXiv preprint arXiv:1509.02971, 2015.

TD3: Fujimoto S, Hoof H, Meger D. Addressing function approximation error in actor-critic methods[C]//International conference on machine learning. PMLR, 2018: 1587-1596.

SAC: Haarnoja T, Zhou A, Abbeel P, et al. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor[C]//International conference on machine learning. PMLR, 2018: 1861-1870.

ASL: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity


Training Curves of my Code:

Q-learning:

DQN/DDQN on Classic Control:

DQN/DDQN on Atari Game:

Pong Enduro

Prioritized DQN/DDQN on Classic Control:

CartPole LunarLander

PPO Discrete:

PPO Continuous:

DDPG:

Pendulum LunarLanderContinuous

TD3:

SAC Continuous:

SAC Discrete:

Actor-Sharer-Learner (ASL):

More Repositories

1

PPO-Continuous-Pytorch

A clean and robust Pytorch implementation of PPO on continuous action space.
Python
115
star
2

TD3-BipedalWalkerHardcore-v2

Solve BipedalWalkerHardcore-v2 with TD3
Python
79
star
3

PPO-Discrete-Pytorch

A clean and robust Pytorch implementation of PPO on Discrete action space
Python
56
star
4

SAC-Continuous-Pytorch

a clean and robust Pytorch implementation of SAC on continuous action space
Python
53
star
5

SAC-Discrete-Pytorch

A clean and robust Pytorch implementation of SAC on discrete action space
Python
30
star
6

Duel-Double-DQN-Pytorch

A clean and robust implementation of Duel Double DQN
Python
27
star
7

OkayPlan

OkayPlan: A real-time global path palnning algorithm for dynamic environments
Python
27
star
8

TD3-Pytorch

A clean and robust Pytorch implementation of TD3 on continuous action space
Python
20
star
9

Actor-Sharer-Learner

Actor-Sharer-Learner training framework for off-policy DRL algorithms
Python
19
star
10

Prioritized-Experience-Replay-DDQN-Pytorch

A clean and robust implementation of Prioritized DQN and Prioritized Double DQN
Python
17
star
11

Sparrow-V0

A Reinforcement Learning Friendly Simulator for Mobile Robot
Python
16
star
12

okayplan_ros

Real-time global path planning algorithm for dynamic environments
Python
13
star
13

DDPG-Pytorch

A clean Pytorch implementation of DDPG on continuous action space.
Python
12
star
14

Real-time-Path-planning-with-SEPSO

Efficient Real-time Path Planning with SEPSO in Dynamic Scenarios
Python
10
star
15

Color

Color: Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
10
star
16

Noisy-Duel-DDQN-Atari-Pytorch

A clean and robust implementation of Noisy-Duel-DDQN on Atari games
Python
8
star
17

Sparrow-V1

A Reinforcement Learning Friendly Simulator for Mobile Robot
Python
7
star
18

Q-learning

An implementation of Q-learning
Python
5
star
19

C51-Categorical-DQN-Pytorch

A clean and robust Pytorch implementation of Categorical DQN (C51)
Python
2
star