Deep Reinforcement Learning, Summer 2019 (Samsung)
This repository contains codes for Deep Reinforcement Learning (DRL) algorithms with PyTorch (v0.4.1). It also provides lecture slides that explain codes in detail.
The agents with the DRL algorithms have been implemented and trained using classic control environments in OpenAI Gym.
Table of Contents
00. Prerequisite
01. Deep Learning with PyTorch
02. Deep Q-Network (DQN) & Double DQN (DDQN)
03. Advantage Actor-Critic (A2C) & Deep Deterministic Policy Gradient (DDPG)
04. Trust Region Policy Optimization (TRPO) & Proximal Policy Optimization (PPO)
05. Soft Actor-Critic (SAC)
Learning curve
CartPole
Pendulum
Paper
- Deep Q-Network (DQN)
- Double DQN (DDQN)
- Advantage Actor-Critic (A2C)
- Asynchronous Advantage Actor-Critic (A3C)
- Deep Deterministic Policy Gradient (DDPG)
- Trust Region Policy Optimization (TRPO)
- Generalized Advantage Estimator (GAE)
- Proximal Policy Optimization (PPO)
- Soft Actor-Critic (SAC)