  • Stars
    297
  • Rank 139,202 (Top 3 %)
  • Language
    Python
  • License
    MIT License
  • Created over 8 years ago
  • Updated over 5 years ago

Repository Details

deep-rl

Collection of Deep Reinforcement Learning algorithms.

Dependencies:

Tested with Python 2.7 and Python 3.6

Algorithms implemented so far:

  1. DDPG - Deep Deterministic Policy Gradients, evaluated on the Pendulum-v0 environment in OpenAI Gym.
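For readers unfamiliar with DDPG, two of its core computations are sketched below: the soft (Polyak) update of the target networks and the bootstrapped critic target. This is a minimal NumPy illustration and is not taken from this repository's code; the function names `soft_update` and `td_targets` and the default values of `tau` and `gamma` are assumptions chosen for the example.

```python
import numpy as np


def soft_update(target_params, online_params, tau=0.001):
    """Move each target parameter a small step toward its online counterpart.

    Computes target <- (1 - tau) * target + tau * online for every array.
    The names and the default tau are illustrative, not from the repository.
    """
    return [(1.0 - tau) * t + tau * o
            for t, o in zip(target_params, online_params)]


def td_targets(rewards, next_q, dones, gamma=0.99):
    """Critic regression targets y = r + gamma * Q'(s', mu'(s')).

    next_q holds the target critic's value for the target actor's action
    in the next state; terminal transitions (done = 1) do not bootstrap.
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    next_q = np.asarray(next_q, dtype=np.float64)
    dones = np.asarray(dones, dtype=np.float64)
    return rewards + gamma * (1.0 - dones) * next_q
```

In a full agent, `td_targets` would be evaluated on a minibatch sampled from a replay buffer, and `soft_update` would be applied to both the actor and critic target networks after each gradient step.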

Places where this code has been used

If you have used this code to do something cool, send me a link and a GIF (via email or pull request) and I'll add it here.

  1. @keithmgould used the same DDPG code to solve the inverted pendulum task in Roboschool (InvertedPendulum demo).
  2. @janscholten: Deep Reinforcement Learning with Feedback-based Exploration [code]

More Repositories

  1. neural-combinatorial-rl-pytorch - PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940 (Python, 544 stars)
  2. POMDPy - POMDPs in Python. (Python, 241 stars)
  3. awesome-hyperparams - A curated list of awesome hyperparameters for deep learning (78 stars)
  4. sinkhorn-policy-gradient.pytorch - Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient" (Python, 38 stars)
  5. EfficientMORL - EfficientMORL (ICML'21) (Python, 23 stars)
  6. REBAR-pytorch - Implementation of REBAR in PyTorch (Jupyter Notebook, 17 stars)
  7. V-RAVE - Virtual Reality Autonomous Vehicle Experience (C#, 15 stars)
  8. ppde - Official repository for "Plug & Play Directed Evolution for Proteins with Gradient-Based Discrete MCMC" (Python, 10 stars)
  9. pemami4911.github.io - https://pemami4911.github.io (HTML, 6 stars)
  10. IODINE.pytorch - Unofficial PyTorch implementation of IODINE https://arxiv.org/abs/1903.00450 (Python, 4 stars)
  11. TF-Queues-Full-MNIST-Example (Python, 4 stars)
  12. Nonlinear-Programming-Exercises - Programming exercises from Nonlinear Programming (3rd Edition) by Dimitri P. Bertsekas (Jupyter Notebook, 4 stars)
  13. MobileDR - MobileDR + traffic radar for vehicle tracking at traffic intersections. (3 stars)
  14. ML-practice (Python, 2 stars)
  15. nvm-utils - utils for .nvm files from VisualSFM (http://ccwu.me/vsfm/) (Python, 2 stars)
  16. distributed-systems - Learning how to code distributed systems with Elixir (Elixir, 2 stars)
  17. esm_one_hot (Python, 2 stars)
  18. symmetric-and-object-centric-world-models - Code accompanying "A Symmetric and Object-Centric World Model for Stochastic Environments" https://github.com/orlrworkshop/orlrworkshop.github.io/blob/master/pdf/ORLR_3.pdf (Python, 2 stars)
  19. Osprey - CEN3031 Intro to Software Engineering // Health Accelerator // Osprey (JavaScript, 1 star)
  20. machine-learning-practice.pytorch - Implementations of various machine learning algorithms from scratch for practice. (Jupyter Notebook, 1 star)
  21. genesis.pytorch - Unofficial PyTorch implementation of GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations (Python, 1 star)
  22. ppi-with-stacked-autoencoders (Jupyter Notebook, 1 star)