pemami4911/deep-rl

Stars
297
Rank 140,075 (Top 3 %)
Language
Python
License
MIT License
Created almost 9 years ago
Updated over 5 years ago

pemami4911/deep-rl

pemami4911

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Collection of Deep Reinforcement Learning algorithms

deep-rl

Collection of Deep Reinforcement Learning algorithms.

Dependencies:

Tested with Python 2.7 and Python 3.6

So far:

DDPG - Deep Deterministic Policy Gradients, evaluated on the Pendulum-v0 environment in OpenAI Gym.

Places where this code has been used

If you have used this code to do something cool, send me a link and a GIF (via email or pull request) and I'll add it

@keithmgould used the same the DDPG code to solve the inverted Pendulum task in Roboschool.
@janscholten Deep Reinforcement Learning with Feedback-based Exploration [code]

neural-combinatorial-rl-pytorch

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning https://arxiv.org/abs/1611.09940

POMDPy

POMDPs in Python.

awesome-hyperparams

A curated list of awesome hyperparameters for deep learning

sinkhorn-policy-gradient.pytorch

Code accompanying the paper "Learning Permutations with Sinkhorn Policy Gradient"

EfficientMORL

EfficientMORL (ICML'21)

REBAR-pytorch

Implementation of REBAR in PyTorch

Jupyter Notebook

V-RAVE

Virtual Reality Autonomous Vehicle Experience

ppde

Official repository for "Plug & Play Directed Evolution for Proteins with Gradient-Based Discrete MCMC"

pemami4911.github.io

https://pemami4911.github.io

TF-Queues-Full-MNIST-Example

IODINE.pytorch

Unofficial PyTorch implementation of IODINE https://arxiv.org/abs/1903.00450

Nonlinear-Programming-Exercises

Programming exercises from Nonlinear Programming (3rd Edition) by Dimitri P. Bertsekas

Jupyter Notebook

MobileDR

MobileDR + traffic radar for vehicle tracking at traffic intersections.

ML-practice

nvm-utils

utils for .nvm files from VisualSFM (http://ccwu.me/vsfm/)

distributed-systems

Learning how to code distributed systems with Elixir

esm_one_hot

symmetric-and-object-centric-world-models

Code accompanying "A Symmetric and Object-Centric World Model for Stochastic Environments" https://github.com/orlrworkshop/orlrworkshop.github.io/blob/master/pdf/ORLR_3.pdf

Osprey

CEN3031 Intro to Software Engineering // Health Accelerator // Osprey

machine-learning-practice.pytorch

Implementations of various machine learning algorithms from scratch for practice.

Jupyter Notebook

genesis.pytorch

Unofficial PyTorch implementation of GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

ppi-with-stacked-autoencoders

Jupyter Notebook