• Stars
    star
    2,231
  • Rank 20,496 (Top 0.5 %)
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created over 8 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A set of Deep Reinforcement Learning Agents implemented in Tensorflow.

Deep Reinforcement Learning Agents

This repository contains a collection of reinforcement learning algorithms written in Tensorflow. The ipython notebook here were written to go along with a still-underway tutorial series I have been publishing on Medium. If you are new to reinforcement learning, I recommend reading the accompanying post for each algorithm.

The repository currently contains the following algorithms:

  • Q-Table - An implementation of Q-learning using tables to solve a stochastic environment problem.
  • Q-Network - A neural network implementation of Q-Learning to solve the same environment as in Q-Table.
  • Simple-Policy - An implementation of policy gradient method for stateless environments such as n-armed bandit problems.
  • Contextual-Policy - An implementation of policy gradient method for stateful environments such as contextual bandit problems.
  • Policy-Network - An implementation of a neural network policy-gradient agent that solves full RL problems with states and delayed rewards, and two opposite actions (ie. CartPole or Pong).
  • Vanilla-Policy - An implementation of a neural network vanilla-policy-gradient agent that solves full RL problems with states, delayed rewards, and an arbitrary number of actions.
  • Model-Network - An addition to the Policy-Network algorithm which includes a separate network which models the environment dynamics.
  • Double-Dueling-DQN - An implementation of a Deep-Q Network with the Double DQN and Dueling DQN additions to improve stability and performance.
  • Deep-Recurrent-Q-Network - An implementation of a Deep Recurrent Q-Network which can solve reinforcement learning problems involving partial observability.
  • Q-Exploration - An implementation of DQN containing multiple action-selection strategies for exploration. Strategies include: greedy, random, e-greedy, Boltzmann, and Bayesian Dropout.
  • A3C-Doom - An implementation of Asynchronous Advantage Actor-Critic (A3C) algorithm. It utilizes multiple agents to collectively improve a policy. This implementation can solve RL problems in 3D environments such as VizDoom challenges.

More Repositories

1

TF-Tutorials

A collection of deep learning tutorials using Tensorflow and Python
Jupyter Notebook
523
star
2

Meta-RL

Implementation of Meta-RL A3C algorithm
Jupyter Notebook
401
star
3

oreilly-rl-tutorial

Contains Jupyter notebooks associated with the "Deep Reinforcement Learning Tutorial" tutorial given at the O'Reilly 2017 NYC AI Conference.
Jupyter Notebook
273
star
4

neuro-nav

A library for neuroscience-inspired navigation and decision making research.
Jupyter Notebook
197
star
5

dfp

Reinforcement Learning with Goals
Jupyter Notebook
170
star
6

Pix2Pix-Film

An implementation of Pix2Pix in Tensorflow for use with frames from films
Jupyter Notebook
165
star
7

pytorch-diffusion

A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'
Python
161
star
8

sound-cnn

A convolutional neural network that classifies sounds
Python
159
star
9

3D-TSNE

A Unity project for visualizing t-SNE data in 3D.
C#
73
star
10

RL-CC

Web-based Reinforcement Learning Control Center
Jupyter Notebook
64
star
11

successor_examples

Tutorials on learning and using successor representations.
Jupyter Notebook
49
star
12

ML-Tools

Variety of machine learning algorithms written in python
Python
42
star
13

DNN-Sentiment

Convolutional and recurrent deep neural networks for text sentiment analysis.
Python
32
star
14

NeuralDreamVideos

A deep learning model for creating video sequences
Jupyter Notebook
24
star
15

cognition-course

Slides used in Cognitive Psychology course taught during summer 2015 at the University of Oregon
6
star
16

synescape

Sound visualization app for musicians and music fans.
C#
5
star
17

interaction-grounded-learning

A simple PyTorch implementation of the ideas presented in the paper Interaction Grounded Learning (IGL) from Xie et al., 2021.
Jupyter Notebook
4
star
18

serotonin-ebm

Python
1
star