Top Rating
- Top Contributors
  Discover the Top Open Source contributors by country or by language
- Interviews
  Discover real stories from Open Source developers
Discover

Discover your Favorite Language
Discover the top trending repositories and projects on Github. Explore the latest trends in your preferred languages.

Julia

Lua

Go

PowerShell

Solidity

Python

Java

Ruby

More Languages
Awesome

Awesome repositories
Discover the most awesome repositories and projects of your favorite languages. Inspired by the Awesome-* lists trend in GitHub.

Nix

C++

Go

Zig

Objective-C

Dart

MATLAB

Julia

More Languages
By Country

Rankings by Country
Discover the community of talented open source contributors in each country.

🇰🇭 Cambodia

🇸🇾 Syria

🇱🇸 Lesotho

🇧🇼 Botswana

🇵🇦 Panama

🇸🇹 São Tomé and Príncipe

🇧🇳 Brunei

🇫🇮 Finland

All Countries Compare Countries

ugo-nama-kun/DQN-chainer

Stars
202
Rank 193,691 (Top 4 %)
Language
Python
License
MIT License
Created over 9 years ago
Updated over 8 years ago

ugo-nama-kun/DQN-chainer

ugo-nama-kun

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

DQN-chainer

This software is a python implementation of Deep Q-Networks for playing ATARI games with Chainer package.

I followed the implementation described in:

V. Mnih et al., "Playing atari with deep reinforcement learning"

http://arxiv.org/pdf/1312.5602.pdf

V. Mnih et al., "Human-level control through deep reinforcement learning"

http://www.nature.com/nature/journal/v518/n7540/abs/nature14236.html

For japanese instruction of DQN and historical review, please check:

http://qiita.com/Ugo-Nama/items/08c6a5f6a571335972d5

Requirement

My implementation is dependent on RL-glue, Arcade Learning Environment, and Chainer. To run the software, you need following softwares/packages.

Python 2.7+
Numpy
Scipy
Pillow (PIL)
Chainer (1.3.0): https://github.com/pfnet/chainer
RL-glue core: https://sites.google.com/a/rl-community.org/rl-glue/Home/rl-glue
RL-glue Python codec: https://sites.google.com/a/rl-community.org/rl-glue/Home/Extensions/python-codec
Arcade Learning Environment (version ALE 0.4.4): http://www.arcadelearningenvironment.org/

This software was tested on Ubuntu 14.04 LTS.

How to run

Please check readme.txt

gym_torcs

nonpara_discrete_rl

Simple Nonparametric Reinforcement Learning with Universal Interface

RL_nyu-mon

mujoco_marker_example

examples of mujoco-py marker

two_resource_environment

DQN agent in 3D two resource problem for foraging task

gather_env

Gather Environment for RL Benchmarking

pydata_okinawa2017

Jupyter Notebook

ste

Sample code of straight-through gradient estimator of stochastic neural networks

random_agent_for_LIS

This is the random agent for LIS. This agent doesn't load and use caffe pre-trained model, and keeps taking random actions.

mcmc_demo

backprop_doc

fisher_ratio

calculating fisher discrimination ratio

Jupyter Notebook

tiny_transformer_samples

tiny Transformer examples

conv_ae

Convolutional Autoencoders Examples

moving_copy_in_chainer

template_unity_mlagents_env

This is a template unity setting of the mlagents for deep model training