• Stars
    star
    415
  • Rank 103,710 (Top 3 %)
  • Language
    Python
  • License
    Other
  • Created almost 8 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Reinforcement learning with unsupervised auxiliary tasks

UNREAL

CircleCI

About

Replicating UNREAL algorithm described in Google Deep Mind's paper "Reinforcement learning with unsupervised auxiliary tasks."

https://arxiv.org/pdf/1611.05397.pdf

Implemented with TensorFlow and DeepMind Lab environment.

Preview

seekavoid_arena_01

seekavoid_arena_01

stairway_to_melon

stairway_to_melon

nav_maze_static_01

nav_maze_static_01

Network

Network

All weights of convolution layers and LSTM layer are shared.

Requirements

  • TensorFlow (Tested with r1.0)
  • DeepMind Lab
  • numpy
  • cv2
  • pygame
  • matplotlib

Result

"seekavoid_arena_01" Level

seekavoid_01_score

"nav_maze_static_01" Level

nav_maze_static_01_score

How to train

First, download and install DeepMind Lab

$ git clone https://github.com/deepmind/lab.git

Then build it following the build instruction. https://github.com/deepmind/lab/blob/master/docs/build.md

Clone this repo in lab directory.

$ cd lab
$ git clone https://github.com/miyosuda/unreal.git

Add this bazel instruction at the end of lab/BUILD file

package(default_visibility = ["//visibility:public"])

Then run bazel command to run training.

bazel run //unreal:train --define headless=glx

--define headlesss=glx uses GPU rendering and it requires display not to sleep. (We need to disable display sleep.)

If you have any trouble with GPU rendering, please use software rendering with --define headless=osmesa option.

How to show result

To show result after training, run this command.

bazel run //unreal:display --define headless=glx

More Repositories

1

async_deep_reinforce

Asynchronous Methods for Deep Reinforcement Learning
Python
590
star
2

TensorFlowAndroidDemo

TensorFlow Android stand-alone demo
C++
275
star
3

TensorFlowAndroidMNIST

Tensorflow MNIST demo on Android
C++
264
star
4

disentangled_vae

Replicating "Understanding disentangling in ฮฒ-VAE"
Python
193
star
5

scan

SCAN: Learning Abstract Hierarchical Compositional Visual Concepts
Python
54
star
6

heartrate-monitor

Heart rate variability (HRV) analysis tool to detect autonomic nerve state
Swift
37
star
7

predictive_coding

Predictive Coding in the Visual Cortex: a Functional Interpretation of Some Extra-classical Receptive-field Effects
Jupyter Notebook
23
star
8

rat_grid

Vector-based navigation using grid-like representations in artificial agents
Jupyter Notebook
19
star
9

rodentia

3D learning environment with rigid body simulation for Linux/MacOSX
C++
15
star
10

episodic_control

Model-Free Episodic Control
Python
15
star
11

evolution_and_ai

Python
12
star
12

intro-to-dl-android

Jinnan Android Meetup Vol.1 "Androidใงๅ‹•ใ‹ใ™ใฏใ˜ใ‚ใฆใฎDeep Learning"
C++
5
star
13

narr-note

Fast math note-taking tool with Tex notation for MacOSX.
JavaScript
5
star
14

can

Continous Attractor Network Model
Jupyter Notebook
5
star
15

dendritic_bp

Dendritic error backpropagation in deep cortical microcircuits
Jupyter Notebook
4
star
16

snmf

Hebbian/Anti-Hebbian Network for Online NMF
Jupyter Notebook
3
star
17

narr-map

A minimal mind map editor
TypeScript
2
star
18

hvrnn

Hierarchical variational autoencoder
Python
2
star
19

reinforcement_learning_samples

samples of reinforcement learning
Java
2
star
20

manimalai

Easy-to-use Animal-AI clone environment
Python
1
star
21

tinymac

Tiny old mac (Macintosh 128K) like hardware
C
1
star
22

intro-to-dl2

Python
1
star