• Stars
    star
    167
  • Rank 225,476 (Top 5 %)
  • Language
    Python
  • Created over 7 years ago
  • Updated about 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Implementation of the Option-Critic Architecture on the Atari (ALE) environment

The Option-Critic Architecture

Code for the Option-Critic Architecture https://arxiv.org/pdf/1609.05140v2.pdf.

Installation

Here's a list of all dependencies:

  • Numpy
  • Theano
  • Lasagne
  • Launcher
  • Argparse
  • Arcade Learning Environment
  • matplotlib
  • cv2 (OpenCV)

Training

To train, run following command:

python train_q.py --rom pong --num-options 8 --folder-name pong_tempmodel

To view a list of available parameters, run:

print train_q.py --help

To speed up training, we highly suggest using cudnn(CUDA).

Testing

To watch model after training, run:

python run_best_model.py models/pong_tempmodel/last_model.pkl