• Stars
    star
    166
  • Rank 227,748 (Top 5 %)
  • Language
    Python
  • License
    MIT License
  • Created about 7 years ago
  • Updated over 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym

Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Link to paper

Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym.

This paper introduces Wolpertinger training algorithm that extends the Deep Deterministic Policy Gradient training algorithm introduced in this paper.

I used and extended stevenpjg's implementation of DDPG algorithm found here licensed under the MIT license.

Master is currently only for continuous action spaces.

The branch discrete-and-continuous provides the ability to use the discrete environments of the gym.