• Stars
    26
  • Rank 926,415 (Top 19 %)
  • Language
    Python
  • Created over 6 years ago
  • Updated over 2 years ago

Repository Details

A set of reinforcement learning (RL) experiments. Currently included: (1) the MDP rank experiment, based on a policy gradient algorithm
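For readers unfamiliar with the term, here is a minimal sketch of a policy gradient method (REINFORCE with a tabular softmax policy) on a toy MDP. It is not taken from this repository and does not reproduce the MDP rank experiment. The corridor environment, the constants, and the function names (`softmax`, `step`, `run_episode`, `train`) are all assumptions made for illustration.

```python
import math
import random

# Hypothetical toy MDP (not from the repository): a corridor with
# non-terminal states 0..2 and a terminal goal state 3.
N_STATES = 3
N_ACTIONS = 2      # 0 = move left (stay put in state 0), 1 = move right
GAMMA = 0.9        # discount factor
HORIZON = 10       # maximum number of steps per episode


def softmax(prefs):
    """Turn action preferences into a probability distribution."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def step(state, action):
    """Return (next_state, reward, done) for the toy corridor."""
    nxt = state + 1 if action == 1 else max(state - 1, 0)
    if nxt == N_STATES:
        return nxt, 1.0, True   # reward only when the goal is reached
    return nxt, 0.0, False


def run_episode(theta, rng):
    """Sample one trajectory of (state, action, reward) under the policy."""
    trajectory = []
    state = 0
    for _ in range(HORIZON):
        probs = softmax(theta[state])
        action = 0 if rng.random() < probs[0] else 1
        nxt, reward, done = step(state, action)
        trajectory.append((state, action, reward))
        state = nxt
        if done:
            break
    return trajectory


def train(episodes=2000, lr=0.1, seed=0):
    """REINFORCE: ascend the sampled gradient of the expected return."""
    rng = random.Random(seed)
    theta = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(episodes):
        traj = run_episode(theta, rng)
        # Discounted return-to-go G_t for every step, computed backwards.
        g = 0.0
        returns = []
        for _, _, r in reversed(traj):
            g = r + GAMMA * g
            returns.append(g)
        returns.reverse()
        # Accumulate the gradient with the policy that generated the episode.
        grad = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
        for (s, a, _), g_t in zip(traj, returns):
            probs = softmax(theta[s])
            for b in range(N_ACTIONS):
                # d log pi(a|s) / d theta[s][b] = 1[a == b] - pi(b|s)
                grad[s][b] += g_t * ((1.0 if b == a else 0.0) - probs[b])
        for s in range(N_STATES):
            for b in range(N_ACTIONS):
                theta[s][b] += lr * grad[s][b]
    return theta
```

Calling `train()` and then `softmax(theta[s])` for each state `s` shows the learned action probabilities. The sketch omits a baseline and the per-step discount weighting that some formulations include, so it should be read as an illustration of the update rule rather than a tuned implementation.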