• Stars
    star
    238
  • Rank 169,306 (Top 4 %)
  • Language
    Python
  • License
    MIT License
  • Created about 6 years ago
  • Updated over 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Random Network Distillation pytorch

Random Network Distillation

Intrinsic Reward Graph with play

Venture Montezuma's Revenge
Video Label
~ New model for Montezuma
  • Advantage Actor critic [1]
  • Parallel Advantage Actor critic [2]
  • Exploration by Random Network Distillation [3]
  • Proximal Policy Optimization Algorithms [4]

1. Setup

Requirements


2. How to Train

Modify the parameters in config.conf as you like.

python train.py

3. How to Eval

python eval.py

4. Loss/Reward Graph

  • Montezuma's Revenge Env image
  • Venture Env image

References

[1] Actor-Critic Algorithms
[2] Efficient Parallel Methods for Deep Reinforcement Learning
[3] Exploration by Random Network Distillation
[4] Proximal Policy Optimization Algorithms