TianhongDai/hindsight-experience-replay

Stars
391
Rank 110,003 (Top 3 %)
Language
Python
License
MIT License
Created almost 6 years ago
Updated almost 3 years ago


TianhongDai


This is a PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.

Hindsight Experience Replay (HER)

This is a PyTorch implementation of Hindsight Experience Replay.
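For readers new to HER, the core idea can be sketched as follows: some transitions in a stored episode are relabeled with a goal that was actually achieved later in the same episode, and the sparse reward is recomputed against that substituted goal. This is a minimal illustration only, not this repository's code. The function names, the `replay_k` parameter, the 0.05 distance threshold, and the "future" sampling strategy are all assumptions made for the example.

```python
import numpy as np


def sparse_reward(achieved_goal, goal, threshold=0.05):
    """Return 0 where the achieved goal is within `threshold` of the goal, else -1.

    The threshold value is an assumption for this sketch.
    """
    distance = np.linalg.norm(achieved_goal - goal, axis=-1)
    return np.where(distance > threshold, -1.0, 0.0).astype(np.float32)


def relabel_future(achieved_goals, goals, replay_k=4, rng=None):
    """Relabel some transitions with goals achieved later in the same episode.

    achieved_goals: array of shape (T + 1, goal_dim), one entry per visited state.
    goals:          array of shape (T, goal_dim), the goals the agent was given.
    Returns the (possibly substituted) goals and the recomputed rewards.
    """
    rng = np.random.RandomState() if rng is None else rng
    T = goals.shape[0]
    new_goals = goals.copy()
    # Aim for replay_k relabeled samples per original one, i.e. relabel
    # each transition with probability replay_k / (1 + replay_k).
    her_mask = rng.uniform(size=T) < replay_k / (1.0 + replay_k)
    for t in np.flatnonzero(her_mask):
        # Pick a state reached strictly after step t (indices t+1 .. T).
        future_t = rng.randint(t + 1, T + 1)
        new_goals[t] = achieved_goals[future_t]
    # The reward is judged on the state reached after the action at step t.
    rewards = sparse_reward(achieved_goals[1:], new_goals)
    return new_goals, rewards
```

Because the substituted goal is one the agent really reached, some relabeled transitions receive a reward of 0, which gives the learner a success signal even when the original goal was never met.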

Acknowledgement:

OpenAI Baselines

Requirements

python=3.5.2
openai-gym=0.12.5 (mujoco200 is supported, but you need gym >= 0.12.5 because earlier versions have a bug.)
mujoco-py=1.50.1.56 (~~Please use this version; if you use mujoco200, training may fail on FetchSlide-v1~~)
pytorch=1.0.0 (With pytorch-0.4.1 you may hit data type errors. I will fix this later.)
mpi4py
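The version constraints above can be checked before training. The following is a small sketch in plain Python; `parse_version` and `meets_minimum` are hypothetical helper names introduced for this example and are not part of the repository.

```python
def parse_version(text):
    """Turn a string such as '0.12.5' into the tuple (0, 12, 5).

    Parsing stops at the first component that does not start with a digit,
    so a suffix such as '.post2' is ignored.
    """
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if digits == "":
            break
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True when the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)
```

For example, `meets_minimum(gym.__version__, "0.12.5")` could guard against the older gym releases mentioned above, assuming gym is importable in the current environment.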

TODO List

support GPU acceleration - GPU support has been added, but I still do not recommend it unless you have a powerful machine.
add multi-env per MPI.
add the plot and demo of the FetchSlide-v1.

Instruction to run the code

If you want to use a GPU, add the flag --cuda (not recommended; the CPU is the better choice).

Train on FetchReach-v1:

mpirun -np 1 python -u train.py --env-name='FetchReach-v1' --n-cycles=10 2>&1 | tee reach.log

Train on FetchPush-v1:

mpirun -np 8 python -u train.py --env-name='FetchPush-v1' 2>&1 | tee push.log

Train on FetchPickAndPlace-v1:

mpirun -np 16 python -u train.py --env-name='FetchPickAndPlace-v1' 2>&1 | tee pick.log

Train on FetchSlide-v1:

mpirun -np 8 python -u train.py --env-name='FetchSlide-v1' --n-epochs=200 2>&1 | tee slide.log
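The four commands above differ only in the environment name, the number of MPI workers, the extra flags, and the log file. As a convenience, they could be generated from a small table. This is a sketch only: the `RUNS` table and `build_command` helper are hypothetical names for this example, and the values are copied from the commands listed above.

```python
# (environment, MPI workers, extra flags, log file), taken from the commands above.
RUNS = [
    ("FetchReach-v1", 1, ["--n-cycles=10"], "reach.log"),
    ("FetchPush-v1", 8, [], "push.log"),
    ("FetchPickAndPlace-v1", 16, [], "pick.log"),
    ("FetchSlide-v1", 8, ["--n-epochs=200"], "slide.log"),
]


def build_command(env_name, n_workers, extra_flags, log_file):
    """Assemble the mpirun shell command string for one training run."""
    parts = [
        "mpirun", "-np", str(n_workers),
        "python", "-u", "train.py",
        "--env-name='{}'".format(env_name),
    ]
    parts.extend(extra_flags)
    # Merge stderr into stdout and copy the combined output into the log file.
    return " ".join(parts) + " 2>&1 | tee " + log_file


if __name__ == "__main__":
    for run in RUNS:
        print(build_command(*run))
```

The script only prints the commands; each line can then be pasted into a shell to launch the corresponding run.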

Play Demo

python demo.py --env-name=<environment name>

Download the Pre-trained Model

Please download the pre-trained models from Google Drive, then put the saved_models folder under the current folder.

Results

Training Performance

The curves were plotted from runs with 5 different seeds; the solid line is the median value.
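A median curve of this kind can be computed from per-seed results as sketched below. This is an illustration, not the plotting script used for the figures: the `summarize_runs` name, the input layout (one row per seed, one column per epoch), and the interquartile band are assumptions. The source states only that the solid line is the median.

```python
import numpy as np


def summarize_runs(success_rates):
    """Summarize per-seed learning curves.

    success_rates: array-like of shape (n_seeds, n_epochs).
    Returns the per-epoch median plus the 25th and 75th percentiles,
    which could be drawn as a shaded band (an assumption of this sketch).
    """
    runs = np.asarray(success_rates, dtype=np.float64)
    median = np.median(runs, axis=0)
    lower = np.percentile(runs, 25, axis=0)
    upper = np.percentile(runs, 75, axis=0)
    return median, lower, upper
```

With matplotlib, the median would typically be drawn as a solid line with `plot` and the band with `fill_between`.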

Demo:

Tip: while watching the demo, you can press TAB to switch the camera in the MuJoCo viewer.

FetchPush-v1	FetchPickAndPlace-v1	FetchSlide-v1

reinforcement-learning-algorithms

This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)

integrated-gradient-pytorch

This is the pytorch implementation of the paper - Axiomatic Attribution for Deep Networks.

mosse-object-tracking

This is the implementation of MOSSE tracking algorithm (correlation filter based).

self-imitation-learning-pytorch

This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.

distributed-ppo

This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).

google-football-pytorch

It's the pytorch implementation of google research football.

metaworld-sac

div-hindsight

This is the official code of our paper "Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay" [PRICAI 2021].

esil-hindsight

This is the official code of our paper "Episodic Self-Imitation Learning with Hindsight" [Electronics 2020].

wavelet-hdr

This is the official code for our paper "Wavelet-Based Network For High Dynamic Range Imaging" [CVIU 2023].

deep-hdr-baselines

react2-code

This is the official code of our paper "Machine Learning to Support Visual Auditing of Home-based Lateral Flow Immunoassay Self-Test Results for SARS-CoV-2 Antibodies" [Communications Medicine 2022].

dockerfiles

It contains the dockerfiles for the purpose of machine learning / deep learning research.

phd-thesis

daim-rl

This is the official code of our paper "Diversity-Augmented Intrinsic Motivation for Deep Reinforcement Learning" [Neurocomputing 2021].

DouTu