Awesome Decision Transformer
This is a collection of research papers for Decision Transformer (DT). And the repository will be continuously updated to track the frontier of DT.
Welcome to follow and star!
Table of Contents
Overview of Transformer
The Decision Transformer was proposed by “Decision Transformer: Reinforcement Learning via Sequence Modeling” by Chen L. et al. It casts (offline) Reinforcement Learning as a conditional-sequence modeling problem.
Specifically, DT model is a causal transformer model conditioned on the desired return, (past) states, and actions to generate future actions in an autoregressive manner.
Advantage
- Bypass the need for bootstrapping for long term credit assignment
- Avoid undesirable short-sighted behaviors due to the discounting future rewards.
- Enjoy the transformer models widely used in language and vision, which are easy to scale and adapt to multi-modal data.
Papers
format:
- [title](paper link) [links]
- author1, author2, and author3...
- publisher
- key
- code
- experiment environment
Arxiv
-
Can Offline Reinforcement Learning Help Natural Language Understanding?
- Ziqi Zhang, Yile Wang, Yue Zhang, Donglin Wang
- Key: Language model
- ExpEnv: MuJoco, Maze 2D
-
PACT: Perception-Action Causal Transformer for Autoregressive Robotics Pre-Training
- Rogerio Bonatti, Sai Vemprala, Shuang Ma, Felipe Frujeri, Shuhang Chen, Ashish Kapoor
- Key: Robotics, Pretrain, Multitask, Representation
- ExpEnv: MuSHR car, Habitat
-
LATTE: LAnguage Trajectory TransformEr
- Arthur Bucker, Luis Figueredo, Sami Haddadin, Ashish Kapoor, Shuang Ma, Sai Vemprala, Rogerio Bonatti
- Key: MultiModal, Robotics
- Code: official, official
- ExpEnv: CoppeliaSim
-
- Taku Yamagata, Ahmed Khalil, Raul Santos-Rodriguez
- Key: Q-Learning
- ExpEnv: D4RL
-
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
-
Transfer learning with causal counterfactual reasoning in Decision Transformers
- Ayman Boustati, Hana Chockler, Daniel C. McNamee
- Key: Causal reasoning, Transfer Learning
- ExpEnv: MINIGRID
-
Pretraining for Language Conditioned Imitation with Transformers
- Aaron L Putterman, Kevin Lu, Igor Mordatch, Pieter Abbeel
- Key: Text-Conditioned Decision
- ExpEnv: Text-Conditioned Frostbite (MultiModal Benchmark)
-
An Offline Deep Reinforcement Learning for Maintenance Decision-Making
- Hamed Khorasgani, Haiyan Wang, Chetan Gupta, Ahmed Farahat
- Publisher: Annual Conference of the PHM Society 2021
- Key: Offline Supervised RL, Remaining Useful Life Estimation
- ExpEnv: NASA C-MAPSS
-
A Sequence Modelling Approach to Question Answering in Text-Based Games
- Gregory Furman, Edan Toledo, Jonathan Shock, Jan Buys
- Publisher: Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022)
- Key: VQA
- ExpEnv: QAIT
-
- Qinjie Lin, Han Liu, Biswa Sengupta
- Key: Multi-Task RL, Sparse Reward
- ExpEnv: MINIGRID
-
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
-
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
-
Transformers are Adaptable Task Planners
- Vidhi Jain, Yixin Lin, Eric Undersander, Yonatan Bisk, Akshara Rai
- Key: Task Planning, Prompt, Control, Generalization
- Code: official
- ExpEnv: Dishwasher Loading
-
You Can't Count on Luck: Why Decision Transformers Fail in Stochastic Environments
- Keiran Paster, Sheila McIlraith, Jimmy Ba
- Key: Stochastic Environments
- ExpEnv: Gambling, Connect Four, 2048
-
When does return-conditioned supervised learning work for offline reinforcement learning?
-
SimStu-Transformer: A Transformer-Based Approach to Simulating Student Behaviour
- Zhaoxing Li, Lei Shi, Alexandra Cristea, Yunzhan Zhou, Chenghao Xiao, Ziqi Pan
- Key: Intelligent Tutoring System
-
Attention-Based Learning for Combinatorial Optimization
- Carson Smith
- Key: Combinatorial Optimization
ICLR 2023
- EDGI: Equivariant Diffusion for Planning with Embodied Agents
- Johann Brehmer, Joey Bose, Pim de Haan, Taco Cohen
- Publisher: ICLR 2023 Reincarnating RL workshop
- Key: rich geometric structure, equivariant, conditional generative modeling, representation
- ExpEnv: None
Neurips 2022
- Decision making as language generation
- Roland Memisevic, Sunny Panchal, Mingu Lee
- Publisher: NeurIPS 2022 Workshop FMDM
- Key: Generation
- ExpEnv: Traversals (Toy experiment)
CoRL 2022
-
Offline Reinforcement Learning for Customizable Visual Navigation
- Dhruv Shah, Arjun Bhorkar, Hrishit Leen, Ilya Kostrikov, Nicholas Rhinehart, Sergey Levine
- Publisher: CoRL 2022 (Oral)
- Key: Visual Navigation
- ExpEnv: RECON
-
Instruction-driven history-aware policies for robotic manipulations
-
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
ICML 2022
-
- Qinqing Zheng, Amy Zhang, Aditya Grover
- Publisher: ICML 2022 (Oral)
- Key: Online finetuning, Max-entropy, Exploration
- Code: unofficial
- ExpEnv: MuJoco, D4RL
-
Prompting Decision Transformer for Few-Shot Policy Generalization
-
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning
-
AnyMorph: Learning Transferable Polices By Inferring Agent Morphology
- Brandon Trabucco, Mariano Phielipp, Glen Berseth
- Publisher: ICML 2022 (Poster)
- Key: Morphology, Transfer Learning, Zero Shot
- ExpEnv: Modular-RL
AAAI 2022
- Dreaming with Transformers
- Catherine Zeng, Jordan Docter, Alexander Amini, Igor Gilitschenski, Ramin Hasani, Daniela Rus
- Publisher: AAAI 2022 (RLG Workshop)
- Key: Dreamer, World Model
- ExpEnv: Deepmind Lab, VISTA
ICLR 2022
-
Learning Transferable Policies By Inferring Agent Morphology
- Brandon Trabucco, Mariano Phielipp, Glen Berseth
- Publisher: ICLR 2022 (GPL Workshop Poster)
- Key: Morphology, Transfer Learning, Zero Shot
- ExpEnv: Modular-RL
-
Generalized Decision Transformer for Offline Hindsight Information Matching
NeurIPS 2021
-
Decision Transformer: Reinforcement Learning via Sequence Modeling
-
Offline Reinforcement Learning as One Big Sequence Modeling Problem
-
TransDreamer: Reinforcement Learning with Transformer World Models
ICML 2021
Contributing
Our purpose is to make this repo even better. If you are interested in contributing, please refer to HERE for instructions in contribution.
License
Awesome Decision Transformer is released under the Apache 2.0 license.