Decision Transformer: Reinforcement Learning via Sequence Modeling
Neural Information Processing Systems
This stands in sharp contrast to much work in reinforcement learning (RL), which learns a single policy to model a particular narrow behavior distribution. Given the diversity of applications and impact of transformer models, we seek to examine their application to sequential decision-making problems.
Feb-9-2026, 13:16:06 GMT