DecisionTransformer: Reinforcement LearningviaSequenceModeling

Neural Information Processing Systems 

This stands insharp contrast tomuch workinreinforcement learning (RL), which learns a single policy to model a particular narrow behavior distribution. Given the diversity of applications andimpact oftransformer models, weseek toexamine their application tosequential decision making problems.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found