Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens

Open in new window