Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

Neural Information Processing Systems 

The widely adopted state-of-the-art Online Decision Transformer (ODT) still struggles when pretrained with low-reward offline data.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found