XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX

Neural Information Processing Systems 

At the same time, meta-RL methods have major limitations. Since the agent requires thousands of different tasks for generalization, faster adaptation during inference comes at the expense of significantly increased pre-training requirements. For example, a single training of the Ada agent (Team et al., 2023) takes five weeks

Similar Docs  Excel Report  more

TitleSimilaritySource
None found