XLand-MiniGrid: Scalable Meta-Reinforcement Learning Environments in JAX
–Neural Information Processing Systems
At the same time, meta-RL methods have major limitations. Since the agent requires thousands of different tasks for generalization, faster adaptation during inference comes at the expense of significantly increased pre-training requirements. For example, a single training of the Ada agent (Team et al., 2023) takes five weeks
Neural Information Processing Systems
Nov-17-2025, 16:58:00 GMT
- Country:
- Europe
- North America > United States (0.04)
- Industry:
- Education (0.70)
- Leisure & Entertainment > Games (0.46)
- Technology: