Grounding Aleatoric Uncertainty for Unsupervised Environment Design
Neural Information Processing Systems
Adaptive curricula in reinforcement learning (RL) have proven effective for producing policies robust to discrepancies between the training and test environments.
ground-truth distribution, grounding aleatoric uncertainty, unsupervised environment design
Dec-25-2025, 09:32:50 GMT