Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
Neural Information Processing Systems
Going beyond existing meta-RL analyses, we provide upper bounds on the expected optimality gap over the task distribution.
Feb-7-2026, 09:05:18 GMT
- Country:
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - Pennsylvania (0.04)
- Genre:
  - Research Report > Experimental Study (0.92)
- Industry:
  - Education (0.45)
- Technology: