Train Hard, Fight Easy: Robust Meta Reinforcement Learning

Neural Information Processing Systems 

A major challenge of reinforcement learning (RL) in real-world applications is the variation between environments, tasks, or clients. Standard meta reinforcement learning (MRL) methods optimize the average return over tasks, but often suffer poor results on tasks of high risk or difficulty. This limits system reliability, since the test tasks are not known in advance. In this work, we define a robust MRL objective with a controlled level of robustness. Optimization of analogous robust objectives in RL is known to lead to both biased gradients and data inefficiency.
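To make the contrast between the average objective and a robust one concrete, the sketch below compares the mean return over tasks with a tail-risk objective. The use of CVaR (the mean of the worst alpha-fraction of task returns) is an assumption for illustration; the abstract only specifies a robust objective with a controlled robustness level, and `alpha` here plays the role of that control knob.

```python
import numpy as np

def mean_return(task_returns):
    """Standard MRL objective: average return over tasks."""
    return float(np.mean(task_returns))

def cvar_return(task_returns, alpha=0.2):
    """Illustrative robust objective (assumed CVaR): average of the
    worst alpha-fraction of task returns. Smaller alpha means a more
    risk-averse, more robust objective."""
    r = np.sort(np.asarray(task_returns, dtype=float))
    k = max(1, int(np.ceil(alpha * len(r))))  # number of worst tasks kept
    return float(r[:k].mean())

# Hypothetical per-task returns: one hard task drags down the tail.
returns = [3.0, 9.0, 1.0, 7.0, 5.0]
print(mean_return(returns))            # 5.0
print(cvar_return(returns, alpha=0.4)) # 2.0 (mean of the two worst tasks)
```

A policy maximizing the mean can look strong while still failing on the worst tasks; the CVaR-style objective exposes exactly that tail, which is why the robustness level must be controllable.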