When Is Generalizable Reinforcement Learning Tractable?

May-26-2025, 18:43:17 GMT–Neural Information Processing Systems

Agents trained by reinforcement learning (RL) often fail to generalize beyond the environment they were trained in, even when presented with new scenarios that seem similar to the training environment. We study the query complexity required to train RL agents that generalize to multiple environments. Intuitively, tractable generalization is only possible when the environments are similar or close in some sense. To capture this, we introduce Weak Proximity, a natural structural condition that requires the environments to have highly similar transition and reward functions and share a policy providing optimal value. Despite such shared structure, we prove that tractable generalization is impossible in the worst case.

artificial intelligence, generalizable reinforcement learning tractable, machine learning, (7 more...)

Neural Information Processing Systems

May-26-2025, 18:43:17 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)