The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning

May-27-2025, 05:18:50 GMT–Neural Information Processing Systems

Offline reinforcement learning (RL) aims to train agents from pre-collected datasets. However, this comes with the added challenge of estimating the value of behaviors not covered in the dataset. Model-based methods offer a potential solution by training an approximate dynamics model, which then allows collection of additional synthetic data via rollouts in this model. The prevailing theory treats this approach as online RL in an approximate dynamics model, and any remaining performance gap is therefore understood as being due to dynamics model errors. In this paper, we analyze this assumption and investigate how popular algorithms perform as the learned dynamics model is improved.

dynamic model, edge-of-reach problem, offline model-based reinforcement learning, (3 more...)

Neural Information Processing Systems

May-27-2025, 05:18:50 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.63)