Offline Model-based Adaptable Policy Learning Xiong-Hui Chen 1, Y ang Y u

Neural Information Processing Systems 

In reinforcement learning, a promising direction to avoid online trial-and-error costs is learning from an offline dataset. Current offline reinforcement learning methods commonly learn in the policy space constrained to in-support regions by the offline dataset, in order to ensure the robustness of the outcome policies.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found