Reining Generalization in Offline Reinforcement Learning via Representation Distinction

Neural Information Processing Systems 

Offline Reinforcement Learning (RL) aims to address the challenge of distribution shift between the dataset and the learned policy, where the value of out-of-distribution (OOD) data may be erroneously estimated due to overgeneralization.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found