An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning

Neural Information Processing Systems 

In the standard reinforcement learning (RL) setting, the primary goal is to obtain a policy that maximizes a cumulative scalar reward [Sutton and Barto, 2018].

Similar Docs  Excel Report  more

TitleSimilaritySource
None found