Online Robust Reinforcement Learning with Model Uncertainty

Neural Information Processing Systems 

Robust reinforcement learning (RL) seeks a policy that optimizes the worst-case performance over an uncertainty set of MDPs. In this paper, we focus on model-free robust RL, where the uncertainty set is defined to be centered at a misspecified MDP that generates a single sample trajectory sequentially and is assumed to be unknown.
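
To make the worst-case objective concrete, the following is a minimal sketch of robust value iteration, not the method of this paper. It makes several assumptions that are not in the abstract: the uncertainty set is a small finite list of known transition kernels, the worst case is taken independently for each state-action pair, and the function name `robust_value_iteration` and the toy numbers are purely illustrative. The model-free setting described above is harder, because the MDP at the center of the uncertainty set is unknown and only a single trajectory is observed.

```python
import numpy as np

def robust_value_iteration(kernels, rewards, gamma=0.9, iters=500):
    """Worst-case value iteration over a finite set of transition kernels.

    kernels: array-like of shape (K, S, A, S), one kernel per candidate MDP
             (assumed known here, unlike in the model-free setting).
    rewards: array of shape (S, A).
    Returns the robust value function (S,) and a greedy policy (S,).
    """
    kernels = np.asarray(kernels, dtype=float)
    rewards = np.asarray(rewards, dtype=float)
    v = np.zeros(rewards.shape[0])
    for _ in range(iters):
        # Expected next-state value under each candidate kernel: (K, S, A).
        next_v = kernels @ v
        # Adversary picks the worst kernel separately for each (state, action).
        q = rewards + gamma * next_v.min(axis=0)
        # Agent picks the best action against that worst case.
        v = q.max(axis=1)
    return v, q.argmax(axis=1)

# Toy example (hypothetical numbers): 2 states, 2 actions, reward only in state 1.
nominal = np.array([
    [[0.9, 0.1], [0.2, 0.8]],  # from state 0 under action 0, action 1
    [[0.1, 0.9], [0.1, 0.9]],  # from state 1 under action 0, action 1
])
perturbed = np.array([
    [[0.9, 0.1], [0.6, 0.4]],
    [[0.5, 0.5], [0.5, 0.5]],
])
rewards = np.array([[0.0, 0.0], [1.0, 1.0]])

v_robust, policy = robust_value_iteration([nominal, perturbed], rewards)
v_nominal, _ = robust_value_iteration([nominal], rewards)
```

In this sketch the robust value can never exceed the value computed under the nominal kernel alone, since the adversary may always choose the nominal kernel; comparing `v_robust` with `v_nominal` shows how much performance is given up to guard against the perturbed model.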
