OnlineRobustReinforcementLearningwithModel Uncertainty
–Neural Information Processing Systems
Robust reinforcement learning (RL) is to find a policy that optimizes the worstcase performance over an uncertainty set of MDPs. In this paper, we focus on model-freerobust RL, where the uncertainty set is defined to be centering at a misspecified MDP that generates a single sample trajectory sequentially, and is assumed to beunknown.
Neural Information Processing Systems
Feb-8-2026, 06:47:15 GMT
- Country:
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Technology: