Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs

Marek Petrik, Reazul Hasan Russel

Neural Information Processing Systems 

Robust MDPs (RMDPs) can be used to compute policies with provable worst-case guarantees in reinforcement learning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found