Beyond Confidence Regions: Tight Bayesian Ambiguity Sets for Robust MDPs
Marek Petrik, Reazul Hasan Russel
–Neural Information Processing Systems
Robust MDPs (RMDPs) can be used to compute policies with provable worst-case guarantees in reinforcement learning.
Neural Information Processing Systems
Aug-20-2025, 00:18:46 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada (0.04)
- United States > New Hampshire (0.04)
- Europe > United Kingdom