Regret based Robust Solutions for Uncertain Markov Decision Processes
–Neural Information Processing Systems
In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust optimization approaches for these problems have focussed on the computation of {\em maximin} policies which maximize the value corresponding to the worst realization of the uncertainty.
Neural Information Processing Systems
Sep-30-2025, 12:58:48 GMT
- Technology: