Regret based Robust Solutions for Uncertain Markov Decision Processes

Neural Information Processing Systems 

In this paper, we seek robust policies for uncertain Markov Decision Processes (MDPs). Most robust optimization approaches for these problems have focussed on the computation of {\em maximin} policies which maximize the value corresponding to the worst realization of the uncertainty.