Twice regularized MDPs and the equivalence between robustness and regularization
