Bellman Optimality of Average-Reward Robust Markov Decision Processes with a Constant Gain