Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach

Open in new window