Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach