Reinforcement Learning with Imperfect Transition Predictions: ABellman-Jensen Approach

Open in new window