Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

Open in new window