Optimism in Reinforcement Learning and Kullback-Leibler Divergence

Open in new window