Distributional Bellman and the C51 Algorithm
I got the chance to read this paper on Distributional Bellman published by DeepMind in July. When I skimmed it the first time, my impression was that it would be an important paper, since the theory was sound and the experimental results were promising. However, it did not generate as much buzz in the reinforcement learning community as I would have hoped. Nevertheless, since I thought the idea of Distributional Bellman was pretty neat, I decided to implement it (in Keras) and test it out myself. I hope this article helps interested readers better understand the core concepts of Distributional Bellman. To understand Distributional Bellman, we first need a basic understanding of Q Learning.
Jan-28-2018, 17:14:20 GMT