Goto

Collaborating Authors

 Reinforcement Learning










2c3ddf4bf13852db711dd1901fb517fa-AuthorFeedback.pdf

Neural Information Processing Systems

As[R1]38 has pointed out, our novel interpretation of KL term gives new insights and variations on online Bayesian learning.39 Since UCL samples the weight parameters only once for each iteration, applying it to actor-critic based42 reinforcement learning algorithm becomes possible.