Bayesian Bellman Operators

Matthew Fellows, Kristian Hartikainen, Shimon Whiteson
Department of Computer Science, University of Oxford

Neural Information Processing Systems 

We demonstrate that BootDQNprior+'s lagged target parameters, which are essential to its performance, arise from applying approximate inference to the BBO posterior.
