Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning

Neural Information Processing Systems 

This work was supported in part by the National Science Foundation under grant CCF-2149588 and Cisco, Inc.
