EnsembleSampling_Final

Neural Information Processing Systems 

Ensemble sampling serves as a practical approximation to Thompson sampling when maintaining an exact posterior distribution over model parameters is computationally intractable. In this paper, we establish a regret bound that ensures desirable behavior when ensemble sampling is applied to the linear bandit problem.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found