Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures Hamish Flynn

Neural Information Processing Systems 

We present improved algorithms with worst-case regret guarantees for the stochastic linear bandit problem. The widely used "optimism in the face of uncertainty"

Similar Docs  Excel Report  more

TitleSimilaritySource
None found