Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures
Neural Information Processing Systems
We present improved algorithms with worst-case regret guarantees for the stochastic linear bandit problem. The widely used "optimism in the face of uncertainty"
Feb-11-2025, 03:56:11 GMT