Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures Hamish Flynn
–Neural Information Processing Systems
We present improved algorithms with worst-case regret guarantees for the stochastic linear bandit problem. The widely used "optimism in the face of uncertainty"
Neural Information Processing Systems
Mar-27-2025, 11:34:15 GMT