Noise-Adaptive Thompson Sampling for Linear Contextual Bandits
–Neural Information Processing Systems
Linear contextual bandits represent a fundamental class of models with numerous real-world applications, and it is critical to developing algorithms that can effectively manage noise with unknown variance, ensuring provable guarantees for both worst-case constant-variance noise and deterministic reward scenarios.
Neural Information Processing Systems
Apr-27-2026, 01:58:01 GMT