Noise-Adaptive Thompson Sampling for Linear Contextual Bandits

Neural Information Processing Systems 

Linear contextual bandits represent a fundamental class of models with numerous real-world applications, and it is critical to developing algorithms that can effectively manage noise with unknown variance, ensuring provable guarantees for both worst-case constant-variance noise and deterministic reward scenarios.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found