Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
–Neural Information Processing Systems
We study the Bayesian regret of the renowned Thompson Sampling algorithm in contextual bandits with binary losses and adversarially-selected contexts.
Neural Information Processing Systems
Aug-14-2025, 08:12:08 GMT
- Country:
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- United Kingdom > England
- Asia > Middle East
- Jordan (0.04)
- Europe
- Genre:
- Research Report (0.46)