Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
–Neural Information Processing Systems
We study the Bayesian regret of the renowned Thompson Sampling algorithm in contextual bandits with binary losses and adversarially-selected contexts.
Neural Information Processing Systems
Aug-14-2025, 08:12:08 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > North Holland
- Asia > Middle East
- Genre:
- Research Report (0.46)