Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits
–Neural Information Processing Systems
We provide a tight regret analysis for ExpTS, which simultaneously yields both the finite-time regret bound as well as the asymptotic regret bound.
Neural Information Processing Systems
Aug-22-2025, 02:08:27 GMT
- Country:
- Asia > Singapore (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California (0.04)
- Genre:
- Research Report (0.92)
- Technology: