PAC-Bayesian Analysis of Contextual Bandits François Laviolette
–Neural Information Processing Systems
We derive an instantaneous (per-round) data-dependent regret bound for stochastic multiarmed bandits with side information (also known as contextual bandits).
Neural Information Processing Systems
Mar-15-2024, 02:44:14 GMT
- Country:
- North America > Canada
- Quebec (0.04)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Germany > Baden-Württemberg
- Tübingen Region > Tübingen (0.14)
- Austria > Styria
- Leoben (0.04)
- United Kingdom > England
- North America > Canada
- Technology: