Exploiting Correlated Auxiliary Feedback in Parameterized Bandits
Neural Information Processing Systems
In this paper, we first develop a method that exploits auxiliary feedback to build a reward estimator with tight confidence bounds, which leads to smaller regret.
Oct-8-2025, 03:06:58 GMT
- Country:
  - Asia > Singapore (0.04)
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
- Genre:
  - Research Report > New Finding (0.46)
- Technology: