Latent Bandits Revisited
Neural Information Processing Systems
A latent bandit is a bandit problem in which the learning agent knows the reward distributions of the arms conditioned on an unknown discrete latent state.
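To make the definition concrete, the following is a minimal sketch of the setting, not the paper's algorithm. It assumes Bernoulli rewards, a hypothetical table `MEANS[s][a]` holding the known mean reward of arm `a` in latent state `s`, and a simple posterior-sampling rule for picking arms. All names and numbers here are illustrative assumptions.

```python
import random

# Hypothetical known model: MEANS[s][a] is the Bernoulli mean reward
# of arm a when the (unknown) latent state is s.
MEANS = [
    [0.9, 0.1],  # latent state 0
    [0.2, 0.8],  # latent state 1
]


def update_posterior(posterior, arm, reward):
    """Bayes update of the belief over latent states after one reward."""
    likelihoods = [m[arm] if reward == 1 else 1.0 - m[arm] for m in MEANS]
    unnormalized = [p * l for p, l in zip(posterior, likelihoods)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


def choose_arm(posterior, rng):
    """Sample a latent state from the belief and play its best arm."""
    state = rng.choices(range(len(MEANS)), weights=posterior)[0]
    return max(range(len(MEANS[state])), key=lambda a: MEANS[state][a])


def run(true_state, steps, seed=0):
    """Simulate the agent; returns the final belief over latent states."""
    rng = random.Random(seed)
    posterior = [1.0 / len(MEANS)] * len(MEANS)  # uniform prior (assumed)
    for _ in range(steps):
        arm = choose_arm(posterior, rng)
        reward = 1 if rng.random() < MEANS[true_state][arm] else 0
        posterior = update_posterior(posterior, arm, reward)
    return posterior
```

Because the conditional reward distributions are known, the agent only has to identify the latent state rather than estimate each arm from scratch, which is what distinguishes this setting from a standard multi-armed bandit.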
Nov-14-2025, 17:13:32 GMT