Cornering Stationary and Restless Mixing Bandits with Remix-UCB
Julien Audiffren, Liva Ralaivola
–Neural Information Processing Systems
We study the restless bandit problem where arms are associated with stationary ϕ-mixing processes and where rewards are therefore dependent: the question that arises from this setting is that of carefully recovering some independence by'ignoring' the values of some rewards.
Neural Information Processing Systems
Oct-2-2025, 05:53:32 GMT
- Country:
- Industry:
- Technology: