Cornering Stationary and Restless Mixing Bandits with Remix-UCB
Julien Audiffren, Liva Ralaivola
–Neural Information Processing Systems
We study the restless bandit problem where arms are associated with stationary ϕ-mixing processes and where rewards are therefore dependent: the question that arises from this setting is that of carefully recovering some independence by'ignoring' the values of some rewards.
Neural Information Processing Systems
Feb-7-2025, 00:45:30 GMT