Efficient Pure Exploration in Adaptive Round model
Tianyuan Jin, Jieming SHI, Xiaokui Xiao, Enhong Chen
–Neural Information Processing Systems
In the adaptive setting, many multi-armed bandit applications allow the learner to adaptively draw samples and adjust sampling strategy in rounds. In many real applications, not only the query complexity but also the round complexity need to be optimized. In this paper, we study both PAC and exact top-k arm identification problems and design efficient algorithms considering both round complexity and query complexity.
Neural Information Processing Systems
Jan-21-2025, 12:47:44 GMT