Sub-samplingforEfficientNon-Parametric BanditExploration
–Neural Information Processing Systems
In this paper, we propose the first re-sampling based algorithm that is asymptotically optimal for several classes of possibly un-bounded parametric distributions.
Neural Information Processing Systems
Feb-8-2026, 03:26:49 GMT