IndexedMinimumEmpiricalDivergencefor UnimodalBandits
–Neural Information Processing Systems
Both means and distributions areunknown, which makes the problem non trivial, and the learner only knows thatν D where D is a given set of bandit configurations.
Neural Information Processing Systems
Feb-8-2026, 07:25:00 GMT