Indexed Minimum Empirical Divergence for Unimodal Bandits

Neural Information Processing Systems 

Both the means and the distributions are unknown, which makes the problem nontrivial, and the learner only knows that ν ∈ D, where D is a given set of bandit configurations.
