Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs

Neural Information Processing Systems 

We show that the regret bound is near-optimal even with very heavy-tailed noise.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found