Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial Constraints

Neural Information Processing Systems 

Surprisingly, we show that estimating the constraints with a UCB-like approach guarantees optimal performance.
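
To make the idea of a UCB-like constraint estimate concrete, here is a minimal sketch of one generic way to build optimistic cost estimates for the arms of a bandit. It is not taken from the paper: the function names, the Hoeffding-style radius, the assumption that costs lie in [0, 1], and the single threshold are all assumptions made for illustration.

```python
import math


def confidence_radius(n_pulls: int, delta: float = 0.05) -> float:
    """Hoeffding-style radius for the mean of n_pulls samples in [0, 1].

    Assumption: costs are bounded in [0, 1]; delta is the failure probability.
    """
    if n_pulls == 0:
        return math.inf
    return math.sqrt(math.log(1.0 / delta) / (2.0 * n_pulls))


def optimistic_cost(cost_sum: float, n_pulls: int, delta: float = 0.05) -> float:
    """Lower confidence bound on an arm's mean cost, clipped at 0.

    Being optimistic about a constraint means assuming the cost is as low
    as the data plausibly allows, so an unpulled arm gets cost 0.
    """
    if n_pulls == 0:
        return 0.0
    mean = cost_sum / n_pulls
    return max(0.0, mean - confidence_radius(n_pulls, delta))


def plausibly_feasible_arms(cost_sums, pulls, threshold, delta=0.05):
    """Indices of arms whose optimistic cost does not exceed the threshold."""
    return [
        arm
        for arm, (s, n) in enumerate(zip(cost_sums, pulls))
        if optimistic_cost(s, n, delta) <= threshold
    ]


if __name__ == "__main__":
    # Hypothetical data: total observed cost and pull count per arm.
    print(plausibly_feasible_arms([9.0, 1.0, 0.0], [10, 10, 0], threshold=0.3))
```

In this sketch an arm is ruled out only once its observed costs exceed the threshold by more than the confidence radius, so rarely pulled arms stay in the candidate set until enough evidence accumulates.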
