Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff

Ofer Dekel, Ronen Eldan, Tomer Koren

Neural Information Processing Systems 

Bandit convex optimization is one of the fundamental problems in the field of online learning.