Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff

Neural Information Processing Systems

Bandit convex optimization is one of the fundamental problems in the field of online learning.