Bandit Convex Optimization: Towards Tight Bounds Kfir Y. Levy Technion--Israel Institute of Technology Technion--Israel Institute of Technology Haifa 32000, Israel

Mar-13-2024, 12:33:48 GMT–Neural Information Processing Systems

Bandit Convex Optimization (BCO) is a fundamental framework for decision making under uncertainty, which generalizes many problems from the realm of online and statistical learning. While the special case of linear cost functions is well understood, a gap on the attainable regret for BCO with nonlinear losses remains an important open question. In this paper we take a step towards understanding the best attainable regret bounds for BCO: we give an efficient and near-optimal regret algorithm for BCO with strongly-convex and smooth loss functions. In contrast to previous works on BCO that use time invariant exploration schemes, our method employs an exploration scheme that shrinks with time.

algorithm, artificial intelligence, machine learning, (14 more...)

Neural Information Processing Systems

Mar-13-2024, 12:33:48 GMT

Conferences PDF

Add feedback

Country:
- Asia > Middle East > Israel > Haifa District > Haifa (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.34)