Comparator-Adaptive Convex Bandits

Mar-21-2025, 11:04:21 GMT–Neural Information Processing Systems

We study bandit convex optimization methods that adapt to the norm of the comparator, a topic that has only been studied before for its full-information counterpart. Specifically, we develop convex bandit algorithms with regret bounds that are small whenever the norm of the comparator is small. We first use techniques from the full-information setting to develop comparator-adaptive algorithms for linear bandits. Then, we extend the ideas to convex bandits with Lipschitz or smooth loss functions, using a new variant of the standard single-point gradient estimator and carefully designed surrogate losses.

artificial intelligence, data mining, machine learning, (15 more...)

Neural Information Processing Systems

Mar-21-2025, 11:04:21 GMT

Conferences PDF

Add feedback

Country:
- North America (0.46)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning > Optimization (0.34)
  - Data Science > Data Mining
    - Big Data (0.35)