AITopics | regret minimization

We give a randomized online algorithm that guarantees near-optimal $\widetilde O(\sqrt T)$ expected swap regret against any sequence of $T$ adaptively chosen Lipschitz convex losses on the unit interval. This improves the previous best bound of $\widetilde O(T^{2/3})$ and answers an open question of Fishelson et al. [2025b]. In addition, our algorithm is efficient: it runs in $\mathsf{poly}(T)$ time. A key technical idea we develop to obtain this result is to discretize the unit interval into bins at multiple scales of granularity and simultaneously use all scales to make randomized predictions, which we call multi-scale binning and may be of independent interest. A direct corollary of our result is an efficient online algorithm for minimizing the calibration error for general elicitable properties. This result does not require the Lipschitzness assumption of the identification function needed in prior work, making it applicable to median calibration, for which we achieve the first $\widetilde O(\sqrt T)$ calibration error guarantee.

artificial intelligence, machine learning, swap regret, (18 more...)

arXiv.org Machine Learning

2602.08862

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
(2 more...)

Genre: Research Report (0.70)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.95)
Information Technology > Game Theory (0.68)

Add feedback

Response to Reviewer 2: Empirical evaluation: Interestingly, we actually did an empirical evaluation in the earlier

Neural Information Processing SystemsFeb-9-2026, 06:23:30 GMT

We thank the reviewers for the positive feedback and their interest in our work! Below we address some questions. Both algorithms are well-tuned for hyperparameters. We didn't include it in the submission because after all the We will make sure to define them earlier in the paper in the revision. We are happy to clarify them.

artificial intelligence, evaluation, machine learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.33)

Add feedback

4b70484ebef62484e0c8cdd269e482fd-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-8-2026, 18:52:10 GMT

adaptive regret, algorithm, dynamic regret, (13 more...)

Neural Information Processing Systems

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

EfficientMethodsforNon-stationaryOnlineLearning

Neural Information Processing SystemsFeb-8-2026, 18:52:07 GMT

Inparticular, dynamic regret [Zinkevich,2003;Zhang et al.,2018a]and adaptiveregret [Hazan and Seshadhri, 2009; Daniely et al., 2015] are proposed as two principled metrics to guide the algorithm design. Theunknowncomparators orunknown intervals bring considerable uncertainty to online optimization.

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China > Jiangsu Province > Nanjing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Add feedback

Filters

Collaborating Authors

regret minimization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

580760fb5def6e2ca8eaf601236d5b08-Supplemental.pdf

580760fb5def6e2ca8eaf601236d5b08-Paper.pdf

4f87658ef0de194413056248a00ce009-Supplemental.pdf

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

d0d5dd7bd2ee9f095e50084c2ba3a716-Paper-Conference.pdf

Near-optimal Swap Regret Minimization for Convex Losses

Response to Reviewer 2: Empirical evaluation: Interestingly, we actually did an empirical evaluation in the earlier

4b70484ebef62484e0c8cdd269e482fd-Supplemental-Conference.pdf

EfficientMethodsforNon-stationaryOnlineLearning