PRODuctive bandits: Importance Weighting No More

Mar-21-2026, 18:43:40 GMT–Neural Information Processing Systems

Prod is a seminal algorithm in full-information online learning, which has been conjectured to be fundamentally sub-optimal for multi-armed bandits.By leveraging the interpretation of Prod as a first-order OMD approximation, we present the following surprising results:1. Variants of Prod can obtain optimal regret for adversarial multi-armed bandits.

artificial intelligence, big data, data mining, (7 more...)

Neural Information Processing Systems

Mar-21-2026, 18:43:40 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Artificial Intelligence (0.40)
  - Data Science > Data Mining
    - Big Data (0.66)