Reviews: The Multi-fidelity Multi-armed Bandit

Jan-20-2025, 08:35:21 GMT–Neural Information Processing Systems

The paper in my opinion studies an interesting and relevant problem - one of modelling the tradeoff between information, cost and reward (whether to choose low information that is cheap or high information that is expensive) - in online learning, specifically stochastic bandits. In this sense it may be useful as a benchmark to improve upon. Though the paper seems technically solid, a key shortcoming is the lack of adequate explanation about its results and assumptions. The regret definition adopted seems unnatural at least from one angle - why not penalize resource consumption (or'cost') additively instead of multiplicatively as done here? The authors' example of ad-display motivates their definition, but may not be the most general.

information, multi-fidelity multi-armed bandit, review, (5 more...)

Neural Information Processing Systems

Jan-20-2025, 08:35:21 GMT

Conferences Web Page

Add feedback

Industry:
- Education (0.38)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (0.34)
  - Data Science > Data Mining
    - Big Data (0.40)