AITopics | optimal regret bound

Dying Experts: Efficient Algorithms with Optimal Regret Bounds

Neural Information Processing SystemsMay-27-2025, 08:22:29 GMT

We study a variant of decision-theoretic online learning in which the set of experts that are available to Learner can shrink over time. This is a restricted version of the well-studied sleeping experts problem, itself a generalization of the fundamental game of prediction with expert advice. Similar to many works in this direction, our benchmark is the ranking regret. Various results suggest that achieving optimal regret in the fully adversarial sleeping experts problem is computationally hard. This motivates our relaxation where any expert that goes to sleep will never again wake up.

efficient algorithm, optimal regret bound, sleeping expert problem, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.55)

Add feedback

Reviews: Dying Experts: Efficient Algorithms with Optimal Regret Bounds

Neural Information Processing SystemsJan-21-2025, 21:01:10 GMT

The dying expert setting is interesting, it would be appreciated to give more examples. The overall writing is good and easy to follow. I have a simple question on the performance measure, ranking regret. In the definition of (1), authors claim \sigma t(\pi) is the first alive expert of ordering \pi in round t. So why do we need to specify the "first" alive expert, rather than the alive expert with the optimal performance?

efficient algorithm, optimal regret bound

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Reviews: Dying Experts: Efficient Algorithms with Optimal Regret Bounds

Neural Information Processing SystemsJan-21-2025, 21:00:58 GMT

In addition to the upper bound the reviewers found the lower bounds of interest. The reviewers are unanimous in their opinion that this paper should be accepted.

efficient algorithm, optimal regret bound, reviewer

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

Dying Experts: Efficient Algorithms with Optimal Regret Bounds

Neural Information Processing SystemsOct-9-2024, 14:01:54 GMT

We study a variant of decision-theoretic online learning in which the set of experts that are available to Learner can shrink over time. This is a restricted version of the well-studied sleeping experts problem, itself a generalization of the fundamental game of prediction with expert advice. Similar to many works in this direction, our benchmark is the ranking regret. Various results suggest that achieving optimal regret in the fully adversarial sleeping experts problem is computationally hard. This motivates our relaxation where any expert that goes to sleep will never again wake up.

efficient algorithm, optimal regret bound, sleeping expert problem, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.55)

Add feedback

Dying Experts: Efficient Algorithms with Optimal Regret Bounds

Shayestehmanesh, Hamid, Azami, Sajjad, Mehta, Nishant A.

Neural Information Processing SystemsMar-19-2020, 00:46:28 GMT

We study a variant of decision-theoretic online learning in which the set of experts that are available to Learner can shrink over time. This is a restricted version of the well-studied sleeping experts problem, itself a generalization of the fundamental game of prediction with expert advice. Similar to many works in this direction, our benchmark is the ranking regret. Various results suggest that achieving optimal regret in the fully adversarial sleeping experts problem is computationally hard. This motivates our relaxation where any expert that goes to sleep will never again wake up.

artificial intelligence, machine learning, optimal regret bound, (4 more...)

Neural Information Processing Systems

Genre: Research Report (0.44)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.60)

Add feedback

Further Optimal Regret Bounds for Thompson Sampling

Agrawal, Shipra, Goyal, Navin

arXiv.org Machine LearningSep-14-2012

Thompson Sampling is one of the oldest heuristics for multi-armed bandit problems. It is a randomized algorithm based on Bayesian ideas, and has recently generated significant interest after several studies demonstrated it to have better empirical performance compared to the state of the art methods. In this paper, we provide a novel regret analysis for Thompson Sampling that simultaneously proves both the optimal problem-dependent bound of $(1+\epsilon)\sum_i \frac{\ln T}{\Delta_i}+O(\frac{N}{\epsilon^2})$ and the first near-optimal problem-independent bound of $O(\sqrt{NT\ln T})$ on the expected regret of this algorithm. Our near-optimal problem-independent bound solves a COLT 2012 open problem of Chapelle and Li. The optimal problem-dependent regret bound for this problem was first proven recently by Kaufmann et al. [ALT 2012]. Our novel martingale-based analysis techniques are conceptually simple, easily extend to distributions other than the Beta distribution, and also extend to the more general contextual bandits setting [Manuscript, Agrawal and Goyal, 2012].

artificial intelligence, data mining, machine learning, (17 more...)

arXiv.org Machine Learning

1209.3353

Country: Asia > India (0.04)

Genre: Research Report > New Finding (0.48)

Technology: