Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs

Dec-25-2025, 02:11:49 GMT–Neural Information Processing Systems

This study considers online learning with general directed feedback graphs. For this problem, we present best-of-both-worlds algorithms that achieve nearly tight regret bounds for adversarial environments as well as poly-logarithmic regret bounds for stochastic environments. As Alon et al. [2015] have shown, tight regret bounds depend on the structure of the feedback graph: strongly observable graphs yield minimax regret of $\tilde{\Theta}(\alpha^{1/2} T^{1/2})$, while weakly observable graphs induce minimax regret of $\tilde{\Theta}(\delta^{1/3} T^{2/3})$, where $\alpha$ and $\delta$, respectively, represent the independence number of the graph and the domination number of a certain portion of the graph.
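The dependence of the regret rate on graph structure can be illustrated with a small sketch. It is not the paper's algorithm. It only classifies a feedback graph using the observability definitions usually attributed to Alon et al. [2015] (a vertex is strongly observable if it has a self-loop or is observed by every other vertex, and weakly observable if it is observed by some vertex but is not strongly observable), computes the independence number by brute force, and evaluates the order of the minimax rate with constants and logarithmic factors dropped. The function names `observability`, `independence_number`, and `rate_order` are hypothetical, and $\delta$ is passed in by the caller rather than computed.

```python
import math
from itertools import combinations


def observability(n, edges):
    """Classify a directed feedback graph on vertices 0..n-1.

    edges is a set of pairs (i, j): playing action i reveals the loss of j.
    Returns "strong", "weak", or "unobservable".
    """
    everyone = set(range(n))
    all_strong = True
    for j in range(n):
        in_nbrs = {i for (i, k) in edges if k == j}
        if not in_nbrs:
            return "unobservable"  # no action ever reveals the loss of j
        has_self_loop = j in in_nbrs
        seen_by_all_others = in_nbrs >= everyone - {j}
        if not (has_self_loop or seen_by_all_others):
            all_strong = False  # j is only weakly observable
    return "strong" if all_strong else "weak"


def independence_number(n, edges):
    """Brute-force independence number (ignores direction and self-loops).

    Exponential in n, so only suitable for small illustrative graphs.
    """
    for size in range(n, 0, -1):
        for subset in combinations(range(n), size):
            if all((u, v) not in edges and (v, u) not in edges
                   for u, v in combinations(subset, 2)):
                return size
    return 0


def rate_order(kind, T, alpha=None, delta=None):
    """Order of the minimax regret, without constants or log factors."""
    if kind == "strong":
        return math.sqrt(alpha * T)
    if kind == "weak":
        return delta ** (1 / 3) * T ** (2 / 3)
    raise ValueError("no sublinear rate for an unobservable graph")


# Bandit feedback: each action reveals only its own loss (self-loops only).
bandit = {(i, i) for i in range(3)}
# Full information: every action reveals every loss.
full_info = {(i, j) for i in range(3) for j in range(3)}
# A weakly observable example: no self-loops, vertex 2 is seen only by 0.
weak_graph = {(0, 1), (0, 2), (1, 0)}
```

Under these definitions the bandit graph is strongly observable with $\alpha = 3$, the full-information graph is strongly observable with $\alpha = 1$, and the third graph is weakly observable, so it falls under the $T^{2/3}$ regime.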

name change, online learning, optimal best-of-both-world algorithm

Industry:
- Education > Educational Setting > Online (0.66)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)