Nearly Optimal Best-of-Both-Worlds Algorithms for Online Learning with Feedback Graphs

Neural Information Processing Systems 

This study considers online learning with general directed feedback graphs. For this problem, we present best-of-both-worlds algorithms that achieve nearly tight regret bounds for adversarial environments as well as poly-logarithmic regret bounds for stochastic environments. As Alon et al. [2015] have shown, tight regret bounds depend on the structure of the feedback graph: strongly observable graphs yield minimax regret of Θ(α

Similar Docs  Excel Report  more

TitleSimilaritySource
None found