Goto

Collaborating Authors

 Learning Management


Stochastic Online Learning with Feedback Graphs: Finite-Time and Asymptotic Optimality

Neural Information Processing Systems

We show that, surprisingly, the notion of optimal finite-time regret is not a uniquely defined property in this context and that, in general, it is decoupled from the asymptotic rate. We discuss alternative choices and propose a notion of finite-time optimality that we argue is meaningful .










Online learning with dynamics: A minimax perspective

Neural Information Processing Systems

Given such a setup, a natural question to ask is how does one measure the performance of the learner? Classical online learning studies one such notion of performance known as regret.