Delay and Cooperation in Nonstochastic Linear Bandits

Neural Information Processing Systems 

This paper offers a nearly optimal algorithm for online linear optimization with delayed bandit feedback.