Neural Information Processing Systems 

We thank all the reviewers for their thoughtful feedback. This is consistent with the theoretical result that the time-averaged regret goes to zero as T goes to infinity.
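To make the statement about time-averaged regret concrete, here is a minimal sketch under an assumed sublinear bound. The symbol R_T for cumulative regret, the constant C, and the square-root rate are illustrative assumptions, not the specific bound proved in the paper; any bound with R_T = o(T) leads to the same conclusion.

```latex
% Assumption (illustrative): cumulative regret is sublinear, R_T \le C\sqrt{T} for some constant C > 0.
\[
  0 \;\le\; \frac{R_T}{T} \;\le\; \frac{C\sqrt{T}}{T} \;=\; \frac{C}{\sqrt{T}}
  \;\xrightarrow[T \to \infty]{}\; 0 .
\]
% The lower bound 0 \le R_T holds when regret is measured against the best fixed comparator in expectation;
% in general only the upper bound is needed to conclude \limsup_{T \to \infty} R_T / T \le 0.
```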