Learning in Generalized Linear Contextual Bandits with Stochastic Delays

Zhengyuan Zhou, Renyuan Xu, Jose Blanchet

Neural Information Processing Systems 

In this paper, we consider online learning in generalized linear contextual bandits where rewards are not immediately observed.
