Learning in Generalized Linear Contextual Bandits with Stochastic Delays
Zhengyuan Zhou, Renyuan Xu, Jose Blanchet
–Neural Information Processing Systems
In this paper, we consider online learning in generalized linear contextual bandits where rewards are not immediately observed.
Neural Information Processing Systems
Oct-2-2025, 18:36:44 GMT