Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits

Neural Information Processing Systems 

We study the Bayesian regret of the renowned Thompson Sampling algorithm in contextual bandits with binary losses and adversarially-selected contexts.

Duplicate Docs Excel Report

Similar Docs  Excel Report  more

TitleSimilaritySource
None found