Contextual bandits with surrogate losses: Margin bounds and efficient algorithms

Dylan J. Foster, Akshay Krishnamurthy

Neural Information Processing Systems 

We use surrogate losses to obtain several new regret bounds and new algorithms for contextual bandit learning.
