Open Problem: Model Selection for Contextual Bandits

Foster, Dylan J.; Krishnamurthy, Akshay; Luo, Haipeng

arXiv.org Machine Learning 

In statistical learning, algorithms for model selection allow the learner to adapt to the complexity of the best hypothesis class in a sequence. We ask whether similar guarantees are possible for contextual bandit learning.
