Open Problem: Model Selection for Contextual Bandits
Foster, Dylan J., Krishnamurthy, Akshay, Luo, Haipeng
In statistical learning, algorithms for model selection allow the learner to adapt to the complexity of the best hypothesis class in a sequence. We ask whether similar guarantees are possible for contextual bandit learning.
Jun-18-2020
- Country:
- North America > United States
- California (0.14)
- Massachusetts (0.14)
- North America > United States
- Genre:
- Research Report (0.40)
- Industry:
- Education > Educational Setting (0.31)
- Technology: