Survey Bandits with Regret Guarantees

Krishnamurthy, Sanath Kumar; Athey, Susan

Feb-22-2020, arXiv.org Machine Learning

We consider a variant of the contextual bandit problem. In standard contextual bandits, when a user arrives we observe the user's complete feature vector and then assign a treatment (arm) to that user. In a number of applications, such as healthcare, collecting features from users can be costly. To address this issue, we propose algorithms that avoid needless feature collection while maintaining strong regret guarantees.
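
To make the standard protocol described above concrete, here is a minimal Python sketch of a contextual bandit loop: a user arrives, the full feature vector is observed, an arm is assigned, and a reward is received. It is an illustration only and is not the paper's algorithm. The toy environment (one binary feature, two arms, the reward means in `true_means`) and the per-context UCB1 rule are assumptions chosen for brevity. The `features_collected` counter shows that this baseline pays the feature-collection cost on every round.

```python
import math
import random


def ucb_index(reward_sum, pulls, t):
    """UCB1 index: empirical mean plus an exploration bonus."""
    if pulls == 0:
        return float("inf")  # force each arm to be tried once
    return reward_sum / pulls + math.sqrt(2.0 * math.log(t) / pulls)


def run_contextual_bandit(rounds=2000, seed=0):
    """Run a toy contextual bandit with a separate UCB1 learner per context.

    Hypothetical setup (not from the paper): each user has one binary
    feature x, and arm a pays a Bernoulli reward with mean true_means[x][a].
    Returns (cumulative pseudo-regret, number of features collected).
    """
    rng = random.Random(seed)
    true_means = {0: [0.8, 0.2], 1: [0.3, 0.7]}  # assumed for illustration
    pulls = {x: [0, 0] for x in true_means}
    reward_sums = {x: [0.0, 0.0] for x in true_means}
    regret = 0.0
    features_collected = 0

    for t in range(1, rounds + 1):
        # A user arrives and the complete feature vector is observed.
        x = rng.randint(0, 1)
        features_collected += 1  # the standard protocol always pays this cost

        # Assign the arm with the highest UCB1 index for this context.
        arm = max(
            range(2),
            key=lambda a: ucb_index(reward_sums[x][a], pulls[x][a], t),
        )

        # Observe a Bernoulli reward and update the statistics.
        reward = 1.0 if rng.random() < true_means[x][arm] else 0.0
        pulls[x][arm] += 1
        reward_sums[x][arm] += reward

        # Pseudo-regret: gap between the best arm's mean and the chosen arm's.
        regret += max(true_means[x]) - true_means[x][arm]

    return regret, features_collected
```

Calling `run_contextual_bandit()` returns the cumulative pseudo-regret together with the number of feature queries, which in this baseline always equals the number of rounds. The setting in the abstract targets exactly that second quantity: querying fewer features without giving up the regret guarantee.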

Genre:
- Research Report (0.70)

Industry:
- Health & Medicine (0.34)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.34)
  - Artificial Intelligence > Machine Learning
    - Supervised Learning > Representation Of Examples (0.34)
    - Statistical Learning (0.34)
