Double-Linear Thompson Sampling for Context-Attentive Bandits

Bouneffouf, Djallel, Féraud, Raphaël, Upadhyay, Sohini, Khazaeni, Yasaman, Rish, Irina

Oct-15-2020–arXiv.org Artificial Intelligence

In this paper, we analyze and extend an online learning framework known as Context-Attentive Bandit, motivated by various practical applications, from medical diagnosis to dialog systems, where due to observation costs only a small subset of a potentially large number of context variables can be observed at each iteration; however, the agent has a freedom to choose which variables to observe. We derive a novel algorithm, called Context-Attentive Thompson Sampling (CATS), which builds upon the Linear Thompson Sampling approach, adapting it to Context-Attentive Bandit setting. We provide a theoretical regret analysis and an extensive empirical evaluation demonstrating advantages of the proposed approach over several baseline methods on a variety of real-life datasets.

algorithm, big data, health & medicine, (19 more...)

arXiv.org Artificial Intelligence

Oct-15-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.46)

Genre:
- Research Report (0.64)

Industry:
- Health & Medicine > Diagnostic Medicine (0.34)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Natural Language (1.00)
    - Representation & Reasoning > Personal Assistant Systems (0.68)
  - Data Science > Data Mining
    - Big Data (0.71)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found