Position-Based Multiple-Play Bandits with Thompson Sampling

Gauthier, Camille-Sovanneary, Gaudel, Romaric, Fromont, Elisa

Sep-28-2020–arXiv.org Machine Learning

Multiple-play bandits aim at displaying relevant items at relevant positions on a web page. We introduce a new bandit-based algorithm, PB-MHB, for online recommender systems which uses the Thompson sampling framework. This algorithm handles a display setting governed by the position-based model. Our sampling method does not require as input the probability of a user to look at a given position in the web page which is, in practice, very difficult to obtain. Experiments on simulated and real datasets show that our method, with fewer prior information, deliver better recommendations than state-of-the-art algorithms.

artificial intelligence, big data, pb-mhb, (19 more...)

arXiv.org Machine Learning

Sep-28-2020

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (1.00)
  - Data Science > Data Mining
    - Big Data (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found