Optimal Thresholding Linear Bandit

Feb-11-2024–arXiv.org Artificial Intelligence

The Thresholding Bandit Problem (TBP) Andrea Locatelli (2016); Kano et al. (2019) represents a specific combinatorial The study by Kano et al. (2019) emphasizes that in certain contexts, such as personalized recommendations, pursuing In this scenario, the arms' mean rewards follow a linear model with unknown parameters. We prove an instance-specific lower bound for the expected sample complexity of any correct algorithm.

artificial intelligence, data mining, machine learning, (16 more...)

arXiv.org Artificial Intelligence

Feb-11-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States > Michigan (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning (1.00)
    - Representation & Reasoning (1.00)
  - Data Science > Data Mining
    - Big Data (0.48)