Pure Exploration and Regret Minimization in Matching Bandits

Flore Sentenac, Jialin Yi, Clément Calauzènes, Vianney Perchet, Milan Vojnovic

Jul-31-2021 – arXiv.org Machine Learning

Finding an optimal matching in a weighted graph is a standard combinatorial problem. We consider its semi-bandit version, in which either a single pair or a full matching is sampled at each round. We prove that a rank-1 assumption on the adjacency matrix can be leveraged to reduce both the sample complexity and the regret of off-the-shelf algorithms, down to a linear dependency on the number of vertices (up to polylogarithmic terms).
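As a rough illustration of why a rank-1 structure can simplify the problem, the sketch below looks only at the offline, noise-free case and assumes a bipartite formulation in which the weight of pair (i, j) factorizes as u[i] * v[j]. The vectors `u` and `v`, the function names, and the bipartite setup are illustrative assumptions, not taken from the paper, and no bandit feedback is modeled. Under that assumption, the rearrangement inequality implies that sorting both sides and pairing them in order yields a maximum-weight matching, which the code checks against exhaustive search on a tiny instance.

```python
from itertools import permutations


def matching_value(u, v, perm):
    """Total weight when left vertex i is paired with right vertex perm[i]."""
    return sum(u[i] * v[perm[i]] for i in range(len(u)))


def best_matching_brute_force(u, v):
    """Exhaustive search over all perfect matchings (tiny graphs only)."""
    n = len(u)
    return max(permutations(range(n)), key=lambda p: matching_value(u, v, p))


def best_matching_rank1(u, v):
    """Assuming weights w[i][j] = u[i] * v[j], pair the k-th smallest u
    with the k-th smallest v (rearrangement inequality)."""
    n = len(u)
    left = sorted(range(n), key=lambda i: u[i])
    right = sorted(range(n), key=lambda j: v[j])
    perm = [0] * n
    for i, j in zip(left, right):
        perm[i] = j
    return tuple(perm)


# Hypothetical parameters chosen only for illustration.
u = [0.9, 0.2, 0.5]
v = [0.3, 0.8, 0.6]

brute = best_matching_brute_force(u, v)
fast = best_matching_rank1(u, v)
print(brute, fast)
```

In this toy setting the sorted pairing needs only the ordering of the 2n hidden parameters rather than all n² pair weights, which gives some intuition for how a dependency linear in the number of vertices might be attainable. The paper's actual algorithms and guarantees concern the sequential, noisy setting and are not reproduced here.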

algorithm, artificial intelligence, machine learning

Country:
- North America > United States
  - California (0.14)
  - New York (0.14)

Genre:
- Research Report (0.81)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)