Combinatorial Pure Exploration of Multi-Armed Bandits

Chen, Shouyuan, Lin, Tian, King, Irwin, Lyu, Michael R., Chen, Wei

Feb-14-2020, 05:42:32 GMT–Neural Information Processing Systems

We study the {\em combinatorial pure exploration (CPE)} problem in the stochastic multi-armed bandit setting, where a learner explores a set of arms with the objective of identifying the optimal member of a \emph{decision class}, which is a collection of subsets of arms with certain combinatorial structures such as size-$K$ subsets, matchings, spanning trees or paths, etc. The CPE problem represents a rich class of pure exploration tasks which covers not only many existing models but also novel cases where the object of interest has a non-trivial combinatorial structure. In this paper, we provide a series of results for the general CPE problem. We present general learning algorithms which work for all decision classes that admit offline maximization oracles in both fixed confidence and fixed budget settings. We prove problem-dependent upper bounds of our algorithms.

combinatorial pure exploration, combinatorial structure, decision class, (5 more...)

Neural Information Processing Systems

Feb-14-2020, 05:42:32 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.43)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (0.83)
  - Data Science > Data Mining
    - Big Data (0.64)