Representative Action Selection for Large Action-Space Meta-Bandits
Zhou, Quan, Kozdoba, Mark, Mannor, Shie
We study the problem of selecting a subset from a large action space shared by a family of bandits, with the goal of achieving performance nearly matching that of using the full action space. We assume that similar actions tend to have related payoffs, modeled by a Gaussian process. To exploit this structure, we propose a simple epsilon-net algorithm to select a representative subset. We provide theoretical guarantees for its performance and compare it empirically to Thompson Sampling and Upper Confidence Bound.
May-27-2025
- Country:
- Asia > Japan
- Honshū > Kantō > Kanagawa Prefecture (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Asia > Japan
- Genre:
- Research Report (0.81)
- Industry:
- Health & Medicine > Therapeutic Area (0.46)
- Technology: