FindingOptimalArmsinNon-stochastic CombinatorialBanditswithSemi-banditFeedback andFiniteBudget
–Neural Information Processing Systems
The action is to choose a set of arms, whereupon feedback for each arm in the chosen set is received.
Neural Information Processing Systems
Feb-10-2026, 08:00:24 GMT