RecurrentSubmodularWelfareand MatroidBlockingSemi-Bandits

Neural Information Processing Systems 

In this work, we extend the above direction to a combinatorial semi-bandit setting and study avariant of stochastic MAB, where arms are subject to matroid constraints and each arm becomes unavailable (blocked) for afixed number of rounds after each play. A natural common generalization of the state-of-the-art for blocking bandits, and that for matroid bandits, only guarantees a1/2-approximation for general matroids.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found