Contextual Combinatorial Multi-armed Bandits with Volatile Arms and Submodular Reward
–Neural Information Processing Systems
In this paper, we study the stochastic contextual combinatorial multi-armed bandit (CC-MAB) framework that is tailored for volatile arms and submodular reward functions. CC-MAB inherits properties from both contextual bandit and combinatorial bandit: it aims to select a set of arms in each round based on the side information (a.k.a.
Neural Information Processing Systems
May-26-2025, 04:57:27 GMT
- Country:
- North America > United States > Florida > Miami-Dade County > Coral Gables (0.14)
- Industry:
- Government (0.46)
- Technology: