Batch-Size Independent Regret Bounds for Combinatorial Semi-Bandits with Probabilistically Triggered Arms or Independent Arms

Neural Information Processing Systems

The agent's goal is to minimize the expected regret,