Stochastic contextual bandits with graph feedback: from independence number to MAS number

Yuxiao Wen, Yanjun Han

Neural Information Processing Systems 

Modeling the feedback structure in bandits as a feedback graph has a long history (Mannor and Shamir, 2011; Alon et al., 2015, 2017; Lykouris et al.,
