Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Neural Information Processing Systems 

The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erdős-Rényi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found