Learning on the Edge: Online Learning with Stochastic Feedback Graphs

Jan-19-2025, 03:25:20 GMT–Neural Information Processing Systems

The framework of feedback graphs is a generalization of sequential decision-making with bandit or full information feedback. In this work, we study an extension where the directed feedback graph is stochastic, following a distribution similar to the classical Erdős-Rényi model. Specifically, in each round every edge in the graph is either realized or not with a distinct probability for each edge. Our result, which holds without any preliminary knowledge about \mathcal{G}, requires the learner to observe only the realized out-neighborhood of the chosen action. When the learner is allowed to observe the realization of the entire graph (but only the losses in the out-neighborhood of the chosen action), we derive a more efficient algorithm featuring a dependence on weighted versions of the independence and weak domination numbers that exhibits improved bounds for some special cases.

learning, stochastic feedback graph, varepsilon, (4 more...)

Neural Information Processing Systems

Jan-19-2025, 03:25:20 GMT

Conferences Web Page

Add feedback

Industry:
- Education > Educational Setting > Online (0.40)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (0.61)
  - Enterprise Applications > Human Resources
    - Learning Management (0.40)