Note that the regret ofthe algorithm in [1]satisfiesR(G,T) = O (δ(G)logn)

Feb-11-2026, 06:07:07 GMT–Neural Information Processing Systems

The bandit problem with graph feedback, proposed in [Mannor and Shamir, NeurIPS 2011], is modeled by a directed graphG = (V,E) where V is the collection of bandit arms, and once an arm is triggered, all its incident arms are observed.

artificial intelligence, data mining, machine learning, (21 more...)

Neural Information Processing Systems

Feb-11-2026, 06:07:07 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.49)
  - Artificial Intelligence
    - Representation & Reasoning (0.48)
    - Machine Learning (0.46)

Duplicate Docs Excel Report

Title
Understanding Bandits with Graph Feedback

Similar Docs Excel Report more

Title	Similarity	Source
None found