Learning to Play Multi-Follower Bayesian Stackelberg Games
Personnat, Gerson, Lin, Tao, Hossain, Safwan, Parkes, David C.
–arXiv.org Artificial Intelligence
In a multi-follower Bayesian Stackelberg game, a leader plays a mixed strategy over $L$ actions to which $n\ge 1$ followers, each having one of $K$ possible private types, best respond. The leader's optimal strategy depends on the distribution of the followers' private types. We study an online learning version of this problem: a leader interacts for $T$ rounds with $n$ followers with types sampled from an unknown distribution every round. The leader's goal is to minimize regret, defined as the difference between the cumulative utility of the optimal strategy and that of the actually chosen strategies. We design learning algorithms for the leader under different feedback settings. Under type feedback, where the leader observes the followers' types after each round, we design algorithms that achieve $\mathcal O\big(\sqrt{\min\{L\log(nKA T), nK \} \cdot T} \big)$ regret for independent type distributions and $\mathcal O\big(\sqrt{\min\{L\log(nKA T), K^n \} \cdot T} \big)$ regret for general type distributions. Interestingly, those bounds do not grow with $n$ at a polynomial rate. Under action feedback, where the leader only observes the followers' actions, we design algorithms with $\mathcal O( \min\{\sqrt{ n^L K^L A^{2L} L T \log T}, K^n\sqrt{ T } \log T \} )$ regret. We also provide a lower bound of $Ω(\sqrt{\min\{L, nK\}T})$, almost matching the type-feedback upper bounds.
arXiv.org Artificial Intelligence
Oct-3-2025
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > Kosovo
- District of Gjilan > Kamenica (0.04)
- North America > United States
- California > Santa Clara County
- Stanford (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- New York > New York County
- New York City (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California > Santa Clara County
- Asia > Middle East
- Genre:
- Research Report (0.64)
- Industry:
- Education (0.67)
- Leisure & Entertainment > Games (0.46)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning (1.00)
- Representation & Reasoning > Agents (0.46)
- Data Science > Data Mining
- Big Data (0.68)
- Game Theory (1.00)
- Artificial Intelligence
- Information Technology