Goto

Collaborating Authors

 Agents






Learning to Play Sequential Games versus Unknown Opponents

Neural Information Processing Systems

To this end, we use kernel-based regularity assumptions to capture and exploit the structure in the opponent's response. We propose a novel algorithm for the learner when playing against an adversarial sequence of opponents.


No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium Andrea Celli

Neural Information Processing Systems

Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium.