No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium Andrea Celli

Neural Information Processing Systems 

Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium.