No-Regret Learning Dynamics for Extensive-Form Correlated Equilibrium

Neural Information Processing Systems 

Specifically, it has been known for more than 20 years that when all players seek to minimize their internal regret in a repeated normal-form game, the empirical frequency of play converges to a normal-form correlated equilibrium.
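To make this convergence statement concrete, the following is a minimal sketch and not the construction used in this paper. It assumes a two-player game of Chicken with illustrative payoffs and uses a conditional regret-matching rule in the style of Hart and Mas-Colell as one possible internal-regret minimizer. The names `regret_matching`, `max_ce_violation`, `CHICKEN`, and the constant `mu` are hypothetical choices made for this example. The largest gain any player could obtain by deviating from a recommended action, measured against the empirical frequency of play, equals that player's largest average internal regret, so it should approach zero (or become negative) as play continues.

```python
import numpy as np

# Illustrative payoffs for Chicken (an assumption of this sketch).
# CHICKEN[i][a0, a1] is the payoff of player i when the row player
# picks a0 and the column player picks a1.
CHICKEN = (
    np.array([[6.0, 2.0], [7.0, 0.0]]),  # row player
    np.array([[6.0, 7.0], [2.0, 0.0]]),  # column player
)


def max_ce_violation(payoffs, dist):
    """Largest expected gain from swapping a recommended action j for k.

    A joint distribution `dist` is a normal-form correlated equilibrium
    exactly when this value is at most zero.
    """
    m0, m1 = dist.shape
    worst = -np.inf
    for j in range(m0):
        for k in range(m0):
            if k != j:
                gain = dist[j] @ (payoffs[0][k] - payoffs[0][j])
                worst = max(worst, float(gain))
    for j in range(m1):
        for k in range(m1):
            if k != j:
                gain = dist[:, j] @ (payoffs[1][:, k] - payoffs[1][:, j])
                worst = max(worst, float(gain))
    return worst


def regret_matching(payoffs, steps, mu, seed=0):
    """Simulate conditional regret matching; return the empirical joint play."""
    rng = np.random.default_rng(seed)
    sizes = payoffs[0].shape
    # cum[i][j, k]: total gain player i would have made by playing k
    # in every round in which it actually played j.
    cum = [np.zeros((sizes[i], sizes[i])) for i in range(2)]
    counts = np.zeros(sizes)
    actions = [0, 0]  # arbitrary starting actions
    for t in range(1, steps + 1):
        a0, a1 = actions
        counts[a0, a1] += 1
        alt0 = payoffs[0][:, a1]  # row player's payoff for each alternative
        alt1 = payoffs[1][a0, :]  # column player's payoff for each alternative
        cum[0][a0] += alt0 - alt0[a0]
        cum[1][a1] += alt1 - alt1[a1]
        next_actions = []
        for i in range(2):
            j = actions[i]
            # Switch from j to k with probability proportional to the
            # positive part of the average regret for "k instead of j".
            probs = np.maximum(cum[i][j], 0.0) / (mu * t)
            probs[j] = 0.0
            probs[j] = 1.0 - probs.sum()  # otherwise repeat action j
            next_actions.append(int(rng.choice(sizes[i], p=probs)))
        actions = next_actions
    return counts / steps


# mu must exceed (number of actions - 1) times the largest payoff gap so
# that the switching probabilities stay valid; 14 is a conservative choice here.
empirical = regret_matching(CHICKEN, steps=20000, mu=14.0)
print("empirical frequency of play:")
print(empirical)
print("max CE incentive violation:", max_ce_violation(CHICKEN, empirical))
```

The check in `max_ce_violation` is just the set of linear incentive constraints that define a normal-form correlated equilibrium, evaluated on the empirical distribution. Any other internal-regret minimizer could be substituted for the update rule without changing that check.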