Regularized Gradient Descent Ascent for Two-Player Zero-Sum Markov Games

Neural Information Processing Systems 

Exploiting this property, Daskalakis et al. [2020] builds on the theory of Lin et al. [2020a] and shows