Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization

Open in new window