Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games

Open in new window