Provable Policy Gradient Methods for Average-Reward Markov Potential Games
