Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning

Open in new window