Provable Self-Play Algorithms for Competitive Reinforcement Learning
