Near-Optimal Reinforcement Learning with Self-Play

Open in new window