Near-OptimalReinforcementLearningwithSelf-Play

Neural Information Processing Systems 

This paper considers the problem of designing optimal algorithms for reinforcement learning in two-player zero-sum games. We focus on self-play algorithms which learn theoptimal policy by playing againstitself without any direct supervision.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found