Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization

Hoi-To Wai, Zhuoran Yang, Zhaoran Wang, Mingyi Hong

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/