Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization