Multi-Agent Fully Decentralized Off-Policy Learning with Linear Convergence Rates

Open in new window