Sample and Communication Efficient Fully Decentralized MARL Policy Evaluation via a New Approach: Local TD update
