Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning

Neural Information Processing Systems 

In cooperative multi-agent reinforcement learning, the centralized training and decentralized execution (CTDE) paradigm has achieved remarkable success.