Rethinking Individual Global Max in Cooperative Multi-Agent Reinforcement Learning
Neural Information Processing Systems
In cooperative multi-agent reinforcement learning, centralized training and decentralized execution (CTDE) has achieved remarkable success.
Nov-16-2025, 07:21:38 GMT