Promoting Coordination through Policy Regularization in Multi-Agent Deep Reinforcement Learning

Neural Information Processing Systems 

While the tractability of independent agent-wise exploration is appealing, this approach fails on tasks that require elaborate group strategies. We argue that coordinating the agents' policies can guide their