E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance Can Chang 1, 2, Ni Mu

Neural Information Processing Systems 

The agents often have difficulties in cooperating on common goals, dividing complex tasks, and planning through several stages to make progress.