E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance

Neural Information Processing Systems 

In multi-agent reinforcement learning, agents often have difficulty cooperating on common goals, dividing complex tasks, and planning through several stages to make progress.