A Environmental Settings

Neural Information Processing Systems 

They need to occupy 7 landmarks with size of 0.05. And the acceleration of agents is 0.7. Each predator is only allowed to communicate with three closest predators. The team reward is similar for both tasks. In cooperative navigation and predator prey, our model is trained based on MADDPG.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found