SupplementarymaterialforVariationalAutomatic CurriculumLearningforSparse-Reward CooperativeMulti-AgentProblems

Feb-8-2026, 15:44:56 GMT–Neural Information Processing Systems

All the source code can be found at our project websitehttps://sites.google.com/view/ The proof is largely based on [2]. The speaker and listener obtain +1 reward when the listener covers the correct landmark. We construct theHard-Spreadscenario by adding walls toseparate the room into three parts. For the tasks in the particle-world environment, we evaluate the performances of our algorithm andbaselines withtheaverage coverage oflandmarks inthelastfiveevaluation stepswithin every episode.

inlock-and-return, logp, push-ball, (11 more...)

Neural Information Processing Systems

Feb-8-2026, 15:44:56 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology (0.68)

Duplicate Docs Excel Report

Title
Supplementary material for Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems

Similar Docs Excel Report more

Title	Similarity	Source
None found