Goto

Collaborating Authors

 Africa


4b3cc0d1c897ebcf71aca92a4a26ac83-Paper-Conference.pdf

Neural Information Processing Systems

More specifically,for the output features ofthe penultimate layer, for each class the within-class features converge to their means, and the means of different classes exhibit a certain tight frame structure, which is also aligned withthelastlayer'sclassifier.




LearningCollaborativePoliciestoSolveNP-hard RoutingProblems

Neural Information Processing Systems

The seeder generates as diversified candidate solutions as possible (seeds) while being dedicated to exploring over the full combinatorial action space (i.e.,sequence ofassignment action).


MinglingForesightwithImagination: Model-Based CooperativeMulti-AgentReinforcementLearning

Neural Information Processing Systems

Thispaperproposes animplicit model-based multi-agent reinforcement learning method based onvalue decomposition methods. Under this method, agents can interact with thelearned virtual environment and evaluate thecurrent state value according to imagined future states in the latent space, making agents have the foresight. Our approach can be applied toanymulti-agent value decomposition method.