Goto

Collaborating Authors

 Agents






MinglingForesightwithImagination: Model-Based CooperativeMulti-AgentReinforcementLearning

Neural Information Processing Systems

Thispaperproposes animplicit model-based multi-agent reinforcement learning method based onvalue decomposition methods. Under this method, agents can interact with thelearned virtual environment and evaluate thecurrent state value according to imagined future states in the latent space, making agents have the foresight. Our approach can be applied toanymulti-agent value decomposition method.



Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning Yiqin Y ang

Neural Information Processing Systems

Moreover, we extend ICQ to multi-agent tasks by decomposing the joint-policy under the implicit constraint. Experimental results demonstrate that the extrapolation error is successfully controlled within a reasonable range and insensitive to the number of agents.