Learning to Cooperate with Humans using Generative Agents

Mar-21-2026, 01:34:16 GMT–Neural Information Processing Systems

Training agents that can coordinate zero-shot with humans is a key mission in multi-agent reinforcement learning (MARL). Current algorithms focus on training simulated human partner policies which are then used to train a Cooperator agent. The simulated human is produced either through behavior cloning over a dataset of human cooperation behavior, or by using MARL to create a population of simulated agents. However, these approaches often struggle to produce a Cooperator that can coordinate well with real humans, since the simulated humans fail to cover the diverse strategies and styles employed by people in the real world. We show \emph{learning a generative model of human partners} can effectively address this issue.

artificial intelligence, machine learning, reinforcement learning, (9 more...)

Neural Information Processing Systems

Mar-21-2026, 01:34:16 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.59)
  - Representation & Reasoning > Agents (0.41)