Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning
Waelchli, Daniel, Weber, Pascal, Koumoutsakos, Petros
–arXiv.org Artificial Intelligence
The discovery of individual objectives in collective behavior of complex dynamical systems such as fish schools and bacteria colonies is a long-standing challenge. Inverse reinforcement learning is a potent approach for addressing this challenge but its applicability to dynamical systems, involving continuous state-action spaces and multiple interacting agents, has been limited. In this study, we tackle this challenge by introducing an off-policy inverse multi-agent reinforcement learning algorithm (IMARL). Our approach combines the ReF-ER techniques with guided cost learning. By leveraging demonstrations, our algorithm automatically uncovers the reward function and learns an effective policy for the agents. Through extensive experimentation, we demonstrate that the proposed policy captures the behavior observed in the provided data, and achieves promising results across problem domains including single agent models in the OpenAI gym and multi-agent models of schooling behavior. The present study shows that the proposed IMARL algorithm is a significant step towards understanding collective dynamics from the perspective of its constituents, and showcases its value as a tool for studying complex physical systems exhibiting collective behaviour.
arXiv.org Artificial Intelligence
May-17-2023
- Country:
- South America > Brazil
- São Paulo (0.04)
- North America
- United States
- Texas > Harris County
- Houston (0.04)
- New York
- New York County > New York City (0.14)
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Texas > Harris County
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- Europe
- Portugal (0.04)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- South America > Brazil
- Genre:
- Research Report (0.70)
- Technology: