ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

Dec-24-2025, 00:51:52 GMT–Neural Information Processing Systems

Modern multi-agent reinforcement learning frameworks rely on centralized training and reward shaping to perform well. However, centralized training and dense rewards are not readily available in the real world. Current multi-agent algorithms struggle to learn in the alternative setup of decentralized training or sparse rewards. To address these issues, we propose a self-supervised intrinsic reward \textit{ELIGN - expectation alignment - } inspired by the self-organization principle in Zoology. Similar to how animals collaborate in a decentralized manner with those in their vicinity, agents trained with expectation alignment learn behaviors that match their neighbors' expectations.

elign, expectation alignment, multi-agent intrinsic reward, (4 more...)

Neural Information Processing Systems

Dec-24-2025, 00:51:52 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (0.74)
  - Machine Learning > Reinforcement Learning (0.59)