ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward Zixian Ma1

Neural Information Processing Systems 

To address these issues, we propose a self-supervised intrinsic reward ELIGN - expectation alignment - inspired by the self-organization principle in Zoology.