DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning

Open in new window