CORA: Coalitional Rational Advantage Decomposition for Multi-Agent Policy Gradients