PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning
– Neural Information Processing Systems
To enable decentralized execution, we also derive factorized per-agent policies inspired by a maximum-entropy MARL framework.
Feb-9-2026, 11:20:00 GMT
- Technology: