SHAQ: IncorporatingShapleyValueTheoryinto Multi-AgentQ-Learning
–Neural Information Processing Systems
Value factorisation is a useful technique for multi-agent reinforcement learning (MARL) in global reward game, however, its underlying mechanism is not yet fully understood.
Neural Information Processing Systems
Feb-7-2026, 23:54:18 GMT