Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Neural Information Processing Systems 

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the theoretical understanding of such methods is limited.