Multi-agent Markov Entanglement

Neural Information Processing Systems 

Value decomposition has long been a fundamental technique in multi-agent reinforcement learning and dynamic programming.