Mean-Field Sampling for Cooperative Multi-Agent Reinforcement Learning

Neural Information Processing Systems 

Designing efficient algorithms for multi-agent reinforcement learning (MARL) is fundamentally challenging because the size of the joint state and action spaces grows exponentially in the number of agents. These difficulties are exacerbated when balancing sequential global decision-making with local agent interactions.