Offline Multi-Agent Reinforcement Learning with Knowledge Distillation

Neural Information Processing Systems 

We introduce an offline multi-agent reinforcement learning (offline MARL) framework that utilizes previously collected data without additional online data collection.