Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL
–Neural Information Processing Systems
Most existing results only focus on online settings, in which agents can interact with the environment during training.
Neural Information Processing Systems
Aug-16-2025, 03:24:01 GMT