Pessimism Meets Invariance: Provably Efficient Offline Mean-Field Multi-Agent RL

Neural Information Processing Systems 

Most existing results only focus on online settings, in which agents can interact with the environment during training.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found