Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model

Open in new window