Minimax-Optimal Multi-Agent RL in Markov Games With a Generative Model