Multi-Agent Reinforcement Learning is A Sequence Modeling Problem