PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning