UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers