UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers

Open in new window