MAST: Multi-Agent Spatial Transformer for Learning to Collaborate