Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models