Evaluating Multi-Agent Coordination Abilities in Large Language Models