Large Language Models Miss the Multi-Agent Mark