LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments

Open in new window