MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools

Open in new window