CIBench: Evaluating Your LLMs with a Code Interpreter Plugin
