T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step

Open in new window