open
–Neural Information Processing Systems
We create GTA (a benchmark forGeneral Tool Agents) to evaluate the general tool-use ability ofLLMs inreal-worldscenarios. Who created the dataset (e.g., which team, research group) and on behalf of which entity(e.g.,company,institution,organization)?
Neural Information Processing Systems
Feb-16-2026, 11:44:11 GMT