TaskBench: Benchmarking Large Language Models for Task Automation
