GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents

Open in new window