ReportBench: Evaluating Deep Research Agents via Academic Survey Tasks
