Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Open in new window