Deep Research Bench: Evaluating AI Web Research Agents