ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Open in new window