ConnectomeBench: Can LLMs Proofread the Connectome?
–Neural Information Processing Systems
Connectomics--the mapping of neural connections in an organism's brain--currently requires extraordinary human effort to proofread the data collected from imaging and machine-learning assisted segmentation. With the growing excitement around using AI agents to automate important scientific tasks, we explore whether current AI systems can perform multiple tasks necessary for data proofreading. We introduce ConnectomeBench, a multimodal benchmark evaluating large language model (LLM) capabilities in three critical proofreading tasks: segment type identification, split error correction, and merge error detection. Using expert annotated data from two large open-source datasets--a cubic millimeter of mouse visual cortex and the complete Drosophila brain--we evaluate proprietary multimodal LLMs including Claude 3.7/4 Sonnet, o4-mini, GPT-4.1,
Neural Information Processing Systems
Jun-17-2026, 01:56:41 GMT
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.88)
- Technology: