CGBENCH: Benchmarking Language Model Scientific Reasoning for Clinical Genetics Research
–Neural Information Processing Systems
Variant and gene interpretation are fundamental to personalized medicine and translational biomedicine. However, traditional approaches are manual and labor-intensive. Generative language models (LMs) can facilitate this process, accelerating the translation of fundamental research into clinically-actionable insights. While existing benchmarks have attempted to quantify the capabilities of LMs for interpreting scientific data, these studies focus on narrow tasks that do not translate to real-world research. To meet these challenges, we introduce CGBENCH, a robust benchmark that tests reasoning capabilities of LMs on scientific publications.
Neural Information Processing Systems
Jun-15-2026, 06:12:15 GMT
- Country:
- North America > United States > Minnesota (0.27)
- Genre:
- Questionnaire & Opinion Survey (0.67)
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Technology: