Evaluating Large Language Models for Medical Calculations
Neural Information Processing Systems
Current benchmarks for evaluating large language models (LLMs) in medicine are primarily focused on question-answering involving domain knowledge and descriptive reasoning.
Nov-19-2025, 22:48:12 GMT
- Country:
  - Europe
    - Monaco (0.04)
    - Netherlands (0.04)
    - Spain > Aragón (0.04)
  - North America > United States
    - Illinois
      - Champaign County > Urbana (0.04)
      - Cook County > Chicago (0.04)
    - Virginia (0.04)
- Genre:
  - Research Report
    - Experimental Study (0.68)
    - New Finding (1.00)
- Technology: