Evaluating Large Language Models for Medical Calculations

Neural Information Processing Systems 

Current benchmarks for evaluating large language models (LLMs) in medicine are primarily focused on question-answering involving domain knowledge and descriptive reasoning.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found