Evaluating large language models in medical applications: a survey
