Reliable and diverse evaluation of LLM medical knowledge mastery

Open in new window