MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework
