MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework