Enhancing Medical Text Evaluation with GPT-4