Automatic Evaluation of Healthcare LLMs Beyond Question-Answering