Automated Metrics for Medical Multi-Document Summarization Disagree with Human Evaluations

Open in new window