Better than Average: Paired Evaluation of NLP Systems

Open in new window