How to Select Datapoints for Efficient Human Evaluation of NLG Models?

Open in new window