DHP Benchmark: Are LLMs Good NLG Evaluators?

Open in new window