LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation
