PRD: Peer Rank and Discussion Improve Large Language Model based Evaluations
