Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
