Augmenting Human Evaluation with LLM Judges: How Many Human Reviews Do You Need?

Open in new window