No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding

Open in new window