Analyzing Uncertainty of LLM-as-a-Judge: Interval Evaluations with Conformal Prediction

Open in new window