Efficient Evaluation of LLM Performance with Statistical Guarantees
