ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments

Open in new window