Benchmarking LLMs via Uncertainty Quantification

Fanghua Ye1,2 Mingming Yang1 Jianhui Pang1,3 Longyue Wang