TreeEval: Benchmark-Free Evaluation of Large Language Models through Tree Planning

Open in new window