Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation

Open in new window