WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts Supplementary Material

Neural Information Processing Systems 

For details on M1-M5, please refer to Appendix B.3.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found