WenMind: A Comprehensive Benchmark for Evaluating Large Language Models in Chinese Classical Literature and Language Arts

Oct-10-2025, 03:43:42 GMT–Neural Information Processing Systems

Large Language Models (LLMs) have made significant advancements across numerous domains, but their capabilities in Chinese Classical Literature and Language Arts (CCLLA) remain largely unexplored due to the limited scope and tasks of existing benchmarks. To fill this gap, we propose WenMind, a comprehensive benchmark dedicated for evaluating LLMs in CCLLA.

benchmark, llm, qwen1, (14 more...)

Neural Information Processing Systems

Oct-10-2025, 03:43:42 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada
  - Ontario > Toronto (0.04)
- Europe
  - Bulgaria (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)
- Asia
  - Singapore (0.04)
  - Macao (0.04)
  - Indonesia > Bali (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - China
    - Jiangsu Province > Nanjing (0.04)
    - Guangdong Province > Guangzhou (0.04)

Genre:
- Research Report (0.67)
- Overview (0.45)

Industry:
- Education (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.71)

Duplicate Docs Excel Report

Title
5c1019b5711474ae5627dc8580614e01-Paper-Datasets_and_Benchmarks_Track.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found