SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View
Xiao, Yongjie; Liang, Hongru; Qin, Peixin; Zhang, Yao; Lei, Wenqiang
arXiv.org Artificial Intelligence
Despite the great potential of large language models (LLMs) in machine comprehension, it is still unsettling to rely on them fully in real-world scenarios. This is probably because there is no sound account of whether the comprehension process of LLMs is aligned with that of experts. In this paper, we propose SCOP to carefully examine how LLMs perform during the comprehension process from a cognitive view. Specifically, it comprises a systematic definition of five requisite skills in the comprehension process, a strict framework for constructing testing data for these skills, and a detailed analysis of advanced open-source and closed-source LLMs using the testing data. With SCOP, we find that it is still challenging for LLMs to perform an expert-level comprehension process. Even so, we notice that LLMs share some similarities with experts, e.g., performing better at comprehending local information than global information. Further analysis reveals that LLMs can be somewhat unreliable: they might reach correct answers through flawed comprehension processes. Based on SCOP, we suggest that one direction for improving LLMs is to focus more on the comprehension process, ensuring all comprehension skills are thoroughly developed during training.
Sep-8-2025
- Country:
- Asia
- Bangladesh > Dhaka Division
- Dhaka District > Dhaka (0.04)
- China
- Sichuan Province (0.04)
- Tianjin Province > Tianjin (0.04)
- Middle East > Republic of Türkiye
- Malatya Province > Malatya (0.04)
- South Korea (0.04)
- Europe
- France (0.04)
- Italy (0.04)
- Middle East > Cyprus (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Prince Edward Island (0.04)
- United States
- Nevada > Washoe County
- Reno (0.04)
- New York > New York County
- New York City (0.04)
- Mississippi (0.04)
- Iowa (0.04)
- North Dakota (0.04)
- California > Los Angeles County
- Los Angeles (0.04)
- Oregon > Deschutes County
- Bend (0.04)
- Texas (0.04)
- Florida
- Broward County > Pompano Beach (0.04)
- Charlotte County > Punta Gorda (0.04)
- Dade County (0.04)
- Marion County > Ocala (0.14)
- Miami-Dade County
- Pasco County (0.04)
- Pinellas County (0.04)
- Volusia County > DeLand (0.04)
- Idaho > Ada County
- Boise (0.04)
- South Carolina > Greenville County
- Wade Hampton (0.14)
- Ohio > Lucas County
- Toledo (0.04)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Education (1.00)
- Energy (0.67)
- Government > Regional Government
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Leisure & Entertainment (1.00)
- Media > Music (1.00)