QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation
Nguyen, Bang, Du, Tingting, Yu, Mengxia, Angrave, Lawrence, Jiang, Meng
–arXiv.org Artificial Intelligence
While the Question Generation (QG) task has been increasingly adopted in educational assessments, its evaluation remains limited by approaches that lack a clear connection to the educational values of test items. In this work, we introduce test item analysis, a method frequently used by educators to assess test question quality, into QG evaluation. Specifically, we construct pairs of candidate questions that differ in quality across dimensions such as topic coverage, item difficulty, item discrimination, and distractor efficiency. We then examine whether existing QG evaluation approaches can effectively distinguish these differences. Our findings reveal significant shortcomings in these approaches with respect to accurately assessing test item quality in relation to student performance. To address this gap, we propose a novel QG evaluation framework, QG-SMS, which leverages Large Language Model for Student Modeling and Simulation to perform test item analysis. As demonstrated in our extensive experiments and human evaluation study, the additional perspectives introduced by the simulated student profiles lead to a more effective and robust assessment of test items.
arXiv.org Artificial Intelligence
Mar-7-2025
- Country:
- North America
- United States
- Illinois (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- New York > New York County
- New York City (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Canada > Ontario
- Toronto (0.14)
- United States
- Europe
- Netherlands (0.04)
- Middle East > Malta (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- Asia
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- North America
- Genre:
- Research Report > New Finding (0.34)
- Instructional Material > Course Syllabus & Notes (0.31)
- Industry:
- Technology: