AITopics | comprehension question

Collaborating Authors

comprehension question

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Question Generation for Assessing Early Literacy Reading Comprehension

Yang, Xiaocheng, Shashidhar, Sumuk, Hakkani-Tur, Dilek

arXiv.org Artificial IntelligenceJul-31-2025

Assessment of reading comprehension through content-based interactions plays an important role in the reading acquisition process. In this paper, we propose a novel approach for generating comprehension questions geared to K-2 English learners. Our method ensures complete coverage of the underlying material and adaptation to the learner's specific proficiencies, and can generate a large diversity of question types at various difficulty levels to ensure a thorough evaluation. We evaluate the performance of various language models in this framework using the FairytaleQA dataset as the source material. Eventually, the proposed approach has the potential to become an important part of autonomous AI-driven English instructors.

computational linguistic, natural language, question answering, (16 more...)

arXiv.org Artificial Intelligence

2507.2241

Country: North America > United States > Illinois (0.16)

Genre: Research Report (0.71)

Industry:

Education > Educational Setting > K-12 Education (0.70)
Education > Assessment & Standards > Student Performance (0.63)
Government > Regional Government (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.58)

Add feedback

AutoMCQ -- Automatically Generate Code Comprehension Questions using GenAI

Goodfellow, Martin, Booth, Robbie, Fagan, Andrew, Lambert, Alasdair

arXiv.org Artificial IntelligenceMay-23-2025

Students often do not fully understand the code they have written. This sometimes does not become evident until later in their education, which can mean it is harder to fix their incorrect knowledge or misunderstandings. In addition, being able to fully understand code is increasingly important in a world where students have access to generative artificial intelligence (GenAI) tools, such as GitHub Copilot. One effective solution is to utilise code comprehension questions, where a marker asks questions about a submission to gauge understanding, this can also have the side effect of helping to detect plagiarism. However, this approach is time consuming and can be difficult and/or expensive to scale. This paper introduces AutoMCQ, which uses GenAI for the automatic generation of multiple-choice code comprehension questions. This is integrated with the CodeRunner automated assessment platform.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2505.1643

Country: Europe > United Kingdom > Scotland (0.19)

Genre: Research Report (0.40)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.36)

Add feedback

Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty

Jain, Yoshee, Hollander, John, He, Amber, Tang, Sunny, Zhang, Liang, Sabatini, John

arXiv.org Artificial IntelligenceFeb-24-2025

Reading comprehension is a key for individual success, yet the assessment of question difficulty remains challenging due to the extensive human annotation and large-scale testing required by traditional methods such as linguistic analysis and Item Response Theory (IRT). While these robust approaches provide valuable insights, their scalability is limited. There is potential for Large Language Models (LLMs) to automate question difficulty estimation; however, this area remains underexplored. Our study investigates the effectiveness of LLMs, specifically OpenAI's GPT-4o and o1, in estimating the difficulty of reading comprehension questions using the Study Aid and Reading Assessment (SARA) dataset. We evaluated both the accuracy of the models in answering comprehension questions and their ability to classify difficulty levels as defined by IRT. The results indicate that, while the models yield difficulty estimates that align meaningfully with derived IRT parameters, there are notable differences in their sensitivity to extreme item characteristics. These findings suggest that LLMs can serve as the scalable method for automated difficulty assessment, particularly in dynamic interactions between learners and Adaptive Instructional Systems (AIS), bridging the gap between traditional psychometric techniques and modern AIS for reading comprehension and paving the way for more adaptive and personalized educational assessments. The manuscript has been accepted for presentation at the 27th International Conference on Human-Computer Interaction in Gothenburg, Sweden, from June 22-27, 2025.

assessment, language model, question difficulty, (12 more...)

arXiv.org Artificial Intelligence

2502.17785

Country:

Europe > Sweden > Vaestra Goetaland > Gothenburg (0.24)
North America > United States > Illinois > Champaign County > Urbana (0.14)
North America > United States > Tennessee > Shelby County > Memphis (0.05)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Education > Assessment & Standards > Student Performance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)

Add feedback

Automatic Feedback Generation for Short Answer Questions using Answer Diagnostic Graphs

Furuhashi, Momoka, Funayama, Hiroaki, Iwase, Yuya, Matsubayashi, Yuichiroh, Isobe, Yoriko, Nagahama, Toru, Sugawara, Saku, Inui, Kentaro

arXiv.org Artificial IntelligenceJan-26-2025

Short-reading comprehension questions help students understand text structure but lack effective feedback. Students struggle to identify and correct errors, while manual feedback creation is labor-intensive. This highlights the need for automated feedback linking responses to a scoring rubric for deeper comprehension. Despite advances in Natural Language Processing (NLP), research has focused on automatic grading, with limited work on feedback generation. To address this, we propose a system that generates feedback for student responses. Our contributions are twofold. First, we introduce the first system for feedback on short-answer reading comprehension. These answers are derived from the text, requiring structural understanding. We propose an "answer diagnosis graph," integrating the text's logical structure with feedback templates. Using this graph and NLP techniques, we estimate students' comprehension and generate targeted feedback. Second, we evaluate our feedback through an experiment with Japanese high school students (n=39). They answered two 70-80 word questions and were divided into two groups with minimal academic differences. One received a model answer, the other system-generated feedback. Both re-answered the questions, and we compared score changes. A questionnaire assessed perceptions and motivation. Results showed no significant score improvement between groups, but system-generated feedback helped students identify errors and key points in the text. It also significantly increased motivation. However, further refinement is needed to enhance text structure understanding.

artificial intelligence, natural language, student, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.21125/edulearn.2024.1590

2501.15777

Country:

Asia > Japan > Honshū > Tōhoku (0.06)
North America > United States > Ohio > Franklin County > Columbus (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Asia > Middle East > UAE (0.04)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.95)

Industry:

Education > Assessment & Standards > Student Performance (1.00)
Education > Educational Setting > K-12 Education > Secondary School (0.55)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Digital Comprehensibility Assessment of Simplified Texts among Persons with Intellectual Disabilities

Säuberli, Andreas, Holzknecht, Franz, Haller, Patrick, Deilen, Silvana, Schiffl, Laura, Hansen-Schirra, Silvia, Ebling, Sarah

arXiv.org Artificial IntelligenceFeb-20-2024

Text simplification refers to the process of increasing the comprehensibility of texts. Automatic text simplification models are most commonly evaluated by experts or crowdworkers instead of the primary target groups of simplified texts, such as persons with intellectual disabilities. We conducted an evaluation study of text comprehensibility including participants with and without intellectual disabilities reading unsimplified, automatically and manually simplified German texts on a tablet computer. We explored four different approaches to measuring comprehensibility: multiple-choice comprehension questions, perceived difficulty ratings, response time, and reading speed. The results revealed significant variations in these measurements, depending on the reader group and whether the text had undergone automatic or manual simplification. For the target group of persons with intellectual disabilities, comprehension questions emerged as the most reliable measure, while analyzing reading speed provided valuable insights into participants' reading behavior.

participant, simplification, target group, (12 more...)

arXiv.org Artificial Intelligence

2402.13094

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
North America > United States > New York > New York County > New York City (0.14)
North America > United States > Hawaii > Honolulu County > Honolulu (0.05)
(12 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Neurology (0.93)

Technology:

Information Technology > Human Computer Interaction (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Cognitive Science (0.68)

Add feedback

Can LLMs Grade Short-answer Reading Comprehension Questions : Foundational Literacy Assessment in LMICs

Henkel, Owen, Hills, Libby, Roberts, Bill, McGrane, Joshua

arXiv.org Artificial IntelligenceOct-26-2023

This paper presents emerging evidence of using generative large language models (i.e., GPT-4) to reliably evaluate short-answer reading comprehension questions. Specifically, we explore how various configurations of generative (LLMs) are able to evaluate student responses from a new dataset, drawn from a battery of reading assessments conducted with over 150 students in Ghana. As this dataset is novel and hence not used in training runs of GPT, it offers an opportunity to test for domain shift and evaluate the generalizability of generative LLMs, which are predominantly designed and trained on data from high-income North American countries. We found that GPT-4, with minimal prompt engineering performed extremely well on evaluating the novel dataset (Quadratic Weighted Kappa 0.923, F1 0.88), substantially outperforming transfer-learning based approaches, and even exceeding expert human raters (Quadratic Weighted Kappa 0.915, F1 0.87). To the best of our knowledge, our work is the first to empirically evaluate the performance of generative LLMs on short-answer reading comprehension questions, using real student data, and suggests that generative LLMs have the potential to reliably evaluate foundational literacy. Currently the assessment of formative literacy and numeracy is infrequent in many low and middle-income countries (LMICs) due to the cost and operational complexities of conducting them at scale. Automating the grading process for reading assessment could enable wider usage, and in turn improve decision-making regarding curricula, school management, and teaching practice at the classroom level. Importantly, in contrast transfer learning based approaches, generative LLMs generalize well and the technical barriers to their use are low, making them more feasible to implement and scale in lower resource educational contexts.

comprehension question, foundational literacy assessment, llm grade short-answer, (1 more...)

arXiv.org Artificial Intelligence

2310.18373

Country: Africa > Ghana (0.24)

Genre: Research Report (0.40)

Industry:

Education > Assessment & Standards > Student Performance (1.00)
Education > Educational Setting > K-12 Education > Primary School (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

Add feedback

A Discriminative Model for Identifying Readers and Assessing Text Comprehension from Eye Movements

Makowski, Silvia, Jäger, Lena, Abdelwahab, Ahmed, Landwehr, Niels, Scheffer, Tobias

arXiv.org Machine LearningSep-21-2018

We study the problem of inferring readers' identities and estimating their level of text comprehension from observations of their eye movements during reading. We develop a generative model of individual gaze patterns (scanpaths) that makes use of lexical features of the fixated words. Using this generative model, we derive a Fisher-score representation of eye-movement sequences. We study whether a Fisher-SVM with this Fisher kernel and several reference methods are able to identify readers and estimate their level of text comprehension based on eye-tracking data. While none of the methods are able to estimate text comprehension accurately, we find that the SVM with Fisher kernel excels at identifying readers.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1809.08031

Country:

Europe > Germany (0.69)
North America > United States (0.68)

Genre: Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Surprise: Youve got some explaining to do

Foster, Meadhbh, Keane, Mark T.

arXiv.org Artificial IntelligenceAug-9-2013

Why are some events more surprising than others? We propose that events that are more difficult to explain are those that are more surprising. The two experiments reported here test the impact of different event outcomes (Outcome-Type) and task demands (Task) on ratings of surprise for simple story scenarios. For the Outcome-Type variable, participants saw outcomes that were either known or less-known surprising outcomes for each scenario. For the Task variable, participants either answered comprehension questions or provided an explanation of the outcome. Outcome-Type reliably affected surprise judgments; known outcomes were rated as less surprising than less-known outcomes. Task also reliably affected surprise judgments; when people provided an explanation it lowered surprise judgments relative to simply answering comprehension questions. Both experiments thus provide evidence on this less-explored explanation aspect of surprise, specifically showing that ease of explanation is a key factor in determining the level of surprise experienced.

artificial intelligence, explanation, scenario, (15 more...)

arXiv.org Artificial Intelligence

1308.2236

Country:

Europe > Germany (0.16)
North America > United States (0.15)

Genre: Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback