Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation

Chu, Yucheng, He, Peng, Li, Hang, Han, Haoyu, Yang, Kaiqi, Xue, Yu, Li, Tingting, Krajcik, Joseph, Tang, Jiliang

Jun-5-2025–arXiv.org Artificial Intelligence

Short answer assessment is a vital component of science education, allowing evaluation of students' complex three-dimensional understanding. Large language models (LLMs) that possess human-like ability in linguistic tasks are increasingly popular in assisting human graders to reduce their workload. However, LLMs' limitations in domain knowledge restrict their understanding in task-specific requirements and hinder their ability to achieve satisfactory performance. Retrieval-augmented generation (RAG) emerges as a promising solution by enabling LLMs to access relevant domain-specific knowledge during assessment. In this work, we propose an adaptive RAG framework for automated grading that dynamically retrieves and incorporates domain-specific knowledge based on the question and student answer context. Our approach combines semantic search and curated educational sources to retrieve valuable reference materials. Experimental results in a science education dataset demonstrate that our system achieves an improvement in grading accuracy compared to baseline LLM approaches. The findings suggest that RAG-enhanced grading systems can serve as reliable support with efficient performance gains.

dimension, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

Jun-5-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.69)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Education
  - Curriculum > Subject-Specific Education (1.00)
  - Assessment & Standards > Student Performance (0.89)
  - Educational Technology > Educational Software
    - Computer-Aided Assessment (0.69)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.72)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found