Automated Knowledge Component Generation and Knowledge Tracing for Coding Problems

Duan, Zhangqi, Fernandez, Nigel, Kanakadandi, Sri, Akram, Bita, Lan, Andrew

Feb-25-2025–arXiv.org Artificial Intelligence

Knowledge components (KCs) mapped to problems help model student learning, tracking their mastery levels on fine-grained skills thereby facilitating personalized learning and feedback in online learning platforms. However, crafting and tagging KCs to problems, traditionally performed by human domain experts, is highly labor-intensive. We present a fully automated, LLM-based pipeline for KC generation and tagging for open-ended programming problems. We also develop an LLM-based knowledge tracing (KT) framework to leverage these LLM-generated KCs, which we refer to as KCGen-KT. We conduct extensive quantitative and qualitative evaluations validating the effectiveness of KCGen-KT. On a real-world dataset of student code submissions to open-ended programming problems, KCGen-KT outperforms existing KT methods. We investigate the learning curves of generated KCs and show that LLM-generated KCs have a comparable level-of-fit to human-written KCs under the performance factor analysis (PFA) model. We also conduct a human evaluation to show that the KC tagging accuracy of our pipeline is reasonably accurate when compared to that by human domain experts.

kcgen-kt, mastery level, programming problem, (15 more...)

arXiv.org Artificial Intelligence

Feb-25-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - North Carolina (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - Massachusetts > Hampshire County
    - Amherst (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
  - Florida > Miami-Dade County
    - Miami (0.04)
- Europe
  - United Kingdom > England
    - Durham > Durham (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Africa > Middle East
  - Morocco (0.04)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Education > Educational Technology > Educational Software > Computer Based Training (0.66)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found