Automated Knowledge Component Generation and Knowledge Tracing for Coding Problems
Duan, Zhangqi, Fernandez, Nigel, Kanakadandi, Sri, Akram, Bita, Lan, Andrew
–arXiv.org Artificial Intelligence
Knowledge components (KCs) mapped to problems help model student learning, tracking their mastery levels on fine-grained skills thereby facilitating personalized learning and feedback in online learning platforms. However, crafting and tagging KCs to problems, traditionally performed by human domain experts, is highly labor-intensive. We present a fully automated, LLM-based pipeline for KC generation and tagging for open-ended programming problems. We also develop an LLM-based knowledge tracing (KT) framework to leverage these LLM-generated KCs, which we refer to as KCGen-KT. We conduct extensive quantitative and qualitative evaluations validating the effectiveness of KCGen-KT. On a real-world dataset of student code submissions to open-ended programming problems, KCGen-KT outperforms existing KT methods. We investigate the learning curves of generated KCs and show that LLM-generated KCs have a comparable level-of-fit to human-written KCs under the performance factor analysis (PFA) model. We also conduct a human evaluation to show that the KC tagging accuracy of our pipeline is reasonably accurate when compared to that by human domain experts.
arXiv.org Artificial Intelligence
Feb-25-2025
- Country:
- North America > United States
- North Carolina (0.04)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Europe
- United Kingdom > England
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Asia > Middle East
- UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Africa > Middle East
- Morocco (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Technology: