Generative Data Imputation for Sparse Learner Performance Data Using Generative Adversarial Imputation Networks

Zhang, Liang, Lin, Jionghao, Sabatini, John, Zapata-Rivera, Diego, Forsyth, Carol, Jiang, Yang, Hollander, John, Hu, Xiangen, Graesser, Arthur C.

arXiv.org Artificial Intelligence 

ADVANCEMENTS in AI-driven technologies have significantly enhanced modern education through personalized tutoring and adaptive learning strategies on online platforms [1], [2]. Intelligent Tutoring Systems (ITSs) exemplify this progress by leveraging advanced machine learning and natural language processing models to create interactive learning environments that improve outcomes across domains such as literacy [3], mathematics [4], language learning [5], biology [6], and other STEM fields [7]. As human learners interact with ITSs, often through question-and-answer scenarios with immediate responses, their performance data become crucial for learner modeling, enabling systems to track progress, predict future performance, and adapt instruction accordingly [8]. Learner models such as Bayesian Knowledge Tracing (BKT) and other knowledge tracing variants use these performance data to uncover learning characteristics and to estimate knowledge states and acquisition [9].

However, in real-world scenarios, missing learner performance data are prevalent due to factors such as learner dropout or disengagement [10], technical issues or incomplete data logging [11], biased sampling within experimental groups [12], and more. These factors often lead to sparse data, in which items (i.e., questions or problems) remain unattempted (e.g., learners may bypass a question, leave it unanswered due to a lack of response initiation, or make no attempt to engage with it) and learner interactions are limited [13], [14]. As shown in Figure 1, missing performance records can occur along both the attempt and question dimensions during learner-ITS interactions. In the right portion of the figure's two matrices, entries marked with "?" represent missing performance records.