Sparse Factor Analysis for Learning and Content Analytics

Lan, Andrew S., Waters, Andrew E., Studer, Christoph, Baraniuk, Richard G.

Jul-19-2013–arXiv.org Machine Learning

We develop a new model and algorithms for machine learning-based learning analytics, which estimate a learner's knowledge of the concepts underlying a domain, and content analytics, which estimate the relationships among a collection of questions and those concepts. Our model represents the probability that a learner provides the correct response to a question in terms of three factors: their understanding of a set of underlying concepts, the concepts involved in each question, and each question's intrinsic difficulty. We estimate these factors given the graded responses to a collection of questions. The underlying estimation problem is ill-posed in general, especially when only a subset of the questions are answered. The key observation that enables a well-posed solution is the fact that typical educational domains of interest involve only a small number of key concepts. Leveraging this observation, we develop both a bi-convex maximum-likelihood and a Bayesian solution to the resulting SPARse Factor Analysis (SPARFA) problem. We also incorporate user-defined tags on questions to facilitate the interpretability of the estimated factors. Experiments with synthetic and real-world data demonstrate the efficacy of our approach. Finally, we make a connection between SPARFA and noisy, binary-valued (1-bit) dictionary learning that is of independent interest.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Machine Learning

Jul-19-2013

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- North America > United States
  - Illinois > Cook County
    - Chicago (0.04)
  - Texas > Harris County
    - Houston (0.04)

Genre:
- Instructional Material > Course Syllabus & Notes (0.67)
- Research Report (1.00)

Industry:
- Education
  - Educational Setting (0.67)
  - Educational Technology > Educational Software
    - Computer Based Training (0.68)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning
      - Learning Graphical Models > Directed Networks
        Bayesian Learning (1.00)
      - Statistical Learning (1.00)
    - Representation & Reasoning > Uncertainty
      - Bayesian Inference (1.00)
  - Data Science > Data Mining (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found