Constrained C-Test Generation via Mixed-Integer Programming
Lee, Ji-Ung, Pfetsch, Marc E., Gurevych, Iryna
–arXiv.org Artificial Intelligence
This work proposes a novel method to generate C-Tests; a deviated form of cloze tests (a gap filling exercise) where only the last part of a word is turned into a gap. In contrast to previous works that only consider varying the gap size or gap placement to achieve locally optimal solutions, we propose a mixed-integer programming (MIP) approach. This allows us to consider gap size and placement simultaneously, achieving globally optimal solutions, and to directly integrate state-of-the-art models for gap difficulty prediction into the optimization problem. A user study with 40 participants across four C-Test generation strategies (including GPT-4) shows that our approach (MIP) significantly outperforms two of the baseline strategies (based on gap placement and GPT-4); and performs on-par with the third (based on gap size). Our analysis shows that GPT-4 still struggles to fulfill explicit constraints during generation and that MIP produces C-Tests that correlate best with the perceived difficulty. We publish our code, model, and collected data consisting of 32 English C-Tests with 20 gaps each (totaling 3,200 individual gap responses) under an open source license.
arXiv.org Artificial Intelligence
Apr-12-2024
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Oregon (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- California > Los Angeles County
- Santa Monica (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- Austria > Vienna (0.14)
- Italy (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Germany > Hesse
- Darmstadt Region > Darmstadt (0.04)
- Asia
- Middle East > Jordan (0.04)
- Japan (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- North America
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Promising Solution (0.86)
- Research Report
- Industry:
- Education > Educational Setting (0.45)
- Technology: