FEAT: A Preference Feedback Dataset through a Cost-Effective Auto-Generation and Labeling Framework for English AI Tutoring
Seo, Hyein, Hwang, Taewook, Lee, Yohan, Jung, sangkeun
–arXiv.org Artificial Intelligence
In English education tutoring, teacher feedback is essential for guiding students. Recently, AI-based tutoring systems have emerged to assist teachers; however, these systems require high-quality and large-scale teacher feedback data, which is both time-consuming and costly to generate manually. In this study, we propose FEAT, a cost-effective framework for generating teacher feedback, and have constructed three complementary datasets: (1) DIRECT-Manual (DM), where both humans and large language models (LLMs) collaboratively generate high-quality teacher feedback, albeit at a higher cost; (2) DIRECT-Generated (DG), an LLM-only generated, cost-effective dataset with lower quality;, and (3) DIRECT-Augmented (DA), primarily based on DG with a small portion of DM added to enhance quality while maintaining cost-efficiency. Experimental results showed that incorporating a small portion of DM (5-10%) into DG leads to superior performance compared to using 100% DM alone.
arXiv.org Artificial Intelligence
Aug-15-2025
- Country:
- Asia > Thailand
- Europe > Spain
- Catalonia > Barcelona Province > Barcelona (0.04)
- North America
- Mexico > Mexico City
- Mexico City (0.05)
- United States
- Florida > Miami-Dade County
- Miami (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Washington > King County
- Seattle (0.04)
- Florida > Miami-Dade County
- Mexico > Mexico City
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: