Transparent Reference-free Automated Evaluation of Open-Ended User Survey Responses
An, Subin, Ji, Yugyeong, Kim, Junyoung, Kook, Heejin, Lu, Yang, Seltzer, Josh
–arXiv.org Artificial Intelligence
Open-ended survey responses provide valuable insights in marketing research, but low-quality responses not only burden researchers with manual filtering but also risk leading to misleading conclusions, underscoring the need for effective evaluation. Existing automatic evaluation methods target LLM-generated text and inadequately assess human-written responses with their distinct characteristics. To address such characteristics, we propose a two-stage evaluation framework specifically designed for human survey responses. First, gibberish filtering removes nonsensical responses. Then, three dimensions-effort, relevance, and completeness-are evaluated using LLM capabilities, grounded in empirical analysis of real-world survey data. Validation on English and Korean datasets shows that our framework not only outperforms existing metrics but also demonstrates high practical applicability for real-world applications such as response quality prediction and response rejection, showing strong correlations with expert assessment.
arXiv.org Artificial Intelligence
Oct-9-2025
- Genre:
- Questionnaire & Opinion Survey (1.00)
- Research Report > New Finding (0.93)
- Personal > Interview (0.68)
- Technology: