Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews

Liu, Mengqiao, Wang, Tevin, Cohen, Cassandra A., Li, Sarah, Xiong, Chenyan

Feb-21-2025–arXiv.org Artificial Intelligence

Which large language model (LLM) is better? Every evaluation tells a story, but what do users really think about current LLMs? This paper presents CLUE, an LLM-powered interviewer that conducts in-the-moment user experience interviews right after users interact with LLMs and automatically gathers insights about user opinions from massive interview logs. We conduct a study with thousands of users to understand user opinions on mainstream LLMs, recruiting users to first chat with a target LLM and then be interviewed by CLUE. Our experiments demonstrate that CLUE captures interesting user opinions, such as the bipolar views on the displayed reasoning process of DeepSeek-R1 and demands for information freshness and multi-modality. Our collected chat-and-interview logs will be released.

interview, llm, user opinion

Country:
- North America > United States
  - Alaska (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - California > Alameda County
    - Berkeley (0.04)
- Asia > Myanmar
  - Tanintharyi Region > Dawei (0.04)

Genre:
- Research Report (1.00)
- Personal > Interview (1.00)
- Questionnaire & Opinion Survey (0.99)

Industry:
- Banking & Finance (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
