IQA-E VAL: Automatic Evaluation of Human-Model Interactive Question Answering

Oct-10-2025, 16:13:58 GMT–Neural Information Processing Systems

To evaluate Large Language Models (LLMs) for question answering (QA), traditional methods typically focus on assessing single-turn responses to given questions. However, this approach doesn't capture the dynamic nature of human-AI interactions, where humans actively seek information through conversation.

evaluation, interaction, iqa-e val, (14 more...)

Neural Information Processing Systems

Oct-10-2025, 16:13:58 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Texas > Travis County
    - Austin (0.04)
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
- Europe
  - Germany > Hamburg (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.72)

Duplicate Docs Excel Report

Title
c6a23b26eaaefd187973658f5001f4fe-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found