IQA-EVAL: Automatic Evaluation of Human-Model Interactive Question Answering
Neural Information Processing Systems
To evaluate Large Language Models (LLMs) for question answering (QA), traditional methods typically focus on assessing single-turn responses to given questions. However, this approach doesn't capture the dynamic nature of human-AI interactions, where humans actively seek information through conversation.
Oct-10-2025, 16:13:58 GMT
- Country:
  - Europe
    - Belgium > Brussels-Capital Region
      - Brussels (0.04)
    - Germany > Hamburg (0.04)
  - North America > United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - Texas > Travis County
      - Austin (0.04)
- Genre:
  - Research Report
    - Experimental Study (1.00)
    - New Finding (1.00)
- Technology: