Towards Objective Evaluation of Socially-Situated Conversational Robots: Assessing Human-Likeness through Multimodal User Behaviors

Inoue, Koji, Lala, Divesh, Ochi, Keiko, Kawahara, Tatsuya, Skantze, Gabriel

Sep-25-2023–arXiv.org Artificial Intelligence

This paper tackles the challenging task of evaluating socially situated conversational robots and presents a novel objective evaluation approach that relies on multimodal user behaviors. In this study, our main focus is on assessing the human-likeness of the robot as the primary evaluation metric. While previous research often relied on subjective evaluations from users, our approach aims to evaluate the robot's human-likeness based on observable user behaviors indirectly, thus enhancing objectivity and reproducibility. To begin, we created an annotated dataset of human-likeness scores, utilizing user behaviors found in an attentive listening dialogue corpus. We then conducted an analysis to determine the correlation between multimodal user behaviors and human-likeness scores, demonstrating the feasibility of our proposed behavior-based evaluation method.

artificial intelligence, natural language, user behavior, (15 more...)

arXiv.org Artificial Intelligence

Sep-25-2023

arXiv.org PDF

Add feedback

Country:
- Asia > Japan (0.17)
- Europe
  - France (0.16)
  - Sweden (0.14)
- North America > United States (0.14)

Genre:
- Research Report > New Finding (0.36)

Industry:
- Health & Medicine (0.94)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (0.31)
  - Robots (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found