SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation
Ma, Longxuan, Zhuang, Ziyu, Zhang, Weinan, Li, Mingda, Liu, Ting
arXiv.org Artificial Intelligence
This paper introduces a novel Self-supervised Fine-grained Dialogue Evaluation framework (SelF-Eval). The core idea is to model the correlation between the quality of individual turns and the quality of the entire dialogue. We first propose a novel data construction method that automatically assigns fine-grained scores to arbitrary dialogue data. We then train SelF-Eval with a multi-level contrastive learning schema, which helps the model distinguish between different score levels. Experimental results on multiple benchmarks show that SelF-Eval is highly consistent with human evaluations and outperforms state-of-the-art models. We give a detailed analysis of the experiments in this paper. Our code is available on GitHub.
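The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how scores are assigned or how the contrastive objective is defined. The sketch assumes that pseudo-scores come from replacing some turns with random distractor turns, and that the multi-level objective is a pairwise margin loss scaled by the gap between score levels. The names `corrupt_dialogue`, `multi_level_margin_loss`, and `margin` are hypothetical.

```python
import random
from typing import List, Sequence, Tuple


def corrupt_dialogue(
    dialogue: Sequence[str],
    distractors: Sequence[str],
    num_replaced: int,
    rng: random.Random,
) -> Tuple[List[str], float]:
    """Replace `num_replaced` turns with random distractor turns.

    Assumed labeling rule (not taken from the paper): the pseudo-score is
    the fraction of original turns that were left untouched.
    """
    if not dialogue:
        raise ValueError("dialogue must contain at least one turn")
    if not 0 <= num_replaced <= len(dialogue):
        raise ValueError("num_replaced must be between 0 and len(dialogue)")
    turns = list(dialogue)
    # Pick distinct positions, then overwrite each with a random distractor.
    for idx in rng.sample(range(len(turns)), num_replaced):
        turns[idx] = rng.choice(distractors)
    return turns, 1.0 - num_replaced / len(turns)


def multi_level_margin_loss(
    predicted: Sequence[float],
    levels: Sequence[int],
    margin: float = 0.1,
) -> float:
    """Pairwise margin loss over samples with ordered score levels.

    For every pair where sample i has a higher level than sample j, the
    predicted score of i should exceed that of j by `margin` times the
    level gap. This is an assumed stand-in for the paper's objective.
    """
    total, pairs = 0.0, 0
    for i in range(len(predicted)):
        for j in range(len(predicted)):
            gap = levels[i] - levels[j]
            if gap > 0:
                total += max(0.0, margin * gap - (predicted[i] - predicted[j]))
                pairs += 1
    return total / pairs if pairs else 0.0
```

Scaling the margin by the level gap is one way to make a model separate distant score levels more strongly than adjacent ones; a real system would compute the loss on model outputs inside a training loop rather than on plain Python lists.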
Sep-16-2022
- Country:
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Asia
- China
- Heilongjiang Province > Harbin (0.04)
- Hong Kong (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Italy > Tuscany
- Florence (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America
- Dominican Republic (0.04)
- United States > Minnesota
- Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report > Promising Solution (0.48)
- Technology: