Better Correlation and Robustness: A Distribution-Balanced Self-Supervised Learning Framework for Automatic Dialogue Evaluation
–Neural Information Processing Systems
Turn-level dialogue evaluation models (TDEMs), using self-supervised learning (SSL) framework, have achieved state-of-the-art performance in open-domain dialogue evaluation. However, these models inevitably face two potential problems. First, they have low correlations with humans on medium coherence samples as the SSL framework often brings training data with unbalanced coherence distribution. Second, the SSL framework leads TDEM to nonuniform score distribution. There is a danger that the nonuniform score distribution will weaken the robustness of TDEM through our theoretical analysis.
Neural Information Processing Systems
May-30-2025, 21:11:09 GMT
- Country:
- Asia (0.68)
- Europe (1.00)
- North America > United States
- Louisiana (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas (0.14)
- Technology: