Better Correlation and Robustness: A Distribution-Balanced Self-Supervised Learning Framework for Automatic Dialogue Evaluation

May-30-2025, 21:11:09 GMT–Neural Information Processing Systems

Turn-level dialogue evaluation models (TDEMs) built on the self-supervised learning (SSL) framework have achieved state-of-the-art performance in open-domain dialogue evaluation. However, these models face two potential problems. First, they correlate poorly with human judgments on medium-coherence samples, because the SSL framework often yields training data with an unbalanced coherence distribution. Second, the SSL framework leads TDEMs to produce a nonuniform score distribution. Our theoretical analysis indicates that this nonuniform score distribution can weaken the robustness of TDEMs.

computational linguistic, machine learning, natural language

Country:
- Asia (0.68)
- Europe (1.00)
- North America > United States
  - Louisiana (0.14)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Texas (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Inductive Learning (0.70)
    - Neural Networks (1.00)
  - Natural Language > Discourse & Dialogue (0.67)