Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues

Deng, Wentao, Pei, Jiahuan, Ren, Zhaochun, Chen, Zhumin, Ren, Pengjie

Jul-13-2023–arXiv.org Artificial Intelligence

Answer selection in open-domain dialogues aims to select an accurate answer from candidates. Recent success of answer selection models hinges on training with large amounts of labeled data. However, collecting large-scale labeled data is labor-intensive and time-consuming. In this paper, we introduce the predicted intent labels to calibrate answer labels in a self-training paradigm. Specifically, we propose the intent-calibrated self-training (ICAST) to improve the quality of pseudo answer labels through the intent-calibrated answer selection paradigm, in which we employ pseudo intent labels to help improve pseudo answer labels. We carry out extensive experiments on two benchmark datasets with open-domain dialogues. The experimental results show that ICAST outperforms baselines consistently with 1%, 5% and 10% labeled data. Specifically, it improves 2.06% and 1.00% of F1 score on the two datasets, compared with the strongest baseline with only 5% labeled data.

artificial intelligence, machine learning, selection, (17 more...)

arXiv.org Artificial Intelligence

Jul-13-2023

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts (0.04)
- Europe > Netherlands
  - North Holland > Amsterdam (0.04)
- Asia
  - Nepal (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - China > Shandong Province
    - Qingdao (0.04)

Genre:
- Research Report > New Finding (0.48)

Industry:
- Education (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found