Weakly Supervised Data Augmentation Through Prompting for Dialogue Understanding

Chen, Maximillian, Papangelis, Alexandros, Tao, Chenyang, Rosenbaum, Andy, Kim, Seokhwan, Liu, Yang, Yu, Zhou, Hakkani-Tur, Dilek

Nov-2-2022–arXiv.org Artificial Intelligence

Dialogue understanding tasks often necessitate abundant annotated data to achieve good performance and that presents challenges in low-resource settings. To alleviate this barrier, we explore few-shot data augmentation for dialogue understanding by prompting large pre-trained language models and present a novel approach that iterates on augmentation quality by applying weakly-supervised filters. We evaluate our methods on the emotion and act classification tasks in DailyDialog and the intent classification task in Facebook Multilingual Task-Oriented Dialogue. Models fine-tuned on our augmented data mixed with few-shot ground truth data are able to approach or surpass existing state-of-the-art performance on both datasets. For DailyDialog specifically, using 10% of the ground truth data we outperform the current state-of-the-art model which uses 100% of the data.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Nov-2-2022

arXiv.org PDF

Add feedback

Country:
- North America > Dominican Republic (0.04)
- Asia
  - Singapore (0.04)
  - China > Hong Kong (0.04)

Genre:
- Research Report > Promising Solution (0.54)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (0.95)
    - Chatbot (0.70)
  - Machine Learning > Neural Networks
    - Deep Learning (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found