In-Context Learning for Few-Shot Dialogue State Tracking
Hu, Yushi, Lee, Chia-Hsuan, Xie, Tianbao, Yu, Tao, Smith, Noah A., Ostendorf, Mari
–arXiv.org Artificial Intelligence
Collecting and annotating task-oriented dialogues is time-consuming and costly; thus, zero and few shot learning could greatly benefit dialogue state tracking (DST). In this work, we propose an in-context learning (ICL) framework for zero-shot and few-shot learning DST, where a large pre-trained language model (LM) takes a test instance and a few exemplars as input, and directly decodes the dialogue state without any parameter updates. To better leverage a tabular domain description in the LM prompt, we reformulate DST into a text-to-SQL problem. We also propose a novel approach to retrieve annotated dialogues as exemplars. Empirical results on MultiWOZ show that our method IC-DST substantially outperforms previous fine-tuned state-of-the-art models in few-shot settings. In addition, we test IC-DST in zero-shot settings, in which the model only takes a fixed task instruction as input, finding that it outperforms previous zero-shot methods by a large margin.
arXiv.org Artificial Intelligence
Oct-25-2022
- Country:
- Asia
- China > Hong Kong (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom > England
- Leicestershire > Leicester (0.04)
- Belgium > Brussels-Capital Region
- North America
- Dominican Republic (0.04)
- United States > Washington
- King County > Seattle (0.04)
- Oceania > Australia
- Asia
- Genre:
- Research Report > Promising Solution (0.54)
- Industry:
- Consumer Products & Services > Restaurants (0.68)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language
- Chatbot (0.94)
- Discourse & Dialogue (1.00)
- Large Language Model (1.00)
- Information Technology > Artificial Intelligence