UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking
Li, Chuang, Zhang, Yan, Kan, Min-Yen, Li, Haizhou
–arXiv.org Artificial Intelligence
Previous zero-shot dialogue state tracking (DST) methods only apply transfer learning, but ignore unlabelled data in the target domain. We transform zero-shot DST into few-shot DST by utilising such unlabelled data via joint and self-training methods. Our method incorporates auxiliary tasks that generate slot types as inverse prompts for main tasks, creating slot values during joint training. Cycle consistency between these two tasks enables the generation and selection of quality samples in unknown target domains for subsequent fine-tuning. This approach also facilitates automatic label creation, thereby optimizing the training and fine-tuning of DST models. We demonstrate this method's effectiveness on large language models in zero-shot scenarios, improving average joint goal accuracy by $8\%$ across all domains in MultiWOZ.
arXiv.org Artificial Intelligence
Oct-16-2023
- Country:
- Asia
- China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Middle East > UAE (0.04)
- Singapore (0.04)
- China
- Europe
- North America
- Canada > British Columbia
- Dominican Republic (0.04)
- United States
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York > New York County
- New York City (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Technology: