DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Zeng, Weihao, Fu, Dayuan, He, Keqing, Wang, Yejie, Xu, Yukai, Xu, Weiran
–arXiv.org Artificial Intelligence
Language models pre-trained on general text have achieved impressive results in diverse fields. Yet, the distinct linguistic characteristics of task-oriented dialogues (TOD) compared to general text limit the practical utility of existing language models. Current task-oriented dialogue pre-training methods overlook the one-to-many property of conversations, where multiple responses can be appropriate given the same conversation context. In this paper, we propose a novel dialogue pre-training model called DivTOD, which collaborates with LLMs to learn diverse task-oriented dialogue representations. DivTOD guides LLMs in transferring diverse knowledge to smaller models while removing domain knowledge that contradicts task-oriented dialogues. Experiments show that our model outperforms strong TOD baselines on various downstream dialogue tasks and learns the intrinsic diversity of task-oriented dialogues.
arXiv.org Artificial Intelligence
Mar-31-2024
- Country:
- Asia > China (0.28)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report (0.82)
- Industry:
- Education (0.68)
- Technology: