DFM: Dialogue Foundation Model for Universal Large-Scale Dialogue-Oriented Task Learning
Chen, Zhi, Bao, Jijia, Chen, Lu, Liu, Yuncong, Ma, Da, Chen, Bei, Wu, Mengyue, Zhu, Su, Dong, Xin, Ge, Fujiang, Miao, Qingliang, Lou, Jian-Guang, Yu, Kai
–arXiv.org Artificial Intelligence
Building a universal conversational agent has been a long-standing goal of the dialogue research community. Most previous works only focus on a small set of dialogue tasks. In this work, we aim to build a unified dialogue foundation model (DFM) which can be used to solve massive diverse dialogue tasks. To achieve this goal, a large-scale well-annotated dialogue dataset with rich task diversity (DialogZoo) is collected. We introduce a framework to unify all dialogue tasks and propose novel auxiliary self-supervised tasks to achieve stable training of DFM on the highly diverse large scale DialogZoo corpus. Experiments show that, compared with models of the same size, DFM can achieve state-of-the-art or competitive performance on very rich cross-domain downstream dialogue tasks. This demonstrates that DFM largely extends the ability of unified dialogue pre-trained model.
arXiv.org Artificial Intelligence
Oct-9-2022
- Country:
- North America > United States
- Louisiana (0.04)
- Europe
- Holy See > Vatican City (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Asia
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Leisure & Entertainment (0.46)
- Health & Medicine (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning (1.00)
- Machine Learning (1.00)
- Cognitive Science (0.93)
- Natural Language
- Discourse & Dialogue (1.00)
- Chatbot (0.66)
- Information Technology > Artificial Intelligence