Alternating Recurrent Dialog Model with Large-scale Pre-trained Language Models
Wu, Qingyang, Zhang, Yichi, Li, Yu, Yu, Zhou
–arXiv.org Artificial Intelligence
Existing dialog system models require extensive human annotations and are difficult to generalize to different tasks. The recent success of large pre-trained language models such as BERT and GPT -2 (Devlin et al., 2019; Radford et al., 2019) have suggested the effectiveness of incorporating language priors in downstream NLP tasks. However, how much pre-trained language models can help dialog response generation is still under exploration. In this paper, we propose a simple, general, and effective framework: Alternating Recurrent Dialog Model (ARDM). ARDM models each speaker separately and takes advantage of the large pre-trained language model. It requires no supervision from human annotations such as belief states or dialog acts to achieve effective conversations. ARDM outperforms or is on par with state-of-the-art methods on two popular task-oriented dialog datasets: CamRest676 and MultiWOZ. Moreover, we can generalize ARDM to more challenging, non-collaborative tasks such as persuasion. In persuasion tasks, ARDM is capable of generating humanlike responses to persuade people to donate to a charity. It has been a longstanding ambition for artificial intelligence researchers to create an intelligent conversational agent that can generate humanlike responses. Recently data-driven dialog models are more and more popular. However, most current state-of-the-art approaches still rely heavily on extensive annotations such as belief states and dialog acts (Lei et al., 2018). However, dialog content can vary considerably in different dialog tasks. Having a different intent or dialog act annotation scheme for each task is costly. For some tasks, it is even impossible, such as open-domain social chat. Thus, it is difficult to utilize these methods on challenging dialog tasks, such as persuasion and negotiation, where dialog states and acts are difficult to annotate.
arXiv.org Artificial Intelligence
Oct-8-2019
- Country:
- Oceania > Australia
- Victoria > Melbourne (0.04)
- New South Wales > Sydney (0.04)
- North America > United States
- Hawaii (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.28)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Europe
- Asia
- Middle East > Syria (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Oceania > Australia
- Genre:
- Research Report > Promising Solution (0.54)
- Technology: