Building Markovian Generative Architectures over Pretrained LM Backbones for Efficient Task-Oriented Dialog Systems
Liu, Hong, Cai, Yucheng, Ou, Zhijian, Huang, Yi, Feng, Junlan
–arXiv.org Artificial Intelligence
The dialog Examples include GPT2-based SimpleTOD [14], SOLOIST state is often represented by a set of slot-value pairs that determine [15], AuGPT [16] and UBAR [17], and T5-based PPTOD [18] the user's requirement. Based on the tracked dialog state, and MTTOD [19], among others. A drawback of existing the system will query a task-related database (DB), decide an PLM-based methods, viewed from efficiencies in memory, action and generate a response. The methodology for building computation and learning, is that the whole history is used as TOD systems is gradually advancing from separate training the conditioning input at each turn. The dialog model thus of individual modules [1, 2, 3] to the end-to-end (E2E) trainable becomes non-Markov across turns, i.e., the generation at current approach [4, 5, 6, 7, 8, 9, 10]. In early E2E methods, the turn depends not only on the previous turn but also on all sequential turns of a dialog are usually modeled as a Markov previous turns, namely the whole dialog history. Some models process and realized over LSTM-based backbones.
arXiv.org Artificial Intelligence
Oct-13-2022