Building Markovian Generative Architectures over Pretrained LM Backbones for Efficient Task-Oriented Dialog Systems

Liu, Hong, Cai, Yucheng, Ou, Zhijian, Huang, Yi, Feng, Junlan

Oct-13-2022–arXiv.org Artificial Intelligence

The dialog Examples include GPT2-based SimpleTOD [14], SOLOIST state is often represented by a set of slot-value pairs that determine [15], AuGPT [16] and UBAR [17], and T5-based PPTOD [18] the user's requirement. Based on the tracked dialog state, and MTTOD [19], among others. A drawback of existing the system will query a task-related database (DB), decide an PLM-based methods, viewed from efficiencies in memory, action and generate a response. The methodology for building computation and learning, is that the whole history is used as TOD systems is gradually advancing from separate training the conditioning input at each turn. The dialog model thus of individual modules [1, 2, 3] to the end-to-end (E2E) trainable becomes non-Markov across turns, i.e., the generation at current approach [4, 5, 6, 7, 8, 9, 10]. In early E2E methods, the turn depends not only on the previous turn but also on all sequential turns of a dialog are usually modeled as a Markov previous turns, namely the whole dialog history. Some models process and realized over LSTM-based backbones.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

Oct-13-2022

arXiv.org PDF

Add feedback

Country:
- Asia
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - China > Beijing
    - Beijing (0.05)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.33)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found