Learning to Memorize in Neural Task-Oriented Dialogue Systems

May-19-2019–arXiv.org Artificial Intelligence

In this thesis, we leverage the neural copy mechanism and memory-augmented neural networks (MANNs) to address existing challenge of neural task-oriented dialogue learning. We show the effectiveness of our strategy by achieving good performance in multi-domain dialogue state tracking, retrieval-based dialogue systems, and generation-based dialogue systems. We first propose a transferable dialogue state generator (TRADE) that leverages its copy mechanism to get rid of dialogue ontology and share knowledge between domains. We also evaluate unseen domain dialogue state tracking and show that TRADE enables zero-shot dialogue state tracking and can adapt to new few-shot domains without forgetting the previous domains. Second, we utilize MANNs to improve retrieval-based dialogue learning. They are able to capture dialogue sequential dependencies and memorize long-term information. We also propose a recorded delexicalization copy strategy to replace real entity values with ordered entity types. Our models are shown to surpass other retrieval baselines, especially when the conversation has a large number of turns. Lastly, we tackle generation-based dialogue learning with two proposed models, the memory-to-sequence (Mem2Seq) and global-to-local memory pointer network (GLMP). Mem2Seq is the first model to combine multi-hop memory attention with the idea of the copy mechanism. GLMP further introduces the concept of response sketching and double pointers copying. We show that GLMP achieves the state-of-the-art performance on human evaluation.

deep learning, information, neural network, (19 more...)

arXiv.org Artificial Intelligence

May-19-2019

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.28)
- North America
  - Canada (0.14)
  - United States
    - California (0.46)
    - Texas (0.14)
    - Pennsylvania (0.14)
- Europe
  - Spain (0.28)
  - Germany (0.14)

Genre:
- Research Report > New Finding (0.45)

Industry:
- Retail (1.00)
- Energy > Oil & Gas (0.68)
- Transportation > Ground
  - Road (0.68)
- Consumer Products & Services
  - Restaurants (1.00)
  - Food, Beverage, Tobacco & Cannabis (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (1.00)
  - Machine Learning
    - Neural Networks > Deep Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found