A Knowledge-Grounded Multimodal Search-Based Conversational Agent

Agarwal, Shubham, Dusek, Ondrej, Konstas, Ioannis, Rieser, Verena

Oct-20-2018–arXiv.org Artificial Intelligence

Multimodal search-based dialogue is a challenging new task: It extends visually grounded question answering systems into multi-turn conversations with access to an external database. We address this new challenge by learning a neural response generation system from the recently released Multimodal Dialogue (MMD) dataset (Saha et al., 2017). We introduce a knowledge-grounded multimodal conversational model where an encoded knowledge base (KB) representation is appended to the decoder input. Our model substantially outperforms strong baselines in terms of text-based similarity measures (over 9 BLEU points, 3 of which are solely due to the use of additional information from the KB.

machine learning, natural language, proc, (18 more...)

arXiv.org Artificial Intelligence

Oct-20-2018

arXiv.org PDF

Add feedback

Country:
- North America > Canada (0.14)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Discourse & Dialogue (0.93)
  - Representation & Reasoning > Search (0.60)
  - Machine Learning > Neural Networks
    - Deep Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found