Hierarchical Recurrent Attention Network for Response Generation

Xing, Chen (College of Computer and Control Engineering, College of Software, Nankai University, Tianjin) | Wu, Yu (Beihang University, Beijing) | Wu, Wei (Microsoft Research) | Huang, Yalou (College of Computer and Control Engineering, College of Software, Nankai University, Tianjin) | Zhou, Ming (Microsoft Research)

AAAI Conferences 

We study multi-turn response generation in chatbots, where a response is generated according to a conversation context. Existing work has modeled the hierarchy of the context but does not pay enough attention to the fact that words and utterances in the context differ in importance. As a result, existing models may lose important information in the context and generate irrelevant responses. We propose a hierarchical recurrent attention network (HRAN) to model both the hierarchy and the importance variance in a unified framework. In HRAN, a hierarchical attention mechanism attends to important parts within and among utterances with word-level attention and utterance-level attention, respectively. Empirical studies with both automatic evaluation and human judgment show that HRAN can significantly outperform state-of-the-art models for context-based response generation.
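The two-level attention described above can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes plain dot-product scoring against a single query vector and omits the recurrent encoders and decoder entirely. The names `attend`, `hierarchical_attention`, `context`, and `query` are hypothetical and chosen only for this illustration.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attend(vectors, query):
    # Assumption: dot-product scoring; the paper's scoring function may differ.
    weights = softmax([dot(v, query) for v in vectors])
    dim = len(vectors[0])
    pooled = [sum(w * v[i] for w, v in zip(weights, vectors)) for i in range(dim)]
    return pooled, weights

def hierarchical_attention(context, query):
    # context: list of utterances, each a list of word vectors.
    utterance_vectors, word_weights = [], []
    for utterance in context:
        # Word-level attention: pool the words of one utterance.
        vec, w = attend(utterance, query)
        utterance_vectors.append(vec)
        word_weights.append(w)
    # Utterance-level attention: pool the utterance vectors into one context vector.
    context_vector, utterance_weights = attend(utterance_vectors, query)
    return context_vector, utterance_weights, word_weights

# Toy input: two utterances with 2-dimensional word vectors.
context = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 1.0], [0.0, 0.0]],
]
query = [1.0, 0.0]
context_vector, utterance_weights, word_weights = hierarchical_attention(context, query)
```

In this sketch the word weights within each utterance and the utterance weights across the context each sum to one, so the context vector is a weighted average in which more relevant words and utterances contribute more.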
