Dolan, Bill
Generating Informative and Diverse Conversational Responses via Adversarial Information Maximization
Zhang, Yizhe, Galley, Michel, Gao, Jianfeng, Gan, Zhe, Li, Xiujun, Brockett, Chris, Dolan, Bill
Responses generated by neural conversational models tend to lack informativeness and diversity. We present Adversarial Information Maximization (AIM), an adversarial learning strategy that addresses these two related but distinct problems. To foster response diversity, we leverage adversarial training that allows distributional matching of synthetic and real responses. To improve informativeness, our framework explicitly optimizes a variational lower bound on pairwise mutual information between query and response. Empirical results from automatic and human evaluations demonstrate that our methods significantly boost informativeness and diversity.
A Knowledge-Grounded Neural Conversation Model
Ghazvininejad, Marjan (Information Sciences Institute, USC) | Brockett, Chris (Microsoft) | Chang, Ming-Wei (Microsoft) | Dolan, Bill (Microsoft) | Gao, Jianfeng (Microsoft) | Yih, Wen-tau (Microsoft) | Galley, Michel (Microsoft)
Neural network models are capable of generating extremely natural sounding conversational interactions. However, these models have been mostly applied to casual scenarios (e.g., as “chatbots”) and have yet to demonstrate they can serve in more useful conversational applications. This paper presents a novel, fully data-driven, and knowledge-grounded neural conversation model aimed at producing more contentful responses. We generalize the widely-used Sequence-to-Sequence (Seq2Seq) approach by conditioning responses on both conversation history and external “facts”, allowing the model to be versatile and applicable in an open-domain setting. Our approach yields significant improvements over a competitive Seq2Seq baseline. Human judges found that our outputs are significantly more informative.
Microsummarization of Online Reviews: An Experimental Study
Mason, Rebecca (Google, Inc.) | Gaska, Benjamin (University of Arizona) | Durme, Benjamin Van (Johns Hopkins University) | Choudhury, Pallavi (Microsoft Research) | Hart, Ted (Microsoft Research) | Dolan, Bill (Microsoft Research) | Toutanova, Kristina (Microsoft Research) | Mitchell, Margaret (Microsoft Research)
Mobile and location-based social media applications provide platforms for users to share brief opinions about products, venues, and services. These quickly typed opinions, or microreviews, are a valuable source of current sentiment on a wide variety of subjects. However, there is currently little research on how to mine this information to present it back to users in easily consumable way. In this paper, we introduce the task of microsummarization, which combines sentiment analysis, summarization, and entity recognition in order to surface key content to users. We explore unsupervised and supervised methods for this task, and find we can reliably extract relevant entities and the sentiment targeted towards them using crowdsourced labels as supervision. In an end-to-end evaluation, we find our best-performing system is vastly preferred by judges over a traditional extractive summarization approach. This work motivates an entirely new approach to summarization, incorporating both sentiment analysis and item extraction for modernized, at-a-glance presentation of public opinion.