Kloetzer, Julien
Semi-Distantly Supervised Neural Model for Generating Compact Answers to Open-Domain Why Questions
Ishida, Ryo (National Institute of Information and Communications Technology) | Torisawa, Kentaro (National Institute of Information and Communications Technology) | Oh, Jong-Hoon (National Institute of Information and Communications Technology) | Iida, Ryu (National Institute of Information and Communications Technology) | Kruengkrai, Canasai (National Institute of Information and Communications Technology) | Kloetzer, Julien (National Institute of Information and Communications Technology)
This paper proposes a neural network-based method for generating compact answers to open-domain why-questions (e.g., "Why was Mr. Trump elected as the president of the US?"). Unlike factoid question answering methods that provide short text spans as answers, existing work for why-question answering have aimed at answering questions by retrieving relatively long text passages, each of which often consists of several sentences, from a text archive. While the actual answer to a why-question may be expressed over several consecutive sentences, these often contain redundant and/or unrelated parts. Such answers would not be suitable for spoken dialog systems and smart speakers such as Amazon Echo, which receive much attention in these days. In this work, we aim at generating non-redundant compact answers to why-questions from answer passages retrieved from a very large web data corpora (4 billion web pages) by an already existing open-domain why-question answering system, using a novel neural network obtained by extending existing summarization methods. We also automatically generate training data using a large number of causal relations automatically extracted from 4 billion web pages by an existing supervised causality recognizer. The data is used to train our neural network, together with manually created training data. Through a series of experiments, we show that both our novel neural network and auto-generated training data improve the quality of the generated answers both in ROUGE score and in a subjective evaluation.
Improving Event Causality Recognition with Multiple Background Knowledge Sources Using Multi-Column Convolutional Neural Networks
Kruengkrai, Canasai (National Institute of Information and Communications Technology) | Torisawa, Kentaro (National Institute of Information and Communications Technology) | Hashimoto, Chikara (National Institute of Information and Communications Technology) | Kloetzer, Julien (National Institute of Information and Communications Technology) | Oh, Jong-Hoon (National Institute of Information and Communications Technology) | Tanaka, Masahiro (National Institute of Information and Communications Technology)
We propose a method for recognizing such event causalities as "smoke cigarettes" → "die of lung cancer" using background knowledge taken from web texts as well as original sentences from which candidates for the causalities were extracted. We retrieve texts related to our event causality candidates from four billion web pages by three distinct methods, including a why-question answering system, and feed them to our multi-column convolutional neural networks. This allows us to identify the useful background knowledge scattered in web texts and effectively exploit the identified knowledge to recognize event causalities. We empirically show that the combination of our neural network architecture and background knowledge significantly improves average precision, while the previous state-of-the-art method gains just a small benefit from such background knowledge.
A Semi-Supervised Learning Approach to Why-Question Answering
Oh, Jong-Hoon (National Institute of Information and Communications Technology) | Torisawa, Kentaro (National Institute of Information and Communications Technology) | Hashimoto, Chikara (National Institute of Information and Communications Technology) | Iida, Ryu (National Institute of Information and Communications Technology) | Tanaka, Masahiro (National Institute of Information and Communications Technology) | Kloetzer, Julien (National Institute of Information and Communications Technology)
We propose a semi-supervised learning method for improving why-question answering (why-QA). The key of our method is to generate training data (question-answer pairs) from causal relations in texts such as "[Tsunamis are generated]( effect ) because [the ocean's water mass is displaced by an earthquake]( cause )." A naive method for the generation would be to make a question-answer pair by simply converting the effect part of the causal relations into a why-question, like "Why are tsunamis generated?" from the above example, and using the source text of the causal relations as an answer. However, in our preliminary experiments, this naive method actually failed to improve the why-QA performance. The main reason was that the machine-generated questions were often incomprehensible like "Why does (it) happen?", and that the system suffered from overfitting to the results of our automatic causality recognizer. Hence, we developed a novel method that effectively filters out incomprehensible questions and retrieves from texts answers that are likely to be paraphrases of a given causal relation. Through a series of experiments, we showed that our approach significantly improved the precision of the top answer by 8% over the current state-of-the-art system for Japanese why-QA.