Question Answering
AntNet: Deep Answer Understanding Network for Natural Reverse QA
Yang, Lei, Yin, Qing, Hou, Linlin, Gui, Jie, Wu, Ou, Kwok, James
--This study refers to a reverse question answering (reverse QA) procedure, in which machines proactively raise questions and humans supply answers. This procedure exists in many real human-machine interaction applications. A crucial problem in human-machine interaction is answer understanding. Existing solutions rely on mandatory option term selection to avoid automatic answer understanding. However, these solutions lead to unnatural human-computer interaction and harm user experience. T o this end, this study proposed a novel deep answer understanding network, called AntNet, for reverse QA. The network consists of three new modules, namely, skeleton extraction for questions, relevance-aware representation of answers, and multi-hop based fusion. As answer understanding for reverse QA has not been explored, a new data corpus is compiled in this study. Experimental results indicate that our proposed network is significantly better than existing methods and those modified from classical natural language processing (NLP) deep models. The effectiveness of the three new modules is also verified. UTOMA TIC question answering (QA) is a crucial component in many human-machine interaction systems, such as intelligent customer service, as it can provide a natural way for humans to acquire information [1]. Therefore, QA has received increasing attention in academic research and industry communities in recent years [2]. Questions are solely raised by humans, and answers are then returned by machines in the conventional QA scenario. How to select the best matched answer is the key problem in this setting [3].
PIQA: Reasoning about Physical Commonsense in Natural Language
Bisk, Yonatan, Zellers, Rowan, Bras, Ronan Le, Gao, Jianfeng, Choi, Yejin
To apply eyeshadow without a brush, should I use a cotton swab or a toothpick? Questions requiring this kind of physical commonsense pose a challenge to today's natural language understanding systems. While recent pretrained models (such as BERT) have made progress on question answering over more abstract domains - such as news articles and encyclopedia entries, where text is plentiful - in more physical domains, text is inherently limited due to reporting bias. Can AI systems learn to reliably answer physical common-sense questions without experiencing the physical world? In this paper, we introduce the task of physical commonsense reasoning and a corresponding benchmark dataset Physical Interaction: Question Answering or PIQA. Though humans find the dataset easy (95% accuracy), large pretrained models struggle (77%). We provide analysis about the dimensions of knowledge that existing models lack, which offers significant opportunities for future research.
SemEval-2015 Task 3: Answer Selection in Community Question Answering
Nakov, Preslav, Mร rquez, Lluรญs, Magdy, Walid, Moschitti, Alessandro, Glass, James, Randeree, Bilal
Community Question Answering (cQA) provides new interesting research directions to the traditional Question Answering (QA) field, e.g., the exploitation of the interaction between users and the structure of related posts. In this context, we organized SemEval-2015 Task 3 on "Answer Selection in cQA", which included two subtasks: (a) classifying answers as "good", "bad", or "potentially relevant" with respect to the question, and (b) answering a YES/NO question with "yes", "no", or "unsure", based on the list of all answers. We set subtask A for Arabic and English on two relatively different cQA domains, i.e., the Qatar Living website for English, and a Quran-related website for Arabic. We used crowdsourcing on Amazon Mechanical Turk to label a large English training dataset, which we released to the research community. Thirteen teams participated in the challenge with a total of 61 submissions: 24 primary and 37 contrastive. The best systems achieved an official score (macro-averaged F1) of 57.19 and 63.7 for the English subtasks A and B, and 78.55 for the Arabic subtask A.
Conclusion-Supplement Answer Generation for Non-Factoid Questions
Nakatsuji, Makoto, Okui, Sohei
This paper tackles the goal of conclusion-supplement answer generation for non-factoid questions, which is a critical issue in the field of Natural Language Processing (NLP) and Artificial Intelligence (AI), as users often require supplementary information before accepting a conclusion. The current encoder-decoder framework, however, has difficulty generating such answers, since it may become confused when it tries to learn several different long answers to the same non-factoid question. Our solution, called an ensemble network, goes beyond single short sentences and fuses logically connected conclusion statements and supplementary statements. It extracts the context from the conclusion decoder's output sequence and uses it to create supplementary decoder states on the basis of an attention mechanism. It also assesses the closeness of the question encoder's output sequence and the separate outputs of the conclusion and supplement decoders as well as their combination. As a result, it generates answers that match the questions and have natural-sounding supplementary sequences in line with the context expressed by the conclusion sequence. Evaluations conducted on datasets including "Love Advice" and "Arts & Humanities" categories indicate that our model outputs much more accurate results than the tested baseline models do.
Visual Dialogue State Tracking for Question Generation
GuessWhat?! is a visual dialogue task between a guesser and an oracle. The guesser aims to locate an object supposed by the oracle oneself in an image by asking a sequence of Yes/No questions. Asking proper questions with the progress of dialogue is vital for achieving successful final guess. As a result, the progress of dialogue should be properly represented and tracked. Previous models for question generation pay less attention on the representation and tracking of dialogue states, and therefore are prone to asking low quality questions such as repeated questions. This paper proposes visual dialogue state tracking (VDST) based method for question generation. A visual dialogue state is defined as the distribution on objects in the image as well as representations of objects. Representations of objects are updated with the change of the distribution on objects. An object-difference based attention is used to decode new question. The distribution on objects is updated by comparing the question-answer pair and objects. Experimental results on GuessWhat?! dataset show that our model significantly outperforms existing methods and achieves new state-of-the-art performance. It is also noticeable that our model reduces the rate of repeated questions from more than 50% to 21.9% compared with previous state-of-the-art methods.
Schemaless Queries over Document Tables with Dependencies
Canim, Mustafa, Cornelio, Cristina, Iyengar, Arun, Musa, Ryan, Muro, Mariano Rodrigez
Unstructured enterprise data such as reports, manuals and guidelines often contain tables. The traditional way of integrating data from these tables is through a two-step process of table detection/extraction and mapping the table layouts to an appropriate schema. This can be an expensive process. In this paper we show that by using semantic technologies (RDF/SPARQL and database dependencies) paired with a simple but powerful way to transform tables with non-relational layouts, it is possible to offer query answering services over these tables with minimal manual work or domain-specific mappings. Our method enables users to exploit data in tables embedded in documents with little effort, not only for simple retrieval queries, but also for structured queries that require joining multiple interrelated tables.
MIT-IBM Watson AI Lab Releases Groundbreaking Research on AI and the Future of Work - Liwaiwai
IBM believes 100% of jobs will eventually change due to artificial intelligence, and new empirical research released last October 30 from the MIT-IBM Watson AI Lab reveals how. The research, The Future of Work: How New Technologies Are Transforming Tasks, used advanced machine learning techniques to analyze 170 million online job postings in the United States between 2010 and 2017. It shows, in the early stages of AI adoption, how tasks of individual jobs are transforming and the impact on employment and wages. "As new technologies continue to scale within businesses and across industries, it is our responsibility as innovators to understand not only the business process implications, but also the societal impact," said Martin Fleming, vice president and chief economist of IBM. "To that end, this empirical research from the MIT-IBM Watson AI Lab sheds new light on how tasks are reorganizing between people and machines as a result of AI and new technologies."
Global Thread-Level Inference for Comment Classification in Community Question Answering
Joty, Shafiq, Barrรณn-Cedeรฑo, Alberto, Martino, Giovanni Da San, Filice, Simone, Mร rquez, Lluรญs, Moschitti, Alessandro, Nakov, Preslav
Community question answering, a recent evolution of question answering in the Web context, allows a user to quickly consult the opinion of a number of people on a particular topic, thus taking advantage of the wisdom of the crowd. Here we try to help the user by deciding automatically which answers are good and which are bad for a given question. In particular, we focus on exploiting the output structure at the thread level in order to make more consistent global decisions. More specifically, we exploit the relations between pairs of comments at any distance in the thread, which we incorporate in a graph-cut and in an ILP frameworks. We evaluated our approach on the benchmark dataset of SemEval-2015 Task 3. Results improved over the state of the art, confirming the importance of using thread level information.
Scientists used artificial intelligence to discover a 2,000 year-old stick figure in Peru's mysterious Nazca Lines
Artificial intelligence has helped archaeologists uncover an ancient lost work of art. The Nazca Lines in Peru are ancient geoglyphs, images carved into the landscape. First formally studied in 1926, they depict people, animals, plants, and geometric shapes. The formations vary in size, with some of the biggest running up to 30 miles long. Their exact purpose is unknown, although some archaeologists think they may have had religious or spiritual significance.
Unsupervised Natural Question Answering with a Small Model
Andrews, Martin, Witteveen, Sam
The recent (2019-02) demonstration of the power of huge language models such as GPT-2 to memorise the answers to factoid questions raises questions about the extent to which knowledge is being embedded directly within these large models. This short paper describes an architecture through which much smaller models can also answer such questions - by making use of 'raw' external knowledge. The contribution of this work is that the methods presented here rely on unsupervised learning techniques, complementing the unsupervised training of the Language Model. The goal of this line of research is to be able to add knowledge explicitly, without extensive training.