AITopics | Question Answering

Collaborating Authors

Question Answering

"Questions are asked and answered every day. Question answering (QA) technology aims to deliver the same facility online. It goes further than the more familiar search based on keywords (as in Google, Yahoo, and other search engines), in attempting to recognize what a question expresses and to respond with an actual answer. This simplifies things for users in two ways. First, questions do not often translate into a simple list of keywords. ...Second, QA takes responsibility for providing answers, rather than a searchable list of links to potentially relevant documents (web pages), highlighted by snippets of text that show how the query matched the documents."
– from Bonnie Webber & Nick Webb. Question Answering. In The Handbook of Computational Linguistics and Natural Language Processing. Alexander Clark, Chris Fox, Shalom Lappin (Eds.). Wiley, 2010.

News Overviews Instructional Materials AI-Alerts Classics

Knowledge Graphs and Knowledge Networks: The Story in Brief

Sheth, Amit, Padhee, Swati, Gyrard, Amelie

arXiv.org Artificial IntelligenceMar-7-2020

Knowledge Graphs (KGs) represent real-world noisy raw information in a structured form, capturing relationships between entities. However, for dynamic real-world applications such as social networks, recommender systems, computational biology, relational knowledge representation has emerged as a challenging research problem where there is a need to represent the changing nodes, attributes, and edges over time. The evolution of search engine responses to user queries in the last few years is partly because of the role of KGs such as Google KG. KGs are significantly contributing to various AI applications from link prediction, entity relations prediction, node classification to recommendation and question answering systems. This article is an attempt to summarize the journey of KG for AI.

application, knowledge, knowledge graph, (14 more...)

arXiv.org Artificial Intelligence

2003.03623

Country:

North America > United States > South Carolina (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.50)

Industry:

Health & Medicine (1.00)
Government (0.94)
Information Technology > Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.74)
(2 more...)

Add feedback

Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.0

Hulburd, Eric

arXiv.org Machine LearningMar-3-2020

In this paper we explore the parameter efficiency of BERT arXiv:1810.04805 on version 2.0 of the Stanford Question Answering dataset (SQuAD2.0). We evaluate the parameter efficiency of BERT while freezing a varying number of final transformer layers as well as including the adapter layers proposed in arXiv:1902.00751. Additionally, we experiment with the use of context-aware convolutional (CACNN) filters, as described in arXiv:1709.08294v3, as a final augmentation layer for the SQuAD2.0 tasks. This exploration is motivated in part by arXiv:1907.10597, which made a compelling case for broadening the evaluation criteria of artificial intelligence models to include various measures of resource efficiency. While we do not evaluate these models based on their floating point operation efficiency as proposed in arXiv:1907.10597, we examine efficiency with respect to training time, inference time, and total number of model parameters. Our results largely corroborate those of arXiv:1902.00751 for adapter modules, while also demonstrating that gains in F1 score from adding context-aware convolutional filters are not practical due to the increase in training and inference time.

efficiency, module, squad2, (13 more...)

arXiv.org Machine Learning

2002.1067

Country:

North America > Cuba (0.04)
Europe > United Kingdom (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
(3 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

IBM Watson: how AI is transforming the supply chain

#artificialintelligenceMar-1-2020, 08:37:03 GMT

The supply chain industry is in a state of transition and transformation. New technology such as AI, Big Data and machine learning is making life easier for industry executives as an ever-increasing number of companies begin to digitise their offerings. In order to stay ahead in a dynamic and continuously evolving industry, businesses must trial technology to increase efficiency. The technology giants, IBM Watson, understands the challenge that supply chains face. The company has announced Watson Supply Chain Insights, an AI-based solution that enables supply chain professionals to get through a data overload for enhanced visibility throughout the entire supply chain.

ibm watson, supply chain, watson supply chain insight, (4 more...)

#artificialintelligence

Industry: Information Technology (0.78)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning > Case Based Reasoning (0.69)

Add feedback

A Study on Multimodal and Interactive Explanations for Visual Question Answering

Alipour, Kamran, Schulze, Jurgen P., Yao, Yi, Ziskind, Avi, Burachas, Giedrius

arXiv.org Artificial IntelligenceMar-1-2020

Explainability and interpretability of AI models is an essential factor affecting the safety of AI. While various explainable AI (XAI) approaches aim at mitigating the lack of transparency in deep networks, the evidence of the effectiveness of these approaches in improving usability, trust, and understanding of AI systems are still missing. We evaluate multimodal explanations in the setting of a Visual Question Answering (VQA) task, by asking users to predict the response accuracy of a VQA agent with and without explanations. We use between-subjects and within-subjects experiments to probe explanation effectiveness in terms of improving user prediction accuracy, confidence, and reliance, among other factors. The results indicate that the explanations help improve human prediction accuracy, especially in trials when the VQA system's answer is inaccurate. Furthermore, we introduce active attention, a novel method for evaluating causal attentional effects through intervention by editing attention maps. User explanation ratings are strongly correlated with human prediction accuracy and suggest the efficacy of these explanations in human-machine AI collaboration tasks.

accuracy, explanation, prediction, (16 more...)

arXiv.org Artificial Intelligence

2003.00431

Country:

North America > United States > New Jersey > Mercer County > Princeton (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
(3 more...)

Genre: Research Report > New Finding (0.93)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.65)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)

Add feedback

Do Multi-Hop Question Answering Systems Know How to Answer the Single-Hop Sub-Questions?

Tang, Yixuan, Ng, Hwee Tou, Tung, Anthony K. H.

arXiv.org Artificial IntelligenceFeb-23-2020

Multi-hop question answering (QA) requires a model to retrieve and integrate information from different parts of a long text to answer a question. Humans answer this kind of complex questions via a divide-and-conquer approach. In this paper, we investigate whether top-performing models for multi-hop questions understand the underlying sub-questions like humans. We adopt a neural decomposition model to generate sub-questions for a multi-hop complex question, followed by extracting the corresponding sub-answers. We show that multiple state-of-the-art multi-hop QA models fail to correctly answer a large portion of sub-questions, although their corresponding multi-hop questions are correctly answered. This indicates that these models manage to answer the multi-hop questions using some partial clues, instead of truly understanding the reasoning paths. We also propose a new model which significantly improves the performance on answering the sub-questions. Our work takes a step forward towards building a more explainable multi-hop QA system.

dataset, multi-hop question, paragraph, (15 more...)

arXiv.org Artificial Intelligence

2002.09919

Country:

North America > United States > New York (0.04)
Asia > Singapore (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)

Add feedback

Unsupervised Question Decomposition for Question Answering

Perez, Ethan, Lewis, Patrick, Yih, Wen-tau, Cho, Kyunghyun, Kiela, Douwe

arXiv.org Artificial IntelligenceFeb-22-2020

We aim to improve question answering (QA) by decomposing hard questions into easier sub-questions that existing QA systems can answer. Since collecting labeled decompositions is cumbersome, we propose an unsupervised approach to produce sub-questions. Specifically, by leveraging >10M questions from Common Crawl, we learn to map from the distribution of multi-hop questions to the distribution of single-hop sub-questions. We answer sub-questions with an off-the-shelf QA model and incorporate the resulting answers in a downstream, multi-hop QA system. On a popular multi-hop QA dataset, HotpotQA, we show large improvements over a strong baseline, especially on adversarial and out-of-domain questions. Our method is generally applicable and automatically learns to decompose questions of different classes, while matching the performance of decomposition methods that rely heavily on hand-engineering and annotation.

decomposition, qa model, unsupervised question decomposition, (13 more...)

arXiv.org Artificial Intelligence

2002.09758

Country:

Europe > Ireland (0.29)
Asia > India (0.05)
Europe > Spain > Canary Islands > Tenerife (0.05)
(17 more...)

Genre: Research Report (0.64)

Industry:

Media > Music (1.00)
Leisure & Entertainment (1.00)
Government > Regional Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Predicting drug–protein interaction using quasi-visual question answering system

#artificialintelligenceFeb-21-2020, 15:48:08 GMT

Identifying novel drug–protein interactions is crucial for drug discovery. For this purpose, many machine learning-based methods have been developed based on drug descriptors and one-dimensional protein sequences. However, protein sequences cannot accurately reflect the interactions in three-dimensional space. However, direct input of three-dimensional structure is of low efficiency due to the sparse three-dimensional matrix, and is also prevented by the limited number of co-crystal structures available for training. Here we propose an end-to-end deep learning framework to predict the interactions by representing proteins with a two-dimensional distance map from monomer structures (Image) and drugs with molecular linear notation (String), following the visual question answering mode. For efficient training of the system, we introduce a dynamic attentive convolutional neural network to learn fixed-size representations from the variable-length distance maps and a self-attentional sequential model to automatically extract semantic features from the linear notations.

machine learning, natural language, question answering, (7 more...)

#artificialintelligence

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Training Question Answering Models From Synthetic Data

Puri, Raul, Spring, Ryan, Patwary, Mostofa, Shoeybi, Mohammad, Catanzaro, Bryan

arXiv.org Artificial IntelligenceFeb-21-2020

Question and answer generation is a data augmentation method that aims to improve question answering (QA) models given the limited amount of human labeled data. However, a considerable gap remains between synthetic and human-generated question-answer pairs. This work aims to narrow this gap by taking advantage of large language models and explores several factors such as model size, quality of pretrained models, scale of data synthesized, and algorithmic choices. On the SQuAD1.1 question answering task, we achieve higher accuracy using solely synthetic questions and answers than when using the SQuAD1.1 training set questions alone. Removing access to real Wikipedia data, we synthesize questions and answers from a synthetic corpus generated by an 8.3 billion parameter GPT-2 model. With no access to human supervision and only access to other models, we are able to train state of the art question answering networks on entirely model-generated data that achieve 88.4 Exact Match (EM) and 93.9 F1 score on the SQuAD1.1 dev set. We further apply our methodology to SQuAD2.0 and show a 2.8 absolute gain on EM score compared to prior work using synthetic data.

filtration, question generation, squad1, (14 more...)

arXiv.org Artificial Intelligence

2002.09599

Country:

North America > United States > Texas > Culberson County > Van Horn (0.14)
Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.05)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.05)
(12 more...)

Genre: Research Report (0.82)

Industry:

Media > Music (0.93)
Health & Medicine (0.68)
Leisure & Entertainment > Sports > Soccer (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Contact FirstWord - Medtech Leaders: IBM Watson Health's Mark O'Herlihy on the true impact of AI in healthcare

#artificialintelligenceFeb-19-2020, 17:25:11 GMT

Get unlimited MedTech PLUS subscriptions for one low fixed rate with a FirstWord MedTech country license.

firstword dossier, latest earnings report product approval, report product approval and certification, (5 more...)

#artificialintelligence

Industry: Health & Medicine > Health Care Technology (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning > Case Based Reasoning (0.40)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.40)
Information Technology > Artificial Intelligence > Applied AI (0.40)

Add feedback

Estimating Robust Query Models with Convex Optimization

Collins-thompson, Kevyn

Neural Information Processing SystemsFeb-15-2020, 01:27:52 GMT

Query expansion is a long-studied approach for improving retrieval effectiveness by enhancing the userâ s original query with additional related terms. Current algorithms for automatic query expansion have been shown to consistently improve retrieval accuracy on average, but are highly unstable and have bad worst-case performance for individual queries. We introduce a novel risk framework that formulates query model estimation as a constrained metric labeling problem on a graph of term relations. Themodel combines assignment costs based on a baseline feedback algorithm, edge weights based on term similarity, and simple constraints to enforce aspect balance, aspect coverage, and term centrality. Results across multiple standard test collections show consistent and dramatic reductions in the number and magnitude of expansion failures, while retaining the strong positive gains of the baseline algorithm.

algorithm, convex optimization, robust query model, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.67)

Add feedback