AITopics | Question Answering

Question Answering

"Questions are asked and answered every day. Question answering (QA) technology aims to deliver the same facility online. It goes further than the more familiar search based on keywords (as in Google, Yahoo, and other search engines), in attempting to recognize what a question expresses and to respond with an actual answer. This simplifies things for users in two ways. First, questions do not often translate into a simple list of keywords. ...Second, QA takes responsibility for providing answers, rather than a searchable list of links to potentially relevant documents (web pages), highlighted by snippets of text that show how the query matched the documents."
– from Bonnie Webber & Nick Webb. Question Answering. In The Handbook of Computational Linguistics and Natural Language Processing. Alexander Clark, Chris Fox, Shalom Lappin (Eds.). Wiley, 2010.

The BLue Amazon Brain (BLAB): A Modular Architecture of Services about the Brazilian Maritime Territory

Pirozelli, Paulo, Castro, Ais B. R., de Oliveira, Ana Luiza C., Oliveira, André S., Cação, Flávio N., Silveira, Igor C., Campos, João G. M., Motheo, Laura C., Figueiredo, Leticia F., Pellicer, Lucas F. A. O., José, Marcelo A., José, Marcos M., Ligabue, Pedro de M., Grava, Ricardo S., Tavares, Rodrigo M., Matos, Vinícius B., Sym, Yan V., Costa, Anna H. R., Brandão, Anarosa A. F., Mauá, Denis D., Cozman, Fabio G., Peres, Sarajane M.

arXiv.org Artificial Intelligence, Sep-6-2022

We describe the first steps in the development of an artificial agent focused on the Brazilian maritime territory, a large region within the South Atlantic also known as the Blue Amazon. The "BLue Amazon Brain" (BLAB) integrates a number of services aimed at disseminating information about this region and its importance, functioning as a tool for environmental awareness. The main service provided by BLAB is a conversational facility that deals with complex questions about the Blue Amazon, called BLAB-Chat; its central component is a controller that manages several task-oriented natural language processing modules (e.g., question answering and summarizer systems). These modules have access to an internal data lake as well as to third-party databases. A news reporter (BLAB-Reporter) and a purposely-developed wiki (BLAB-Wiki) are also part of the BLAB service architecture. In this paper, we describe our current version of BLAB's architecture (interface, backend, web services, NLP modules, and resources) and comment on the challenges we have faced so far, such as the lack of training data and the scattered state of domain information. Solving these issues presents a considerable challenge in the development of artificial intelligence for technical domains.

machine learning, natural language, question answering, (18 more...)

2209.07928

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
South America > Brazil > São Paulo (0.05)
South America > Brazil > Rio Grande do Sul > Porto Alegre (0.04)
(5 more...)

Genre: Research Report (0.50)

Industry: Government (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.89)

SQuARE: Software for Question Answering Research

#artificialintelligence, Sep-5-2022, 11:40:18 GMT

Have you ever wanted to try Question Answering (QA) models but felt held back because you needed to write code to set them up? Have you ever wanted to compare QA models but found a Jupyter Notebook too inconvenient for the comparison? Have you ever wanted to use explainability methods such as saliency maps to explain the outputs, but didn't know where to start? We have been there too! That's why we built SQuARE: Software for Question Answering Research!

explainability method, qa model, software, (7 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.86)

Query-focused Extractive Summarisation for Biomedical and COVID-19 Complex Question Answering

Mollá, Diego

arXiv.org Artificial Intelligence, Sep-5-2022

This paper presents Macquarie University's participation in the two most recent BioASQ Synergy Tasks (as of June 2022), and in the BioASQ10 Task B (BioASQ10b), Phase B. In these tasks, participating systems are expected to generate complex answers to biomedical questions, where the answers may contain more than one sentence. We apply query-focused extractive summarisation techniques. In particular, we follow a sentence classification-based approach that scores each candidate sentence associated with a question, and the n highest-scoring sentences are returned as the answer. The Synergy Task corresponds to an end-to-end system that requires document selection, snippet selection, and finding the final answer, but it has very limited training data. For the Synergy Task, we selected the candidate sentences in two phases: document retrieval and snippet retrieval, and the final answer was found by using a DistilBERT/ALBERT classifier that had been trained on the training data of BioASQ9b. Document retrieval was achieved as a standard search over the CORD-19 data using the search API provided by the BioASQ organisers, and snippet retrieval was achieved by re-ranking the sentences of the top retrieved documents, using the cosine similarity of the question and candidate sentence. We observed that vectors represented via sBERT have an edge over tf-idf. BioASQ10b Phase B focuses on finding the specific answers to biomedical questions. For this task, we followed a data-centric approach. We hypothesised that the training data of the first BioASQ years might be biased and we experimented with different subsets of the training data. We observed an improvement of results when the system was trained on the second half of the BioASQ10b training data.
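The snippet re-ranking step described in this abstract (ordering candidate sentences by cosine similarity to the question and returning the n best) can be illustrated with a minimal sketch. This is not the authors' code: it uses plain term-count vectors as a stand-in for the sBERT and tf-idf representations mentioned in the abstract, and the names `tokenize`, `cosine`, and `rerank` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rerank(question, sentences, n=2):
    """Return the n candidate sentences most similar to the question.

    Assumption: raw term counts replace the sentence embeddings
    that a real system would use.
    """
    q_vec = Counter(tokenize(question))
    scored = [(cosine(q_vec, Counter(tokenize(s))), s) for s in sentences]
    # Highest similarity first; the sort is stable, so ties keep input order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [s for _, s in scored[:n]]


if __name__ == "__main__":
    candidates = [
        "Remdesivir is one of the drugs used to treat COVID-19.",
        "The weather was sunny.",
        "COVID-19 is caused by a coronavirus.",
    ]
    for sentence in rerank("What drugs treat COVID-19?", candidates):
        print(sentence)
```

Swapping the `Counter` vectors for dense sentence embeddings would leave `rerank` unchanged apart from the similarity function, which is roughly the comparison the abstract reports between sBERT and tf-idf.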

candidate sentence, synergy task, training data, (15 more...)

2209.01815

Country:

Oceania > Australia (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.41)
Health & Medicine > Therapeutic Area > Immunology (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.68)

Interactive Question Answering Systems: Literature Review

Biancofiore, Giovanni Maria, Deldjoo, Yashar, Di Noia, Tommaso, Di Sciascio, Eugenio, Narducci, Fedelucio

arXiv.org Artificial Intelligence, Sep-4-2022

Question answering systems are recognized as popular and frequently effective means of information seeking on the web. In such systems, information seekers can receive a concise response to their query by presenting their questions in natural language. Interactive question answering is a recently proposed and increasingly popular solution that resides at the intersection of question answering and dialogue systems. On the one hand, the user can ask questions in normal language and locate the actual response to her inquiry; on the other hand, the system can prolong the question-answering session into a dialogue if there are multiple probable replies, very few, or ambiguities in the initial request. By permitting the user to ask more questions, interactive question answering enables users to dynamically interact with the system and receive more precise results. This survey offers a detailed overview of the interactive question-answering methods that are prevalent in current literature. It begins by explaining the foundational principles of question-answering systems, then defines new notations and taxonomies to combine all identified works inside a unified framework. The reviewed published work on interactive question-answering systems is then presented and examined in terms of its proposed methodology, evaluation approaches, and dataset/application domain. We also describe trends surrounding specific tasks and issues raised by the community, thereby shedding light on the future interests of scholars. Our work is further supported by a GitHub page with a synthesis of all the major topics covered in this literature study. https://sisinflab.github.io/interactive-question-answering-systems-survey/

dataset, information, interaction, (14 more...)

2209.01621

Country:

Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Los Altos (0.04)
(3 more...)

Genre:

Workflow (1.00)
Research Report (1.00)
Overview (1.00)

Industry:

Information Technology (0.67)
Leisure & Entertainment > Sports > Olympic Games (0.46)
Government > Regional Government > North America Government > United States Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Exploiting Hybrid Semantics of Relation Paths for Multi-hop Question Answering Over Knowledge Graphs

Qiao, Zile, Ye, Wei, Zhang, Tong, Mo, Tong, Li, Weiping, Zhang, Shikun

arXiv.org Artificial Intelligence, Sep-2-2022

Answering natural language questions on knowledge graphs (KGQA) remains a great challenge in terms of understanding complex questions via multi-hop reasoning. Previous efforts usually exploit large-scale entity-related text corpora or knowledge graph (KG) embeddings as auxiliary information to facilitate answer selection. However, the rich semantics implied in off-the-shelf relation paths between entities is far from well explored. This paper proposes improving multi-hop KGQA by exploiting relation paths' hybrid semantics. Specifically, we integrate explicit textual information and implicit KG structural features of relation paths based on a novel rotate-and-scale entity link prediction framework. Extensive experiments on three existing KGQA datasets demonstrate the superiority of our method, especially in multi-hop scenarios. Further investigation confirms our method's systematic coordination between questions and relation paths to identify answer entities.

information, relation path, representation, (14 more...)

2209.0087

Country:

Europe > Switzerland > Zürich > Zürich (0.04)
Asia > China (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.82)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.52)

Building the Intent Landscape of Real-World Conversational Corpora with Extractive Question-Answering Transformers

Corbeil, Jean-Philippe, Li, Mia Taige, Ghavidel, Hadi Abdi

arXiv.org Artificial Intelligence, Aug-30-2022

For companies with customer service, mapping intents inside their conversational data is crucial in building applications based on natural language understanding (NLU). Nevertheless, there is no established automated technique to gather the intents from noisy online chats or voice transcripts. Simple clustering approaches are not suited to intent-sparse dialogues. To solve this intent-landscape task, we propose an unsupervised pipeline that extracts the intents and the taxonomy of intents from real-world dialogues. Our pipeline mines intent-span candidates with an extractive question-answering ELECTRA model and leverages sentence embeddings to apply a low-level density clustering followed by a top-level hierarchical clustering. Our results demonstrate the generalization ability of an ELECTRA large model fine-tuned on the SQuAD2 dataset to understand dialogues. With the right prompting question, this model achieves a rate of linguistic validation on intent spans beyond 85%. We furthermore reconstructed the intent schemes of five domains from the MultiDoGo dataset with an average recall of 94.3%.

dataset, dialogue, proceedings, (14 more...)

2208.12886

Country:

North America > Canada (0.04)
North America > United States > Texas (0.04)
North America > United States > New York (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.88)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)

Faithful Reasoning Using Large Language Models

Creswell, Antonia, Shanahan, Murray

arXiv.org Artificial Intelligence, Aug-30-2022

Although contemporary large language models (LMs) demonstrate impressive question-answering capabilities, their answers are typically the product of a single call to the model. This entails an unwelcome degree of opacity and compromises performance, especially on problems that are inherently multi-step. To address these limitations, we show how LMs can be made to perform faithful multi-step reasoning via a process whose causal structure mirrors the underlying logical structure of the problem. Our approach works by chaining together reasoning steps, where each step results from calls to two fine-tuned LMs, one for selection and one for inference, to produce a valid reasoning trace. Our method carries out a beam search through the space of reasoning traces to improve reasoning quality. We demonstrate the effectiveness of our model on multi-step logical deduction and scientific question-answering, showing that it outperforms baselines on final answer accuracy, and generates humanly interpretable reasoning traces whose validity can be checked by the user.
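The control flow outlined in this abstract (alternating a selection step and an inference step, with a beam search over the resulting reasoning traces) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the two fine-tuned LMs are replaced by toy rule-based stand-ins, the scoring function is a simple goal check, and every name (`beam_search`, `toy_select`, `toy_infer`) is hypothetical.

```python
def beam_search(context, select, infer, score, is_complete,
                beam_width=2, max_steps=3):
    """Search over reasoning traces built from select-then-infer steps.

    A trace is a tuple of inferred statements. `select` proposes pieces
    of evidence given the context and the trace so far; `infer` turns
    one selection into a new statement (or None).
    """
    beams = [()]
    for _ in range(max_steps):
        candidates = []
        for trace in beams:
            for selection in select(context, trace):
                new_fact = infer(selection)
                if new_fact is not None:
                    candidates.append(trace + (new_fact,))
        if not candidates:
            break
        # Keep only the highest-scoring traces (stable for ties).
        candidates.sort(key=score, reverse=True)
        beams = candidates[:beam_width]
        finished = [t for t in beams if is_complete(t)]
        if finished:
            return finished[0]
    return beams[0]


# Toy stand-ins for the selection and inference models (assumptions).
def toy_select(context, trace):
    """Pick rules whose premise is known and whose conclusion is new."""
    facts, rules = context
    known = set(facts) | set(trace)
    return [(p, c) for p, c in rules if p in known and c not in known]


def toy_infer(selection):
    """Apply a selected rule: the new statement is its conclusion."""
    _premise, conclusion = selection
    return conclusion


if __name__ == "__main__":
    facts = ["it rains"]
    rules = [
        ("it rains", "ground is wet"),
        ("ground is wet", "ground is slippery"),
        ("it rains", "sky is grey"),
    ]
    goal = "ground is slippery"
    trace = beam_search(
        (facts, rules), toy_select, toy_infer,
        score=lambda t: 1.0 if goal in t else 0.0,
        is_complete=lambda t: goal in t,
    )
    print(" -> ".join(trace))
```

Because each step in the trace is produced from an explicit selection, a reader can check every inferred statement against the evidence it came from, which is the kind of inspectability the abstract emphasises.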

bald eagle, reasoning trace, squirrel, (13 more...)

2208.14271

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Maryland (0.04)
North America > Dominican Republic (0.04)
(2 more...)

Genre: Research Report (0.81)

Industry:

Materials > Metals & Mining (1.00)
Transportation (0.93)
Energy > Renewable > Ocean Energy (0.69)
Education > Curriculum > Subject-Specific Education (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.88)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.66)

AutoQGS: Auto-Prompt for Low-Resource Knowledge-based Question Generation from SPARQL

Xiong, Guanming, Bao, Junwei, Zhao, Wen, Wu, Youzheng, He, Xiaodong

arXiv.org Artificial Intelligence, Aug-26-2022

This study investigates the task of knowledge-based question generation (KBQG). Conventional KBQG works generated questions from fact triples in the knowledge graph, which could not express complex operations like aggregation and comparison in SPARQL. Moreover, due to the costly annotation of large-scale SPARQL-question pairs, KBQG from SPARQL under low-resource scenarios urgently needs to be explored. Generative pre-trained language models (PLMs) such as T5 and BART, typically trained in a natural language (NL)-to-NL paradigm, have recently proven effective for low-resource generation, but how to use them effectively to generate NL questions from non-NL SPARQL remains challenging. To address these challenges, AutoQGS, an auto-prompt approach for low-resource KBQG from SPARQL, is proposed. Firstly, we propose generating questions directly from SPARQL for the KBQG task to handle complex operations. Secondly, we propose an auto-prompter trained on large-scale unsupervised data to rephrase SPARQL into NL description, smoothing the low-resource transformation from non-NL SPARQL to NL question with PLMs. Experimental results on the WebQuestionsSP, ComplexWebQuestions 1.1, and PathQuestions show that our model achieves state-of-the-art performance, especially in low-resource settings. Furthermore, a corpus of 330k factoid complex question-SPARQL pairs is generated for further KBQG research.

computational linguistic, sparql, subgraph, (12 more...)

2208.12461

Country:

North America > United States > Georgia > Fulton County > Atlanta (0.05)
Asia > China > Beijing > Beijing (0.05)
South America > Chile (0.04)
(15 more...)

Genre: Research Report (0.82)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.73)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.71)

Natural Language Processing Engineer - Remote Tech Jobs

#artificialintelligence, Aug-23-2022, 02:29:57 GMT

Please note that at this time we are unable to sponsor employment authorization (both new and transfer). We are looking for an NLP Engineer to design and develop text-mining solutions, build NLP pipelines and find insights across diverse data types and sources. To apply for this job please visit www.linkedin.com.

natural language processing engineer, remote tech job

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)

FashionVQA: A Domain-Specific Visual Question Answering System

Wang, Min, Mahjoubfar, Ata, Joshi, Anupama

arXiv.org Artificial Intelligence, Aug-23-2022

Humans apprehend the world through various sensory modalities, yet language is their predominant communication channel. Machine learning systems need to draw on the same multimodal richness to have informed discourses with humans in natural language; this is particularly true for systems specialized in visually-dense information, such as dialogue, recommendation, and search engines for clothing. To this end, we train a visual question answering (VQA) system to answer complex natural language questions about apparel in fashion photoshoot images. The key to the successful training of our VQA model is the automatic creation of a visual question-answering dataset with 168 million samples from item attributes of 207 thousand images using diverse templates. The sample generation employs a strategy that considers the difficulty of the question-answer pairs to emphasize challenging concepts. Contrary to the recent trends in using several datasets for pretraining the visual question answering models, we focused on keeping the dataset fixed while training various models from scratch to isolate the improvements from model architecture changes. We see that using the same transformer for encoding the question and decoding the answer, as in language models, achieves maximum accuracy, showing that visual language models (VLMs) make the best visual question answering systems for our dataset. The accuracy of the best model surpasses the human expert level, even when answering human-generated questions that are not confined to the template formats. Our approach for generating a large-scale multimodal domain-specific dataset provides a path for training specialized models capable of communicating in natural language. The training of such domain-expert models, e.g., our fashion VLM model, cannot rely solely on the large-scale general-purpose datasets collected from the web.
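The dataset construction mentioned in this abstract (question-answer samples created automatically from item attributes using templates) can be illustrated with a small sketch. The attribute names, templates, and the `generate_qa` function are hypothetical; the abstract does not describe the actual templates, and the difficulty-aware sampling strategy is not reproduced here.

```python
# Hypothetical question templates, keyed by attribute name.
TEMPLATES = {
    "color": [
        "What color is the {category}?",
        "Which color does the {category} have?",
    ],
    "sleeve length": [
        "What is the sleeve length of the {category}?",
    ],
}


def generate_qa(item):
    """Build (question, answer) pairs from one item's attributes.

    Attributes without a template are skipped.
    """
    pairs = []
    category = item["category"]
    for attribute, value in item["attributes"].items():
        for template in TEMPLATES.get(attribute, []):
            pairs.append((template.format(category=category), value))
    return pairs


if __name__ == "__main__":
    item = {
        "category": "dress",
        "attributes": {
            "color": "red",
            "sleeve length": "short",
            "fabric": "silk",
        },
    }
    for question, answer in generate_qa(item):
        print(question, "->", answer)
```

With several templates per attribute and many attributes per image, the number of generated samples grows much faster than the number of images, which is consistent with the abstract's ratio of samples to images.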

category, dataset, question template, (13 more...)

2208.11253

Country: North America > United States > California > Santa Clara County > Sunnyvale (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
