AITopics | Question Answering

Collaborating Authors

Question Answering

"Questions are asked and answered every day. Question answering (QA) technology aims to deliver the same facility online. It goes further than the more familiar search based on keywords (as in Google, Yahoo, and other search engines), in attempting to recognize what a question expresses and to respond with an actual answer. This simplifies things for users in two ways. First, questions do not often translate into a simple list of keywords. ...Second, QA takes responsibility for providing answers, rather than a searchable list of links to potentially relevant documents (web pages), highlighted by snippets of text that show how the query matched the documents."
– from Bonnie Webber & Nick Webb. Question Answering. In The Handbook of Computational Linguistics and Natural Language Processing. Alexander Clark, Chris Fox, Shalom Lappin (Eds.). Wiley, 2010.

News Overviews Instructional Materials AI-Alerts Classics

Self-Chained Image-Language Model for Video Localization and Question Answering

Yu, Shoubin, Cho, Jaemin, Yadav, Prateek, Bansal, Mohit

arXiv.org Artificial IntelligenceNov-29-2023

Recent studies have shown promising results on utilizing large pre-trained image-language models for video question answering. While these image-language models can efficiently bootstrap the representation learning of video-language models, they typically concatenate uniformly sampled video frames as visual inputs without explicit language-aware, temporal modeling. When only a portion of a video input is relevant to the language query, such uniform frame sampling can often lead to missing important visual cues. Although humans often find a video moment to focus on and rewind the moment to answer questions, training a query-aware video moment localizer often requires expensive annotations and high computational costs. To address this issue, we propose Self-Chained Video Localization-Answering (SeViLA), a novel framework that leverages a single image-language model (BLIP-2) to tackle both temporal keyframe localization and QA on videos. SeViLA framework consists of two modules: Localizer and Answerer, where both are parameter-efficiently fine-tuned from BLIP-2. We propose two ways of chaining these modules for cascaded inference and self-refinement. First, in the forward chain, the Localizer finds multiple language-aware keyframes in a video, which the Answerer uses to predict the answer. Second, in the reverse chain, the Answerer generates keyframe pseudo-labels to refine the Localizer, alleviating the need for expensive video moment localization annotations. Our SeViLA framework outperforms several strong baselines on 5 challenging video QA and event prediction benchmarks, and achieves the state-of-the-art in both fine-tuning (NExT-QA, STAR) and zero-shot (NExT-QA, STAR, How2QA, VLEP) settings. We also analyze the impact of Localizer, comparisons of Localizer with other temporal localization models, pre-training/self-refinement of Localizer, and varying the number of keyframes.

answerer, localizer, sevila framework, (11 more...)

arXiv.org Artificial Intelligence

2305.06988

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.92)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Pre-training Language Models for Comparative Reasoning

Yu, Mengxia, Zhang, Zhihan, Yu, Wenhao, Jiang, Meng

arXiv.org Artificial IntelligenceNov-28-2023

Comparative reasoning is a process of comparing objects, concepts, or entities to draw conclusions, which constitutes a fundamental cognitive ability. In this paper, we propose a novel framework to pre-train language models for enhancing their abilities of comparative reasoning over texts. While there have been approaches for NLP tasks that require comparative reasoning, they suffer from costly manual data labeling and limited generalizability to different tasks. Our approach introduces a novel method of collecting scalable data for text-based entity comparison, which leverages both structured and unstructured data. Moreover, we present a framework of pre-training language models via three novel objectives on comparative reasoning. Evaluation on downstream tasks including comparative question answering, question generation, and summarization shows that our pre-training framework significantly improves the comparative reasoning abilities of language models, especially under low-resource conditions. This work also releases the first integrated benchmark for comparative reasoning.

computational linguistic, proceedings, quintuple, (16 more...)

arXiv.org Artificial Intelligence

2305.14457

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
North America > United States > Pennsylvania (0.04)
North America > United States > Colorado > Boulder County > Boulder (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.88)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)

Add feedback

Just ClozE! A Novel Framework for Evaluating the Factual Consistency Faster in Abstractive Summarization

Li, Yiyang, Li, Lei, Litvak, Marina, Vanetik, Natalia, Hu, Dingxin, Li, Yuze, Zhou, Yanquan

arXiv.org Artificial IntelligenceNov-28-2023

The issue of factual consistency in abstractive summarization has received extensive attention in recent years, and the evaluation of factual consistency between summary and document has become an important and urgent task. Most of the current evaluation metrics are adopted from the question answering (QA) or natural language inference (NLI) task. However, the application of QA-based metrics is extremely time-consuming in practice while NLI-based metrics are lack of interpretability. In this paper, we propose a cloze-based evaluation framework called ClozE and show the great potential of the cloze-based metric. It inherits strong interpretability from QA, while maintaining the speed of NLI- level reasoning. We demonstrate that ClozE can reduce the evaluation time by nearly 96% relative to QA-based metrics while retaining their interpretability and performance through experiments on six human-annotated datasets and a meta-evaluation benchmark GO FIGURE (Gabriel et al., 2021). Finally, we discuss three important facets of ClozE in practice, which further shows better overall performance of ClozE compared to other metrics.

cloze, factual factor, metric, (14 more...)

arXiv.org Artificial Intelligence

2210.02804

Country:

Asia > Nepal (0.04)
Asia > China > Beijing > Beijing (0.04)
North America > United States > Arizona > Maricopa County > Glendale (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports > Football (0.94)
Government (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.34)

Add feedback

Releasing the CRaQAn (Coreference Resolution in Question-Answering): An open-source dataset and dataset creation methodology using instruction-following models

Grzywinski, Rob, D'Arcy, Joshua, Naidoff, Rob, Shukla, Ashish, Browne, Alex, Gibbons, Ren, Bent, Brinnae

arXiv.org Artificial IntelligenceNov-27-2023

Instruction-following language models demand robust methodologies for information retrieval to augment instructions for question-answering applications. A primary challenge is the resolution of coreferences in the context of chunking strategies for long documents. The critical barrier to experimentation of handling coreferences is a lack of open source datasets, specifically in question-answering tasks that require coreference resolution. In this work we present our Coreference Resolution in Question-Answering (CRaQAn) dataset, an open-source dataset that caters to the nuanced information retrieval requirements of coreference resolution in question-answering tasks by providing over 250 question-answer pairs containing coreferences. To develop this dataset, we developed a novel approach for creating high-quality datasets using an instruction-following model (GPT-4) and a Recursive Criticism and Improvement Loop.

coreference resolution, dataset, reviewer, (11 more...)

arXiv.org Artificial Intelligence

2311.16338

Country:

North America > United States > California (0.04)
South America (0.04)
Atlantic Ocean > North Atlantic Ocean (0.04)
Asia > Vietnam > Hanoi > Hanoi (0.04)

Genre:

Research Report (0.84)
Personal > Honors (0.68)

Industry:

Health & Medicine > Government Relations & Public Policy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law (0.93)
Health & Medicine > Health Care Providers & Services > Reimbursement (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Add feedback

A Comparative and Experimental Study on Automatic Question Answering Systems and its Robustness against Word Jumbling

Javaji, Shashidhar Reddy, Hu, Haoran, Vennam, Sai Sameer, Buddhavarapu, Vijaya Gajanan

arXiv.org Artificial IntelligenceNov-26-2023

We aim to understand and improve the performance of these state of the art Question answer generation using Natural Language models on different levels of noise. A model that Processing models is ubiquitous in the is robust to context corruption can increase the satisfaction world around us. It is used in many use cases such of the users of the system, especially for as the building of chat bots, suggestive prompts in people whose English is not their native language. It is and answer questions appropriately. This is highly relevant because a frequently asked questions relevant because many times we do not have control (FAQ) list can only have a finite amount of of the sources of data and how it is generated. In such cases, it becomes highly important to be able to answer new questions accurately as for the models to be able to understand and process long as it is a relevant question. In commercial this text which is riddled with noise.

dataset, experiment, performance degradation, (11 more...)

arXiv.org Artificial Intelligence

2311.15513

Country:

Europe > France (0.04)
North America > Canada > Alberta (0.04)
Asia > India > Karnataka > Bengaluru (0.04)

Genre:

Research Report > New Finding (0.50)
Research Report > Experimental Study (0.40)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

Add feedback

Question Answering in Natural Language: the Special Case of Temporal Expressions

Stricker, Armand

arXiv.org Artificial IntelligenceNov-23-2023

Although general question answering has been well explored in recent years, temporal question answering is a task which has not received as much focus. Our work aims to leverage a popular approach used for general question answering, answer extraction, in order to find answers to temporal questions within a paragraph. To train our model, we propose a new dataset, inspired by SQuAD, specifically tailored to provide rich temporal information. We chose to adapt the corpus WikiWars, which contains several documents on history's greatest conflicts. Our evaluation shows that a deep learning model trained to perform pattern matching, often used in general question answering, can be adapted to temporal question answering, if we accept to ask questions whose answers must be directly present within a text.

expression, paragraph, temporal expression, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.26615/issn.2603-2821.2021_026

2311.14087

Country:

Europe > Germany (0.04)
Africa > South Africa (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(6 more...)

Genre: Research Report (0.50)

Industry: Government (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction

Mondal, Ishani, Yuan, Michelle, N, Anandhavelu, Garimella, Aparna, Ferraro, Francis, Blair-Stanek, Andrew, Van Durme, Benjamin, Boyd-Graber, Jordan

arXiv.org Artificial IntelligenceNov-17-2023

Learning template based information extraction from documents is a crucial yet difficult task. Prior template-based IE approaches assume foreknowledge of the domain templates; however, real-world IE do not have pre-defined schemas and it is a figure-out-as you go phenomena. To quickly bootstrap templates in a real-world setting, we need to induce template slots from documents with zero or minimal supervision. Since the purpose of question answering intersect with the goal of information extraction, we use automatic question generation to induce template slots from the documents and investigate how a tiny amount of a proxy human-supervision on-the-fly (termed as InteractiveIE) can further boost the performance. Extensive experiments on biomedical and legal documents, where obtaining training data is expensive, reveal encouraging trends of performance improvement using InteractiveIE over AI-only baseline.

computational linguistic, explanation, proceedings, (16 more...)

arXiv.org Artificial Intelligence

2305.14659

Country:

North America > United States > Washington > King County > Seattle (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Virginia > Fairfax County > Fairfax (0.04)
(17 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (0.70)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.87)
(4 more...)

Add feedback

FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives

Chen, Jiaju, Lu, Yuxuan, Zhang, Shao, Yao, Bingsheng, Dong, Yuanzhe, Xu, Ying, Li, Yunyao, Wang, Qianwen, Wang, Dakuo, Sun, Yuling

arXiv.org Artificial IntelligenceNov-16-2023

AI models (including LLM) often rely on narrative question-answering (QA) datasets to provide customized QA functionalities to support downstream children education applications; however, existing datasets only include QA pairs that are grounded within the given storybook content, but children can learn more when teachers refer the storybook content to real-world knowledge (e.g., commonsense knowledge). We introduce the FairytaleCQA dataset, which is annotated by children education experts, to supplement 278 storybook narratives with educationally appropriate commonsense knowledge. The dataset has 5,868 QA pairs that not only originate from the storybook narrative but also contain the commonsense knowledge grounded by an external knowledge graph (i.e., ConceptNet). A follow-up experiment shows that a smaller model (T5-large) fine-tuned with FairytaleCQA reliably outperforms much larger prompt-engineered LLM (e.g., GPT-4) in this new QA-pair generation task (QAG). This result suggests that: 1) our dataset brings novel challenges to existing LLMs, and 2) human experts' data annotation are still critical as they have much nuanced knowledge that LLMs do not know in the children educational domain.

knowledge, pipeline, triplet, (16 more...)

arXiv.org Artificial Intelligence

2311.09756

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(11 more...)

Genre:

Workflow (0.95)
Research Report > New Finding (0.48)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

Add feedback

Crafting In-context Examples according to LMs' Parametric Knowledge

Lee, Yoonsang, Atreya, Pranav, Ye, Xi, Choi, Eunsol

arXiv.org Artificial IntelligenceNov-16-2023

In-context learning has been applied to knowledge-rich tasks such as question answering. In such scenarios, in-context examples are used to trigger a behaviour in the language model: namely, it should surface information stored in its parametric knowledge. We study the construction of in-context example sets, with a focus on the parametric knowledge of the model regarding in-context examples. We identify 'known' examples, where models can correctly answer from its parametric knowledge, and 'unknown' ones. Our experiments show that prompting with 'unknown' examples decreases the performance, potentially as it encourages hallucination rather than searching its parametric knowledge. Constructing an in-context example set that presents both known and unknown information performs the best across diverse settings. We perform analysis on three multi-answer question answering datasets, which allows us to further study answer set ordering strategies based on the LM's knowledge about each answer. Together, our study sheds lights on how to best construct in-context example sets for knowledge-rich tasks.

dataset, in-context example, parametric knowledge, (14 more...)

arXiv.org Artificial Intelligence

2311.09579

Country:

Africa > South Africa (0.14)
Asia > India (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(7 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Media (0.93)
Government > Regional Government (0.93)
Education (0.68)
Leisure & Entertainment > Sports > Soccer (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.75)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.68)

Add feedback

AmQA: Amharic Question Answering Dataset

Abedissa, Tilahun, Usbeck, Ricardo, Assabie, Yaregal

arXiv.org Artificial IntelligenceNov-16-2023

Question Answering (QA) returns concise answers or answer lists from natural language text given a context document. Many resources go into curating QA datasets to advance robust models' development. There is a surge of QA datasets for languages like English, however, this is not true for Amharic. Amharic, the official language of Ethiopia, is the second most spoken Semitic language in the world. There is no published or publicly available Amharic QA dataset. Hence, to foster the research in Amharic QA, we present the first Amharic QA (AmQA) dataset. We crowdsourced 2628 question-answer pairs over 378 Wikipedia articles. Additionally, we run an XLMR Large-based baseline model to spark open-domain QA research interest. The best-performing baseline achieves an F-score of 69.58 and 71.74 in reader-retriever QA and reading comprehension settings respectively.

dataset, proceedings, question-answer pair, (11 more...)

arXiv.org Artificial Intelligence

2303.0329

Country:

Africa > Ethiopia > Addis Ababa > Addis Ababa (0.06)
Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
Asia > Japan > Kyūshū & Okinawa > Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
Africa > Ghana (0.04)

Genre: Research Report (0.41)

Industry: Education (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.64)
Information Technology > Communications > Social Media > Crowdsourcing (0.49)

Add feedback