AITopics | Question Answering

Question Answering

"Questions are asked and answered every day. Question answering (QA) technology aims to deliver the same facility online. It goes further than the more familiar search based on keywords (as in Google, Yahoo, and other search engines), in attempting to recognize what a question expresses and to respond with an actual answer. This simplifies things for users in two ways. First, questions do not often translate into a simple list of keywords. ...Second, QA takes responsibility for providing answers, rather than a searchable list of links to potentially relevant documents (web pages), highlighted by snippets of text that show how the query matched the documents."
– from Bonnie Webber & Nick Webb. Question Answering. In The Handbook of Computational Linguistics and Natural Language Processing. Alexander Clark, Chris Fox, Shalom Lappin (Eds.). Wiley, 2010.

Exploring Models and Data for Image Question Answering

Richard S. Zemel, University of Toronto

Neural Information Processing Systems Mar-13-2024, 00:59:40 GMT

This work aims to address the problem of image-based question-answering (QA) with new models and datasets. In our work, we propose to use neural networks and visual semantic embeddings, without intermediate stages such as object detection and image segmentation, to predict answers to simple questions about images. Our model performs 1.8 times better than the only published results on an existing image QA dataset. We also present a question generation algorithm that converts image descriptions, which are widely available, into QA form. We used this algorithm to produce an order-of-magnitude larger dataset, with more evenly distributed answers. A suite of baseline results on this new dataset is also presented.

algorithm, dataset, ground truth, (16 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.87)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
(2 more...)

Beyond Memorization: The Challenge of Random Memory Access in Language Models

Zhu, Tongyao, Liu, Qian, Pang, Liang, Jiang, Zhengbao, Kan, Min-Yen, Lin, Min

arXiv.org Artificial Intelligence Mar-13-2024

Recent developments in Language Models (LMs) have shown their effectiveness in NLP tasks, particularly in knowledge-intensive tasks. However, the mechanisms underlying knowledge storage and memory access within their parameters remain elusive. In this paper, we investigate whether a generative LM (e.g., GPT-2) is able to access its memory sequentially or randomly. Through carefully-designed synthetic tasks, covering the scenarios of full recitation, selective recitation and grounded question answering, we reveal that LMs manage to sequentially access their memory while encountering challenges in randomly accessing memorized content. We find that techniques including recitation and permutation improve the random memory access capability of LMs. Furthermore, by applying this intervention to realistic scenarios of open-domain question answering, we validate that enhancing random access by recitation leads to notable improvements in question answering. The code to reproduce our experiments can be found at https://github.com/sail-sg/lm-random-memory-access.

computational linguistic, experiment, language model, (15 more...)

arXiv.org Artificial Intelligence

2403.07805

Country:

North America > Canada > Ontario > Toronto (0.05)
Asia > Singapore (0.04)
North America > United States > Texas (0.04)
(5 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.76)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning > Rote Learning (0.41)

A novel interface for adversarial trivia question-writing

Liu, Jason

arXiv.org Artificial Intelligence Mar-11-2024

A critical component when developing question-answering AIs is an adversarial dataset that challenges models to adapt to the complex syntax and reasoning underlying our natural language. Present techniques for procedurally generating adversarial texts are not robust enough for training on complex tasks such as answering multi-sentence trivia questions. We instead turn to human-generated data by introducing an interface for collecting adversarial human-written trivia questions. Our interface is aimed towards question writers and players of Quiz Bowl, a buzzer-based trivia competition where paragraph-long questions consist of a sequence of clues of decreasing difficulty. To incentivize usage, a suite of machine learning-based tools in our interface assist humans in writing questions that are more challenging to answer for Quiz Bowl players and computers alike. Not only does our interface gather training data for the groundbreaking Quiz Bowl AI project QANTA, but it is also a proof-of-concept of future adversarial data collection for question-answering systems. The results of performance-testing our interface with ten originally-composed questions indicate that, despite some flaws, our interface's novel question-writing features as well as its real-time exposure of useful responses from our machine models could facilitate and enhance the collection of adversarial questions. The code for our interface is available at: https://github.com/Zefan-Cai/QAML

interface, quiz bowl, reasoning, (14 more...)

arXiv.org Artificial Intelligence

2404.00011

Country:

South America > Paraguay (0.04)
North America > United States > Maryland (0.04)
Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
(11 more...)

Genre: Research Report (0.82)

Industry:

Education (0.48)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.76)

A Knowledge-Injected Curriculum Pretraining Framework for Question Answering

Lin, Xin, Su, Tianhuang, Huang, Zhenya, Xue, Shangzi, Liu, Haifeng, Chen, Enhong

arXiv.org Artificial Intelligence Mar-10-2024

Knowledge-based question answering (KBQA) is a key task in NLP research, and also an approach to access the web data and knowledge, which requires exploiting knowledge graphs (KGs) for reasoning. In the literature, one promising solution for KBQA is to incorporate the pretrained language model (LM) with KGs by generating a KG-centered pretraining corpus, which has shown its superiority. However, these methods often depend on specific techniques and resources to work, which may not always be available and restrict their application. Moreover, existing methods focus more on improving language understanding with KGs, while neglecting the more important human-like complex reasoning. To this end, in this paper, we propose a general Knowledge-Injected Curriculum Pretraining framework (KICP) to achieve comprehensive KG learning and exploitation for KBQA tasks, which is composed of knowledge injection (KI), knowledge adaptation (KA) and curriculum reasoning (CR). Specifically, the KI module first injects knowledge into the LM by generating a KG-centered pretraining corpus, and generalizes the process into three key steps that could work with different implementations for flexible application. Next, the KA module learns knowledge from the generated corpus with an LM equipped with an adapter while keeping its original natural language understanding ability, to reduce the negative impacts of the difference between the generated and natural corpus. Last, to enable the LM with complex reasoning, the CR module follows human reasoning patterns to construct three corpora with increasing difficulties of reasoning, and further trains the LM from easy to hard in a curriculum manner. We provide an implementation of the general framework, and evaluate the proposed KICP on four real-world datasets. The results demonstrate that our framework can achieve higher performances.

corpus, knowledge, reasoning, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3589334.3645406

2403.09712

Country:

Asia > Singapore > Central Region > Singapore (0.05)
Asia > China > Anhui Province > Hefei (0.04)
Europe > Greece (0.04)
(7 more...)

Genre:

Research Report > New Finding (0.34)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)

KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations

Kweon, Sunjun, Choi, Byungjin, Kim, Minkyu, Park, Rae Woong, Choi, Edward

arXiv.org Artificial Intelligence Mar-5-2024

We introduce KorMedMCQA, the first Korean multiple-choice question answering (MCQA) benchmark derived from Korean healthcare professional licensing examinations, covering the years 2012 to 2023. This dataset consists of a selection of questions from the license examinations for doctors, nurses, and pharmacists, featuring a diverse array of subjects. We conduct baseline experiments on various large language models, including proprietary/open-source, multilingual/Korean-additional pretrained, and clinical context pretrained models, highlighting the potential for further enhancements. We make our data publicly available on HuggingFace (https://huggingface.co/datasets/sean0042/KorMedMCQA) and provide an evaluation script via LM-Harness, inviting further exploration and advancement in Korean healthcare environments.

arxiv preprint arxiv, dataset, language model, (12 more...)

arXiv.org Artificial Intelligence

2403.01469

Country:

Europe > Spain (0.04)
Europe > France (0.04)
Asia > India (0.04)
Asia > China (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)

EEE-QA: Exploring Effective and Efficient Question-Answer Representations

Hu, Zhanghao, Yang, Yijun, Xu, Junjie, Qiu, Yifu, Chen, Pinzhen

arXiv.org Artificial Intelligence Mar-4-2024

Current approaches to question answering rely on pre-trained language models (PLMs) like RoBERTa. This work challenges the existing question-answer encoding convention and explores finer representations. We begin by testing various pooling methods against using the begin-of-sentence token as a question representation, aiming for better quality. Next, we explore opportunities to simultaneously embed all answer candidates with the question. This enables cross-reference between answer choices and improves inference throughput via reduced memory usage. Despite their simplicity and effectiveness, these methods have yet to be widely studied in current frameworks. We experiment with different PLMs, and with and without the integration of knowledge graphs. Results prove the memory efficacy of the proposed techniques with little sacrifice in performance. Practically, our work enhances 38-100% throughput with 26-65% speedups on consumer-grade GPUs by allowing for considerably larger batch sizes. Our work sends a message to the community with promising directions in both representation quality and efficiency for the question-answering task in natural language processing.
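
The contrast between a begin-of-sentence representation and a pooled representation can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the plain-list vector format, and the choice of mean and max pooling are assumptions made only for illustration.

```python
# Hypothetical sketch: three ways to turn per-token vectors into one
# question vector. `hidden` is a list of token vectors (lists of floats);
# `mask` marks real tokens with 1 and padding with 0.

def first_token_pool(hidden, mask):
    """Use the first (begin-of-sentence) token vector as the representation."""
    return list(hidden[0])


def _kept_vectors(hidden, mask):
    """Return only the token vectors whose mask entry is non-zero."""
    kept = [vec for vec, keep in zip(hidden, mask) if keep]
    if not kept:
        raise ValueError("mask selects no tokens")
    return kept


def mean_pool(hidden, mask):
    """Average the unmasked token vectors dimension by dimension."""
    kept = _kept_vectors(hidden, mask)
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def max_pool(hidden, mask):
    """Take the per-dimension maximum over the unmasked token vectors."""
    kept = _kept_vectors(hidden, mask)
    dim = len(kept[0])
    return [max(vec[i] for vec in kept) for i in range(dim)]
```

In a real model the token vectors would come from the encoder's last hidden layer; the padded position is excluded so that it cannot distort the pooled vector.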

greaselm, knowledge graph, representation, (13 more...)

arXiv.org Artificial Intelligence

2403.02176

Country: Europe (0.14)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.73)

Answerability in Retrieval-Augmented Open-Domain Question Answering

Abdumalikov, Rustam, Minervini, Pasquale, Kementchedjhieva, Yova

arXiv.org Artificial Intelligence Mar-3-2024

The performance of Open-Domain Question Answering (ODQA) retrieval systems can exhibit sub-optimal behavior, providing text excerpts with varying degrees of irrelevance. Unfortunately, many existing ODQA datasets lack examples specifically targeting the identification of irrelevant text excerpts. Previous attempts to address this gap have relied on a simplistic approach of pairing questions with random text excerpts. This paper aims to investigate the effectiveness of models trained using this randomized strategy, uncovering an important limitation in their ability to generalize to irrelevant text excerpts with high semantic overlap. As a result, we observed a substantial decrease in predictive accuracy, from 98% to 1%. To address this limitation, we discovered an efficient approach for training models to recognize such excerpts. By leveraging unanswerable pairs from the SQuAD 2.0 dataset, our models achieve a nearly perfect (~100%) accuracy when confronted with these challenging text excerpts.
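
The random-pairing strategy that the abstract describes as simplistic can be sketched as follows. The function name, the label strings, and the tuple layout are hypothetical, and the sketch assumes every excerpt in the input is distinct; it is not the authors' code.

```python
import random


def random_negative_pairs(examples, seed=0):
    """Pair each question with its own excerpt and with one random other excerpt.

    `examples` is a list of (question, excerpt) tuples. The result is a list of
    (question, excerpt, label) tuples, where the randomly paired excerpt is
    labeled "unanswerable". Label names are illustrative assumptions.
    """
    if len(examples) < 2:
        raise ValueError("need at least two examples to form a random pair")
    rng = random.Random(seed)
    labeled = []
    for i, (question, excerpt) in enumerate(examples):
        labeled.append((question, excerpt, "answerable"))
        # Draw an index from the other n-1 positions, skipping position i.
        j = rng.randrange(len(examples) - 1)
        if j >= i:
            j += 1
        labeled.append((question, examples[j][1], "unanswerable"))
    return labeled
```

Negatives built this way usually share little vocabulary with the question, which is consistent with the abstract's observation that models trained on them struggle with irrelevant excerpts that have high semantic overlap.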

excerpt, text excerpt, unanswerable question, (15 more...)

arXiv.org Artificial Intelligence

2403.01461

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Texas (0.04)
(2 more...)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Automatic Question-Answer Generation for Long-Tail Knowledge

Kumar, Rohan, Kim, Youngmin, Ravi, Sunitha, Sun, Haitian, Faloutsos, Christos, Salakhutdinov, Ruslan, Yoon, Minji

arXiv.org Artificial Intelligence Mar-2-2024

Pretrained Large Language Models (LLMs) have gained significant attention for addressing open-domain Question Answering (QA). While they exhibit high accuracy in answering questions related to common knowledge, LLMs encounter difficulties in learning about uncommon long-tail knowledge (tail entities). Since manually constructing QA datasets demands substantial human resources, the types of existing QA datasets are limited, leaving us with a scarcity of datasets to study the performance of LLMs on tail entities. In this paper, we propose an automatic approach to generate specialized QA datasets for tail entities and present the associated research challenges. We conduct extensive experiments by employing pretrained LLMs on our newly generated long-tail QA datasets, comparing their performance with and without external resources including Wikipedia and Wikidata knowledge graphs.

dataset, knowledge, qa dataset, (13 more...)

arXiv.org Artificial Intelligence

2403.01382

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > Los Angeles County > Long Beach (0.05)
North America > United States > Hawaii (0.04)
(3 more...)

Genre: Research Report > New Finding (0.47)

Industry: Government (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering?

Vatsal, Shubham, Singh, Ayush, Tafreshi, Shabnam

arXiv.org Artificial Intelligence Feb-28-2024

Health insurance companies have a defined process called prior authorization (PA), a health plan cost-control process that requires doctors and other healthcare professionals to get clearance in advance from a health plan before performing a particular procedure on a patient in order to be eligible for payment coverage. For health insurance companies, approving PA requests for patients in the medical domain is a time-consuming and challenging task. One of the key challenges is validating whether a request matches certain criteria such as age, gender, etc. In this work, we evaluate whether GPT can validate numerous key factors, in turn helping health plans reach a decision drastically faster. We frame it as a question answering task, prompting GPT to answer a question from a patient's electronic health record. We experiment with different conventional prompting techniques as well as introduce our own novel prompting technique. Moreover, we report qualitative assessment by humans on the natural language generation outputs from our approach. Results show that our method achieves superior performance, with a mean weighted F1 score of 0.61, as compared to its standard counterparts.
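
For readers unfamiliar with the metric, a support-weighted F1 score can be computed as in the sketch below. It follows the common definition (per-class F1 averaged with weights equal to each class's number of true instances); whether the paper's "mean weighted F1" is computed in exactly this way is an assumption, and the function name is hypothetical.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Support-weighted average of per-class F1 scores."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal length")
    support = Counter(y_true)  # number of true instances per class
    total = 0.0
    for label, n in support.items():
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == label and p == label)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != label and p == label)
        fn = n - tp
        # F1 = 2*TP / (2*TP + FP + FN); the denominator is positive because n >= 1.
        f1 = 2 * tp / (2 * tp + fp + fn)
        total += f1 * n
    return total / len(y_true)
```

Weighting by support means that frequent answer classes dominate the score, which matters when the answer distribution is imbalanced.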

health record note, implicit rag, multi-choice question, (14 more...)

arXiv.org Artificial Intelligence

2402.18419

Country:

Europe > Spain > Valencian Community > Valencia Province > Valencia (0.04)
Europe > Bulgaria > Varna Province > Varna (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Health & Medicine > Consumer Health (1.00)
Banking & Finance > Insurance (0.95)
Health & Medicine > Health Care Technology > Medical Record (0.75)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.72)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
(2 more...)

Unsupervised multiple choices question answering via universal corpus

Zhang, Qin, Ge, Hao, Chen, Xiaojun, Fang, Meng

arXiv.org Artificial Intelligence Feb-27-2024

Unsupervised question answering is a promising yet challenging task, which alleviates the burden of building large-scale annotated data in a new domain. It motivates us to study the unsupervised multiple-choice question answering (MCQA) problem. In this paper, we propose a novel framework designed to generate synthetic MCQA data barely based on contexts from the universal domain without relying on any form of manual annotation. Possible answers are extracted and used to produce related questions, then we leverage both named entities (NE) and knowledge graphs to discover plausible distractors to form complete synthetic samples.

Fabbri et al [8] and Li et al [11] further extended this idea with template-based question generation and iterative data refinement, but are still only applicable to EQA tasks. There are also some trials for MCQA without supervision. Liu and Lee [12] assumed the absence of correct answer labels, but directly train a QA model based on the context, question, and answer candidate sets. Ren and Zhu [13] emphasized the distractor generation, trying to construct a complete sample using the given context, question, as well as the correct answer. Nevertheless, they still depend on a certain amount of data in the target domain, like the contexts and questions, which further limits their ...
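
The idea of completing a synthetic multiple-choice sample with distractors can be sketched roughly as below. This is a simplified illustration, not the paper's framework: it draws distractors only from a hypothetical lookup table of entities sharing the answer's type and does not use a knowledge graph. All names and the output layout are assumptions.

```python
import random


def build_mcqa_sample(question, answer, answer_type, entities_by_type,
                      n_distractors=3, seed=0):
    """Assemble one multiple-choice sample from a question and its answer.

    `entities_by_type` maps an entity type (e.g. "CITY") to known entities of
    that type; distractors are sampled from the same type as the answer.
    """
    rng = random.Random(seed)
    # Deduplicate while preserving order, and never offer the answer twice.
    pool = [e for e in dict.fromkeys(entities_by_type.get(answer_type, []))
            if e != answer]
    if len(pool) < n_distractors:
        raise ValueError("not enough same-type entities to use as distractors")
    options = rng.sample(pool, n_distractors) + [answer]
    rng.shuffle(options)
    return {
        "question": question,
        "options": options,
        "label": options.index(answer),  # position of the correct option
    }
```

Sampling distractors of the same entity type keeps the wrong options superficially plausible, which is the property the abstract attributes to its distractor discovery step.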

answer candidate, dataset, distractor, (16 more...)

arXiv.org Artificial Intelligence

2402.17333

Country:

Asia > China > Guangdong Province > Shenzhen (0.05)
Europe > United Kingdom > England > Merseyside > Liverpool (0.04)

Genre:

Research Report (0.82)
Questionnaire & Opinion Survey (0.62)

Industry: Education (0.62)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)