AITopics | qa-pair

Objective quantification of mood states using large language models

Onysk, Jakub, Huys, Quentin

arXiv.org Artificial Intelligence, Feb-13-2025

Emotional states influence human behaviour and cognition, leading to diverse thought trajectories. Similarly, Large Language Models (LLMs) showcase an excellent level of response consistency across wide-ranging contexts (prompts). We leverage these parallels to establish a framework for quantifying mental states. Our approach utilises self-report questionnaires that reliably assess these states due to their inherent sensitivity to patterns of co-occurring responses. Specifically, we recruited a large sample of participants (N=422) to investigate how well an LLM (Mistral-7B-OpenOrca) quantifies a heterogeneous set of depressive mood states measured with participants' open-ended responses to a depression questionnaire. We show that LLM responses to held-out multiple-choice questions, given participants' open-ended answers, correlate strongly (r: 0.52-0.84) with true questionnaire scores, demonstrating the LLM's generalisation from mood representations. We explore a link between these representations and factor analysis. Using ridge regression, we find depression-related subspaces within LLM hidden states. We show these subspaces to be predictive of participants' "Depression" and "Somatic & Emotional Distress" factor scores, as well as suicidality severity. Overall, LLMs can provide quantitative measures of mental states. The reliability of these measures hinges upon how informative the questions we ask participants are. Used correctly, this approach could supplement mental state assessment in a variety of settings.
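
To make the reported agreement figure concrete, here is a minimal sketch of how a correlation between LLM-derived scores and true questionnaire scores could be computed. It assumes that r denotes the Pearson coefficient, which the abstract does not state. The function name `pearson_r` and the score lists are hypothetical and invented for illustration; they are not data from the paper.

```python
import math
from typing import Sequence


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation between two equal-length lists of scores."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Covariance and variances are left unnormalised; the factors cancel.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / math.sqrt(var_x * var_y)


# Invented toy numbers (assumption): one score per participant.
llm_scores = [3.0, 7.5, 12.0, 18.0, 21.5]   # scores derived from LLM answers
true_scores = [4, 6, 14, 15, 23]            # participants' own questionnaire totals

print(round(pearson_r(llm_scores, true_scores), 2))
```

A value near 1 would mean the LLM-derived scores rank participants much as their own questionnaire totals do; the paper's actual analysis pipeline may differ.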

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2502.09487

Country:

North America > United States > New York (0.04)
Europe > France (0.04)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)
Health & Medicine > Consumer Health (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.66)

FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives

Chen, Jiaju, Lu, Yuxuan, Zhang, Shao, Yao, Bingsheng, Dong, Yuanzhe, Xu, Ying, Li, Yunyao, Wang, Qianwen, Wang, Dakuo, Sun, Yuling

arXiv.org Artificial Intelligence, Nov-16-2023

AI models (including LLMs) often rely on narrative question-answering (QA) datasets to provide customized QA functionality for downstream children's education applications; however, existing datasets only include QA pairs grounded in the given storybook content, whereas children can learn more when teachers relate the storybook content to real-world knowledge (e.g., commonsense knowledge). We introduce the FairytaleCQA dataset, annotated by children's education experts, to supplement 278 storybook narratives with educationally appropriate commonsense knowledge. The dataset has 5,868 QA pairs that not only originate from the storybook narrative but also contain commonsense knowledge grounded in an external knowledge graph (i.e., ConceptNet). A follow-up experiment shows that a smaller model (T5-large) fine-tuned with FairytaleCQA reliably outperforms much larger prompt-engineered LLMs (e.g., GPT-4) on this new QA-pair generation (QAG) task. This result suggests that 1) our dataset brings novel challenges to existing LLMs, and 2) human experts' data annotation is still critical, as experts have nuanced knowledge of the children's education domain that LLMs lack.

knowledge, pipeline, triplet, (16 more...)

arXiv.org Artificial Intelligence

2311.09756

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(11 more...)

Genre:

Workflow (0.95)
Research Report > New Finding (0.48)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

When to Read Documents or QA History: On Unified and Selective Open-domain QA

Lee, Kyungjae, Han, Sang-eun, Hwang, Seung-won, Lee, Moontae

arXiv.org Artificial Intelligence, Jun-7-2023

Open-domain question answering is a well-known task in natural language processing, aiming to answer factoid questions from an open set of domains. One commonly used approach for this task is the retrieve-then-read pipeline (also known as Openbook QA) to retrieve relevant knowledge, then reason answers over the knowledge. Given the wide range of topics that open-domain questions can cover, a key to a successful answering model is to access and utilize diverse knowledge sources effectively. Toward this goal, existing work can be categorized by the knowledge source used: Document Corpus-based QA (Doc-QA): This type of work utilizes a general-domain Document Corpus (e.g., Wikipedia) (Karpukhin

Figure 1 illustrates the distinction of our approach providing both knowledge to a unified reader as context. We retrieve a list of relevant QA-pairs (called QA-history), then treat the few retrieved QA examples as if they were a relevant document passage. Meanwhile, the closest approach to use multiple knowledge sources is concatenating the multi-sources uniformly into a single decoder (Oguz et al., 2020), but we argue knowledge selection is critically missing. To motivate, Figure 1 shows the QA-history, from which the answer 'Eric Liddell' is explicitly identified, while it is more implicit in the document, such that another name such as 'Hugh Hudson' is known to often confuse QA models. It is critical for the QA model to calibrate prediction quality as an indicator to decide when to use a

calibration, natural language, question answering, (17 more...)

arXiv.org Artificial Intelligence

2306.04176

Country:

Asia > South Korea > Seoul > Seoul (0.04)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment (0.68)
Media > Film (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.61)

Quinductor: a multilingual data-driven method for generating reading-comprehension questions using Universal Dependencies

Kalpakchi, Dmytro, Boye, Johan

arXiv.org Artificial Intelligence, May-12-2023

We propose a multilingual data-driven method for generating reading comprehension questions using dependency trees. Our method provides a strong, mostly deterministic, and inexpensive-to-train baseline for less-resourced languages. While a language-specific corpus is still required, its size is nowhere near those required by modern neural question generation (QG) architectures. Our method surpasses QG baselines previously reported in the literature and shows a good performance in terms of human evaluation.

1 Introduction

We are interested in question generation (QG) - the task of automatically generating reading comprehension questions and their correct answers from given declarative sentences. Numerous methods have been proposed for solving this task, most of which have been aimed at the English language. Recent methods are based on neural networks and rely on the availability of large-scale datasets, such as SQuAD (Rajpurkar et al. 2016) - a question-answering dataset repurposed for QG - or large-scale pretrained models, such as GPT-3 (Brown et al. 2020). Early methods, mostly based on context-free grammars, relied on the strict word order and the limited inflectional morphology of English. These traits made it relatively straightforward to craft handwritten templates based on these grammars. The above-mentioned idiosyncrasies and the unique availability of large-scale resources for English leave a number of open challenges for developing QG methods applicable to languages other than English. The first challenge is the lack of large-scale training datasets and the prohibitively high cost of obtaining such resources. State-of-the-art QG methods for English train their models on the previously mentioned SQuAD dataset, which contains more than 100,000 questions. Obtaining a good-quality dataset of a similar size is very expensive, especially for languages with fewer native speakers around the world. The second challenge is knowing how well available methods developed for English would generalize to other languages, especially synthetic ones with richer inflectional morphology and less strict word order (e.g., Finnish, Turkish or Russian). To the best of our knowledge, not much research has been done on QG for these kinds of languages. The third challenge is assessing the obtained performance results.

machine learning, natural language, question answering, (23 more...)

arXiv.org Artificial Intelligence

2103.10121

Country:

South America > Brazil (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
(2 more...)

Genre:

Workflow (0.93)
Research Report (0.63)

Industry: Education > Assessment & Standards > Student Performance (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.92)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.91)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Chen, Wenhu, Verga, Pat, de Jong, Michiel, Wieting, John, Cohen, William

arXiv.org Artificial Intelligence, Jan-23-2023

Retrieval-augmented language models have recently become the standard for knowledge-intensive tasks. Rather than relying purely on latent semantics within the parameters of large neural models, these methods enlist a semi-parametric memory to encode an index of knowledge for the model to retrieve over. Most prior work has employed text passages as the unit of knowledge, which has high coverage at the cost of interpretability, controllability, and efficiency. The opposite properties arise in other methods which have instead relied on knowledge base (KB) facts. At the same time, more recent work has demonstrated the effectiveness of storing and retrieving from an index of Q-A pairs derived from text (Lewis et al., 2021). This approach yields a high-coverage knowledge representation that maintains KB-like properties due to its representations being more atomic units of information. In this work we push this line of research further by proposing a question-answer augmented encoder-decoder model and accompanying pretraining strategy. This yields an end-to-end system that not only outperforms prior QA retrieval methods on single-hop QA tasks but also enables compositional reasoning, as demonstrated by strong performance on two multi-hop QA datasets. Together, these methods improve the ability to interpret and control the model while narrowing the performance gap with passage retrieval systems.

information retrieval, machine learning, question answering, (19 more...)

arXiv.org Artificial Intelligence

2204.04581

Country:

North America > United States > California (0.14)
Asia > Middle East > Jordan (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report > New Finding (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)

Automatically generating question-answer pairs for assessing basic reading comprehension in Swedish

Kalpakchi, Dmytro, Boye, Johan

arXiv.org Artificial Intelligence, Nov-28-2022

This paper presents an evaluation of the quality of automatically generated reading comprehension questions from Swedish text, using the Quinductor method. This method is a light-weight, data-driven but non-neural method for automatic question generation (QG). The evaluation shows that Quinductor is a viable QG method that can provide a strong baseline for neural-network-based QG methods.

natural language, question answering, template, (15 more...)

arXiv.org Artificial Intelligence

2211.15568

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > United Kingdom > Scotland > City of Aberdeen > Aberdeen (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Education > Assessment & Standards > Student Performance (0.91)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.38)

It is AI's Turn to Ask Human a Question: Question and Answer Pair Generation for Children Storybooks in FairytaleQA Dataset

Yao, Bingsheng, Wang, Dakuo, Wu, Tongshuang, Hoang, Tran, Sun, Branda, Li, Toby Jia-Jun, Yu, Mo, Xu, Ying

arXiv.org Artificial Intelligence, Sep-8-2021

Existing question answering (QA) datasets are created mainly for the application of enabling AI to answer questions asked by humans. But in educational applications, teachers and parents sometimes may not know which questions to ask a child to maximize the child's language learning. With a newly released book QA dataset (FairytaleQA), which educational experts labeled on 46 fairytale storybooks for early childhood readers, we developed an automated QA generation model architecture for this novel application. Our model (1) extracts candidate answers from a given storybook passage through carefully designed heuristics based on a pedagogical framework; (2) generates appropriate questions corresponding to each extracted answer using a language model; and (3) uses another QA model to rank top QA-pairs. Automatic and human evaluations show that our model outperforms baselines. We also demonstrate that our method can help with the scarcity issue of the children's book QA dataset via data augmentation on 200 unlabeled storybooks.
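
The three-stage structure described in this abstract (extract candidate answers, generate a question per answer, rank the resulting pairs) can be sketched as a small pipeline. This is only an illustration of the control flow under loud assumptions: every function name here is hypothetical, and the three toy stand-ins (capitalised-word extraction, fill-in-the-blank questions, frequency-based ranking) replace the paper's pedagogical heuristics, language model, and ranking QA model, none of which are specified in the abstract.

```python
from typing import Callable, List, Tuple

QAPair = Tuple[str, str]  # (question, answer)


def generate_qa_pairs(
    passage: str,
    extract_answers: Callable[[str], List[str]],
    generate_question: Callable[[str, str], str],
    score_pair: Callable[[str, QAPair], float],
    top_k: int = 2,
) -> List[QAPair]:
    """Run the extract -> generate -> rank stages and keep the top_k pairs."""
    answers = extract_answers(passage)                            # stage 1
    pairs = [(generate_question(passage, a), a) for a in answers]  # stage 2
    ranked = sorted(pairs, key=lambda qa: score_pair(passage, qa), reverse=True)
    return ranked[:top_k]                                         # stage 3


def toy_extract(passage: str) -> List[str]:
    """Stand-in heuristic: capitalised words that do not start a sentence."""
    found: List[str] = []
    for sentence in passage.split("."):
        for word in sentence.split()[1:]:
            word = word.strip(",;")
            if word[:1].isupper() and word not in found:
                found.append(word)
    return found


def toy_question(passage: str, answer: str) -> str:
    """Stand-in for a language model: blank the answer out of its sentence."""
    for sentence in passage.split("."):
        if answer in sentence:
            return "Fill in the blank: " + sentence.strip().replace(answer, "____") + "."
    return "What is mentioned in the passage?"


def toy_score(passage: str, qa: QAPair) -> float:
    """Stand-in for a ranking QA model: prefer frequently mentioned answers."""
    return float(passage.count(qa[1]))


passage = "The fox met Gretel near the river. Gretel gave the fox some bread. Later Hansel arrived."
for question, answer in generate_qa_pairs(passage, toy_extract, toy_question, toy_score):
    print(question, "->", answer)
```

Passing the three stages in as callables keeps the pipeline shape fixed while each stand-in can be swapped for a real component.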

arxiv preprint arxiv, dataset, qa pair, (12 more...)

arXiv.org Artificial Intelligence

2109.03423

Country:

North America > United States > California > Orange County > Irvine (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Genre: Research Report (0.64)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

Lewis, Patrick, Wu, Yuxiang, Liu, Linqing, Minervini, Pasquale, Küttler, Heinrich, Piktus, Aleksandra, Stenetorp, Pontus, Riedel, Sebastian

arXiv.org Artificial Intelligence, Feb-13-2021

Open-domain Question Answering models which directly leverage question-answer (QA) pairs, such as closed-book QA (CBQA) models and QA-pair retrievers, show promise in terms of speed and memory compared to conventional models which retrieve and read from text corpora. QA-pair retrievers also offer interpretable answers, a high degree of control, and are trivial to update at test time with new knowledge. However, these models lack the accuracy of retrieve-and-read systems, as substantially less knowledge is covered by the available QA-pairs relative to text corpora like Wikipedia. To facilitate improved QA-pair models, we introduce Probably Asked Questions (PAQ), a very large resource of 65M automatically-generated QA-pairs. We introduce a new QA-pair retriever, RePAQ, to complement PAQ. We find that PAQ preempts and caches test questions, enabling RePAQ to match the accuracy of recent retrieve-and-read models, whilst being significantly faster. Using PAQ, we train CBQA models which outperform comparable baselines by 5%, but trail RePAQ by over 15%, indicating the effectiveness of explicit retrieval. RePAQ can be configured for size (under 500MB) or speed (over 1K questions per second) whilst retaining high accuracy. Lastly, we demonstrate RePAQ's strength at selective QA, abstaining from answering when it is likely to be incorrect. This enables RePAQ to "back-off" to a more expensive state-of-the-art model, leading to a combined system which is both more accurate and 2x faster than the state-of-the-art model alone.
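
The selective-QA idea in this abstract (answer from the QA-pair index when confident, otherwise back off to a more expensive model) can be sketched as follows. This is a toy illustration, not RePAQ: the abstract does not describe how RePAQ scores candidates, so plain string similarity from Python's standard `difflib` is used as a stand-in, and the index contents, the function names, and the 0.9 threshold are all assumptions made for the example.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict, Tuple

# Hypothetical miniature QA-pair index (question -> answer).
QA_INDEX: Dict[str, str] = {
    "who wrote hamlet": "William Shakespeare",
    "what is the capital of france": "Paris",
}


def retrieve(question: str, index: Dict[str, str]) -> Tuple[str, float]:
    """Return the answer of the most similar stored question and its similarity."""
    query = question.lower().strip(" ?")
    best = max(index, key=lambda stored: SequenceMatcher(None, query, stored).ratio())
    return index[best], SequenceMatcher(None, query, best).ratio()


def answer_with_backoff(
    question: str,
    index: Dict[str, str],
    fallback: Callable[[str], str],
    threshold: float = 0.9,  # assumed value; a real system would tune this
) -> Tuple[str, str]:
    """Answer from the index when confident, else defer to the fallback model."""
    answer, confidence = retrieve(question, index)
    if confidence >= threshold:
        return answer, "qa-pair retriever"
    return fallback(question), "fallback model"


def slow_model(question: str) -> str:
    """Placeholder for an expensive retrieve-and-read model."""
    return "(answer from the slower model)"


print(answer_with_backoff("Who wrote Hamlet?", QA_INDEX, slow_model))
print(answer_with_backoff("How tall is Mount Everest?", QA_INDEX, slow_model))
```

The speed gain in such a design comes from the fallback being called only for the questions the cheap retriever declines to answer.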

accuracy, qa-pair, repaq, (14 more...)

arXiv.org Artificial Intelligence

2102.07033

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Italy > Tuscany > Florence (0.04)
(13 more...)

Genre: Research Report (0.90)

Industry: Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
