AITopics

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.62)

Neural Information Processing Systems, Oct-7-2024, 07:51:46 GMT

Reviews: Chain of Reasoning for Visual Question Answering

Paper Summary: This paper presents a novel approach that performs a chain of reasoning at the object level to generate answers for visual question answering. Object-level visual embeddings are first extracted through object detection networks as the visual representation, and a sentence embedding of the question is extracted as the question representation. Based on these, a sequential model that performs multiple steps of relational inference over (compound) object embeddings, guided by the question, is used to obtain the final representation for each sub-chain inference. A concatenation of these embeddings is then used to perform answer classification. Extensive experiments have been conducted on four public datasets, and the approach achieves state-of-the-art performance on all of them.

clevr, reasoning, supplementary material

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.63)

Neural Information Processing Systems, Oct-7-2024, 05:02:37 GMT

Reviews: Learning to Specialize with Knowledge Distillation for Visual Question Answering

For example, one model might be specialized for 'what color is the umbrella?' and another for 'how many people are wearing glasses?' while at test time the question may be 'what color are the glasses?'. Specifically, they train independently ensembled base VQA models on the entire dataset, and then, while training using MCL, a subset of the models is trained using oracle assignments (as in usual MCL) while the rest are trained to imitate the base models' activations. Strengths -- The paper is very nicely written. It starts with a clear description of the problem, the observations made by the authors, and then the proposed solution -- positioning it appropriately with respect to prior work -- and then experiments. Given the small dataset, MCL and CMCL perform worse than independent ensembling, while MCL-KD performs better.
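The training scheme described in the review can be illustrated with a rough per-example loss. This is only a sketch under stated assumptions, not the authors' implementation: the function name `mcl_kd_loss`, the choice of `k`, and the use of a KL divergence between softened output distributions as the "imitation" term are all hypothetical, since the review only says that the non-selected models imitate the base models' activations.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, label):
    """Task loss of one model on one example."""
    return -math.log(softmax(logits)[label])

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def mcl_kd_loss(specialist_logits, base_logits, label, k=1, temperature=1.0):
    """Hypothetical per-example loss for an MCL-with-distillation setup.

    The k models with the lowest task loss (the oracle assignment) are
    trained on the label; every other model is pulled toward the output
    distribution of its corresponding base model (assumed imitation term).
    """
    task_losses = [cross_entropy(l, label) for l in specialist_logits]
    order = sorted(range(len(task_losses)), key=lambda i: task_losses[i])
    chosen = set(order[:k])
    total = 0.0
    for i, logits in enumerate(specialist_logits):
        if i in chosen:
            total += task_losses[i]
        else:
            teacher = softmax(base_logits[i], temperature)
            student = softmax(logits, temperature)
            total += kl_divergence(teacher, student)
    return total, chosen
```

With two models where the first already fits the label, only the first receives the task loss, and the second is penalized only to the extent that it drifts from its base model.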

dataset, knowledge distillation, specialize

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.78)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.43)

arXiv.org Artificial Intelligence, Oct-7-2024

Document-level Causal Relation Extraction with Knowledge-guided Binary Question Answering

Wang, Zimu, Xia, Lei, Wang, Wei, Du, Xinya

As an essential task in information extraction (IE), Event-Event Causal Relation Extraction (ECRE) aims to identify and classify the causal relationships between event mentions in natural language texts. However, existing research on ECRE has highlighted two critical challenges: the lack of document-level modeling and causal hallucinations. In this paper, we propose a Knowledge-guided binary Question Answering (KnowQA) method with event structures for ECRE, consisting of two stages: Event Structure Construction and Binary Question Answering. We conduct extensive experiments under both zero-shot and fine-tuning settings with large language models (LLMs) on the MECI and MAVEN-ERE datasets. Experimental results demonstrate the usefulness of event structures for document-level ECRE and the effectiveness of KnowQA, which achieves state-of-the-art performance on the MECI dataset. We observe not only the effectiveness but also the high generalizability and low inconsistency of our method, particularly with complete event structures after fine-tuning the models.

causal relationship, event structure, proceedings

2410.04752

Country:

Europe > Ukraine > Kyiv Oblast > Kyiv (0.04)
North America > United States > Texas (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Transportation > Infrastructure & Services (0.46)
Transportation > Air (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

arXiv.org Artificial Intelligence, Oct-7-2024

A Russian Jeopardy! Data Set for Question-Answering Systems

Mikhalkova, Elena

Question answering (QA) is one of the most common NLP tasks and relates to named entity recognition, fact extraction, semantic search and some other fields. In industry, it is much appreciated in chatbots and corporate information systems. It is also a challenging task that attracted the attention of a very general audience at the quiz show Jeopardy! In this article we describe a Jeopardy!-like Russian QA data set collected from the official Russian quiz database Chgk (che ge ka). The data set includes 379,284 quiz-like questions, of which 29,375 come from the Russian analogue of Jeopardy!, "Own Game". We examine its linguistic features and the related QA task. We conclude with the prospects for a QA competition based on the data set collected from this database.

jeopardy, russian jeopardy, tournament

2112.02325

Country:

Europe > Russia (0.14)
Asia > Russia > Ural Federal District > Tyumen Oblast > Tyumen (0.05)
Europe > United Kingdom > England (0.04)

Genre: Research Report (0.50)

Industry: Leisure & Entertainment > Games > Jeopardy! (0.84)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)

arXiv.org Artificial Intelligence, Oct-5-2024

Overview of Factify5WQA: Fact Verification through 5W Question-Answering

Suresh, Suryavardan, Rani, Anku, Patwa, Parth, Reganti, Aishwarya, Jain, Vinija, Chadha, Aman, Das, Amitava, Sheth, Amit, Ekbal, Asif

Researchers have found that fake news spreads many times faster than real news [1]. This is a major problem, especially in today's world where social media is the key source of news for many among the younger population. Fact verification thus becomes an important task, and many media sites contribute to the cause. Manual fact verification is a tedious task, given the volume of fake news online. The Factify5WQA shared task aims to increase research towards automated fake news detection by providing a dataset with an aspect-based fact verification method built on question answering. Each claim and its supporting document are associated with 5W questions that help compare the two information sources. Performance in the task is measured objectively by comparing answers using the BLEU score to assess answer accuracy, followed by an accuracy measure of the classification. The task had submissions using custom training setups and pre-trained language models, among others. The best-performing team posted an accuracy of 69.56%, which is a nearly 35% improvement over the baseline.

machine learning, natural language, question answering

2410.04236

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Washington > King County > Seattle (0.04)
North America > United States > South Carolina (0.04)

Genre: Research Report (0.65)

Industry: Media > News (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.61)

Roy, Subal Chandra, Manik, Md Motaleb Hossen

Question-Answering System for Bangla: Fine-tuning BERT-Bangla for a Closed Domain

arXiv.org Artificial Intelligence, Oct-4-2024

Question-answering systems for Bengali have seen limited development, particularly in domain-specific applications. Leveraging advancements in natural language processing, this paper explores a fine-tuned BERT-Bangla model to address this gap. It presents the development of a question-answering system for Bengali using a fine-tuned BERT-Bangla model in a closed domain. The dataset was sourced from Khulna University of Engineering & Technology's (KUET) website and other relevant texts. The system was trained and evaluated with 2500 question-answer pairs generated from curated data. Key metrics, including the Exact Match (EM) score and F1 score, were used for evaluation, achieving scores of 55.26% and 74.21%, respectively. The results demonstrate promising potential for domain-specific Bengali question-answering systems. Further refinements are needed to improve performance for more complex queries.
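The abstract does not say how EM and F1 were computed. A minimal sketch, assuming the definitions commonly used for extractive QA (exact string match after whitespace normalization, and F1 over overlapping tokens), might look like the following. The function names `exact_match` and `token_f1` are illustrative, and plain whitespace tokenization is an assumption that may not match the paper's preprocessing for Bengali text.

```python
from collections import Counter

def exact_match(prediction: str, reference: str) -> int:
    """Return 1 if the answers match after whitespace normalization, else 0."""
    return int(prediction.split() == reference.split())

def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer."""
    pred_tokens = prediction.split()
    ref_tokens = reference.split()
    # If either answer is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    common = Counter(pred_tokens) & Counter(ref_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under these definitions F1 gives partial credit for answers that overlap the reference without matching it exactly, which is consistent with an F1 score higher than the EM score.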

machine learning, natural language, question answering

2410.03923

Country: Asia > Bangladesh (0.05)

Genre: Research Report > New Finding (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hwang, Seonjeong, Kim, Yunsu, Lee, Gary Geunbae

Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages

arXiv.org Artificial Intelligence, Oct-4-2024

Automatic question generation (QG) serves a wide range of purposes, such as augmenting question-answering (QA) corpora, enhancing chatbot systems, and developing educational materials. Despite its importance, most existing datasets predominantly focus on English, resulting in a considerable gap in data availability for other languages. Cross-lingual transfer for QG (XLT-QG) addresses this limitation by allowing models trained on high-resource language datasets to generate questions in low-resource languages. In this paper, we propose a simple and efficient XLT-QG method that operates without the need for monolingual, parallel, or labeled data in the target language, utilizing a small language model. Our model, trained solely on English QA datasets, learns interrogative structures from a limited set of question exemplars, which are then applied to generate questions in the target language. Experimental results show that our method outperforms several XLT-QG baselines and achieves performance comparable to GPT-3.5-turbo across different languages. Additionally, the synthetic data generated by our model proves beneficial for training multilingual QA models. With significantly fewer parameters than large language models and without requiring additional training for target languages, our approach offers an effective solution for QG and QA tasks across various languages.

exemplar, question exemplar, target language

2410.03197

Country:

Africa > Zimbabwe (0.15)
Africa > Malawi (0.05)
Africa > South Africa (0.05)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.91)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.72)

arXiv.org Artificial Intelligence, Oct-4-2024

Educational Question Generation of Children Storybooks via Question Type Distribution Learning and Event-Centric Summarization

Zhao, Zhenjie, Hou, Yufang, Wang, Dakuo, Yu, Mo, Liu, Chengzhong, Ma, Xiaojuan

Generating educational questions for fairytales or storybooks is vital for improving children's literacy ability. However, it is challenging to generate educationally meaningful questions that capture the interesting aspects of a fairytale story. In this paper, we propose a novel question generation method that first learns the question type distribution of an input story paragraph, and then summarizes salient events which can be used to generate high-cognitive-demand questions. To train the event-centric summarizer, we finetune a pre-trained transformer-based sequence-to-sequence model using silver samples composed of educational question-answer pairs. On FairytaleQA, a newly proposed educational question-answering dataset, we show good performance of our method on both automatic and human evaluation metrics. Our work indicates the necessity of decomposing question type distribution learning and event-centric summary generation for educational question generation.

causal relationship, computational linguistic, paragraph

doi: 10.18653/v1/2022.acl-long.348

2203.14187

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Dominican Republic (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report (1.00)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Neural Information Processing Systems, Oct-2-2024, 15:56:04 GMT

High-Order Attention Models for Visual Question Answering

The quest for algorithms that enable cognitive abilities is an important part of machine learning. A common trait in many recently investigated cognitive-like tasks is that they take into account different data modalities, such as visual and textual input. In this paper we propose a novel and generally applicable form of attention mechanism that learns high-order correlations between various data modalities. We show that high-order correlations effectively direct the appropriate attention to the relevant elements in the different data modalities that are required to solve the joint task. We demonstrate the effectiveness of our high-order attention mechanism on the task of visual question answering (VQA), where we achieve state-of-the-art performance on the standard VQA dataset.

machine learning, natural language, question answering

Country:

North America > United States > Illinois (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Spain (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)