Question Answering


The New IBM Watson Assistant Is Available

#artificialintelligence

Currently, all actions created in the bot are included in the deployment version. I would like to be able to select specific actions and deploy only those, rather than every action in the bot. An orchestration layer for managing or combining different bots might also be helpful. Within a bot there will be various actions, and you will eventually reach a point where you want to use multiple bots simultaneously in one implementation without duplicating actions across them.


IBM Watson and the future of Artificial Intelligence by Procommun

#artificialintelligence

Buy Now Pay Later (BNPL) services have surged in popularity in recent years and are poised to become a mainstream mode of financing. Experts claim that demand for BNPL has been accelerating in India for the past three to four years, and COVID-19 has boosted it further. BNPL has now established itself as a more comfortable payment option, reducing borrowers' financial stress by providing no-cost EMIs. Uni Cards, which recently secured $18.5 million in financing, has launched its Uni Pay 1/3rd card. The product aims to enhance the customer experience in the credit card business.


Top 5 AI Based Question Generation Platforms in EdTech Industry

#artificialintelligence

Artificial intelligence and machine learning are fast becoming part of the educational sector. AI in the global education market is estimated to reach US$3.68 billion by 2023, growing at a CAGR of 47%. Teachers, students, and administrative teams will benefit from using AI in their institutions. Artificial intelligence will soon become a deciding factor in EdTech: educational institutions will need to prove themselves by adopting the latest AI technology and providing staff, teachers, and students with the best possible environment in which to work and learn.


FQuAD2.0: French Question Answering and knowing that you know nothing

arXiv.org Artificial Intelligence

Question Answering, including Reading Comprehension, is one of the NLP research areas that has seen significant scientific breakthroughs over the past few years, thanks to concomitant advances in Language Modeling. Most of these breakthroughs, however, are centered on the English language. In 2020, as a first strong initiative to bridge the gap for the French language, Illuin Technology introduced FQuAD1.1, a native French Reading Comprehension dataset composed of 60,000+ question-and-answer samples extracted from Wikipedia articles. Nonetheless, Question Answering models trained on this dataset have a major drawback: they are not able to predict when a given question has no answer in the paragraph of interest, and therefore make unreliable predictions in various industrial use cases. In the present work, we introduce FQuAD2.0, which extends FQuAD with 17,000+ unanswerable questions, annotated adversarially so as to resemble answerable ones. This new dataset, comprising a total of almost 80,000 questions, makes it possible to train French Question Answering models that can distinguish unanswerable questions from answerable ones. We benchmark several models with this dataset: our best model, a fine-tuned CamemBERT-large, achieves an F1 score of 82.3% on this classification task, and an F1 score of 83% on the Reading Comprehension task.
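For readers who want to see what such abstention looks like in practice, here is a minimal sketch using the Hugging Face question-answering pipeline, which supports an empty-string "no answer" prediction. The checkpoint name is a placeholder assumption, not necessarily the paper's released model.

```python
# Minimal sketch: SQuAD2-style abstention in a QA pipeline.
# Assumes a FQuAD2.0-style French checkpoint is published on the Hugging Face
# Hub; "illuin/camembert-large-fquad" is a placeholder name, not confirmed.
from transformers import pipeline

qa = pipeline(
    "question-answering",
    model="illuin/camembert-large-fquad",  # hypothetical checkpoint
)

result = qa(
    question="Qui a fondé Illuin Technology ?",
    context="FQuAD est un jeu de données français de compréhension de texte.",
    handle_impossible_answer=True,  # permit the empty-string "no answer" output
)

# With handle_impossible_answer=True, an unanswerable question can yield an
# empty answer string, i.e. the model abstains instead of guessing a span.
print(result)
```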


More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering

arXiv.org Artificial Intelligence

Textual Question Answering (QA) aims to provide precise answers to users' questions in natural language using unstructured data. One of the most popular approaches to this goal is machine reading comprehension (MRC). In recent years, many novel datasets and evaluation metrics based on classical MRC tasks have been proposed for broader textual QA tasks. In this paper, we survey 47 recent textual QA benchmark datasets and propose a new taxonomy from an application point of view. In addition, we summarize 8 evaluation metrics of textual QA tasks. Finally, we discuss current trends in constructing textual QA benchmarks and suggest directions for future work.
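Two of the metrics such surveys invariably cover are Exact Match (EM) and token-level F1, popularized by SQuAD-style evaluation. The sketch below shows simplified versions of both; the official evaluation scripts additionally strip articles and punctuation during normalization.

```python
# Simplified Exact Match (EM) and token-level F1 for textual QA evaluation.
from collections import Counter

def normalize(text: str) -> list[str]:
    # Naive normalization: lowercase and whitespace-tokenize.
    # Official SQuAD scripts also remove articles and punctuation.
    return text.lower().split()

def exact_match(prediction: str, reference: str) -> float:
    return float(normalize(prediction) == normalize(reference))

def f1_score(prediction: str, reference: str) -> float:
    pred_tokens, ref_tokens = normalize(prediction), normalize(reference)
    common = Counter(pred_tokens) & Counter(ref_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("the Eiffel Tower", "Eiffel Tower"))  # 0.0
print(f1_score("the Eiffel Tower", "Eiffel Tower"))     # 0.8
```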


Can Question Generation Debias Question Answering Models? A Case Study on Question-Context Lexical Overlap

arXiv.org Artificial Intelligence

Question answering (QA) models for reading comprehension have been demonstrated to exploit unintended dataset biases such as question-context lexical overlap. This hinders QA models from generalizing to under-represented samples such as questions with low lexical overlap. Question generation (QG), a method for augmenting QA datasets, could remedy this performance degradation if QG properly debiased QA datasets. However, we discover that recent neural QG models are biased towards generating questions with high lexical overlap, which can amplify the dataset bias. Moreover, our analysis reveals that data augmentation with these QG models frequently impairs performance on questions with low lexical overlap, while improving it on questions with high lexical overlap. To address this problem, we use a synonym replacement-based approach to augment questions with low lexical overlap. We demonstrate that the proposed data augmentation approach is simple yet effective in mitigating the degradation problem with only 70k synthetic examples. Our data is publicly available at https://github.com/KazutoshiShinoda/Synonym-Replacement.
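As a rough illustration of the idea (not the authors' exact pipeline, whose data lives at the repository above), the sketch below replaces question tokens that also appear in the context with WordNet synonyms, lowering the surface overlap between question and context. Tokenization and synonym selection are deliberately naive.

```python
# Illustrative synonym-replacement augmentation; not the paper's pipeline.
# Requires: pip install nltk, then nltk.download("wordnet") once.
import random
from nltk.corpus import wordnet

def lexical_overlap(question: str, context: str) -> float:
    # Fraction of question tokens that also appear in the context.
    q, c = set(question.lower().split()), set(context.lower().split())
    return len(q & c) / max(len(q), 1)

def replace_with_synonyms(question: str, context: str) -> str:
    # Swap question tokens that occur in the context for a WordNet synonym,
    # reducing question-context lexical overlap.
    context_tokens = set(context.lower().split())
    out = []
    for token in question.split():
        synsets = wordnet.synsets(token) if token.lower() in context_tokens else []
        lemmas = {l.name().replace("_", " ") for s in synsets for l in s.lemmas()}
        lemmas.discard(token)
        out.append(random.choice(sorted(lemmas)) if lemmas else token)
    return " ".join(out)

question = "What color is the mat"
context = "the worker stands on a black mat"
augmented = replace_with_synonyms(question, context)
print(augmented)
print(lexical_overlap(question, context), lexical_overlap(augmented, context))
```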


K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering

arXiv.org Artificial Intelligence

Knowledge-enhanced pre-trained language models (K-PLMs) are shown to be effective for many public tasks in the literature, but few of them have been successfully applied in practice. To address this problem, we propose K-AID, a systematic approach that includes a low-cost knowledge acquisition process for acquiring domain knowledge, an effective knowledge infusion module for improving model performance, and a knowledge distillation component for reducing model size and deploying K-PLMs on resource-restricted devices (e.g., CPU) for real-world applications. Importantly, instead of capturing entity knowledge like the majority of existing K-PLMs, our approach captures relational knowledge, which better improves the sentence-level text classification and text matching tasks that play a key role in question answering (QA). We conducted a set of experiments on five text classification tasks and three text matching tasks from three domains, namely E-commerce, Government, and Film&TV, and performed online A/B tests in E-commerce. Experimental results show that our approach achieves substantial improvement on sentence-level question answering tasks and brings real business value in industrial settings.
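The distillation component the abstract mentions typically follows the standard logit-distillation recipe. The sketch below is a generic PyTorch version of that recipe, not K-AID's actual implementation; the temperature and loss weighting are illustrative defaults.

```python
# Generic logit distillation (Hinton et al. style); not K-AID's exact code.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 2.0, alpha: float = 0.5):
    # Soft targets: the student mimics the teacher's tempered distribution.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2  # rescale so gradients match the hard-loss scale
    # Hard targets: ordinary cross-entropy on the gold labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# Usage with dummy logits for a 3-class sentence classification task:
s = torch.randn(4, 3)           # student outputs
t = torch.randn(4, 3)           # frozen teacher outputs
y = torch.tensor([0, 2, 1, 0])  # gold labels
print(distillation_loss(s, t, y))
```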


WebQA: Multihop and Multimodal QA

arXiv.org Artificial Intelligence

Web search is fundamentally multimodal and multihop. Often, even before asking a question, we go directly to image search to find our answers. Further, we rarely find an answer in a single source; instead, we aggregate information and reason through its implications. Despite the frequency of this everyday occurrence, there is at present no unified question answering benchmark that requires a single model to answer long-form natural language questions from text and open-ended visual sources, akin to a human's experience. We propose to bridge this gap between the natural language and computer vision communities with WebQA. We show that (a) our multihop text queries are difficult for a large-scale transformer model, and (b) existing multimodal transformers and visual representations do not perform well on open-domain visual queries. Our challenge to the community is to create a unified multimodal reasoning model that seamlessly transitions and reasons regardless of the source modality.


What Would it Take to get Biomedical QA Systems into Practice?

arXiv.org Artificial Intelligence

Medical question answering (QA) systems have the potential to resolve clinicians' uncertainties about treatment and diagnosis on demand, informed by the latest evidence. However, despite the significant progress in general QA made by the NLP community, medical QA systems are still not widely used in clinical environments. One likely reason is that clinicians may not readily trust QA system outputs, in part because transparency, trustworthiness, and provenance have not been key considerations in the design of such models. In this paper we discuss a set of criteria that, if met, we argue would likely increase the utility of biomedical QA systems, which may in turn lead to the adoption of such systems in practice. We assess existing models, tasks, and datasets with respect to these criteria, highlighting shortcomings of previously proposed approaches and pointing toward what more usable QA systems might look like.


World's most accurate visual question-answering AI

#artificialintelligence

Toshiba Corporation has developed the world's most accurate and highly versatile Visual Question Answering (VQA) AI, able to recognize not only people and objects, but also colors, shapes, appearances, and background details in images. The AI overcomes the long-standing difficulty of answering questions about the positioning and appearance of people and objects, and it can learn the information required to handle a wide range of questions and answers. It can be applied to a wide range of purposes without any need for customization. In experiments using a public dataset comprising a large volume of images and text data, the VQA AI correctly answered 66.25% of questions without any pre-learning and 74.57% with pre-learning. For example, the AI can confirm that a worker is standing in a designated place when asked a question like "Is the person on a black mat?", which requires recognizing the individual as well as position, shape, and color.
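Toshiba's model has not been released publicly, so as a generic illustration of the VQA task described here, the sketch below runs the article's example question through an open ViLT checkpoint from the Hugging Face Hub. The image path is a placeholder.

```python
# Generic VQA illustration with an open checkpoint; not Toshiba's model.
from transformers import pipeline

vqa = pipeline(
    "visual-question-answering",
    model="dandelin/vilt-b32-finetuned-vqa",
)

answers = vqa(
    image="factory_floor.jpg",  # placeholder path to a local image
    question="Is the person on a black mat?",
)
print(answers)  # e.g. [{"answer": "yes", "score": 0.9...}, ...]
```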