ContraQA: Question Answering under Contradicting Contexts
Pan, Liangming, Chen, Wenhu, Kan, Min-Yen, Wang, William Yang
arXiv.org Artificial Intelligence
With the rise of false, inaccurate, and misleading information in propaganda, news, and social media, real-world Question Answering (QA) systems face the challenge of synthesizing and reasoning over contradicting information to derive correct answers. This urgency gives rise to the need to make QA systems robust to misinformation, a topic previously unexplored. We study the risk of misinformation to QA models by investigating the behavior of QA models under contradicting contexts that mix real and fake information. We introduce ContraQA, which contains over 10K human-written and model-generated contradicting pairs of contexts. Experiments show that QA models are vulnerable under contradicting contexts brought by misinformation. To defend against such a threat, we build a misinformation-aware QA system as a countermeasure that integrates question answering and misinformation detection in a joint fashion.

A typical Question Answering (QA) system (Chen et al., 2017; Yang et al., 2019; Karpukhin et al., 2020; Lewis et al., 2020b) starts by retrieving a set of relevant context documents from the Web, which are then examined by a machine reader to identify the correct answer. Existing work treats Wikipedia as the web corpus and therefore assumes that all retrieved context documents are clean and trustworthy. However, real-world QA faces a much noisier environment, where the web corpus is tainted with misinformation.
Oct-14-2021
- Country:
- Asia > Singapore
- Central Region > Singapore (0.04)
- Europe
- Ireland (0.04)
- United Kingdom (0.04)
- North America
- Canada (0.04)
- United States
- California
- San Francisco County > San Francisco (0.06)
- Santa Barbara County > Santa Barbara (0.14)
- Santa Clara County > Santa Clara (0.14)
- Colorado > Denver County
- Denver (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- New York (0.04)
- Oceania > Australia (0.04)
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay (0.05)
- Genre:
- Research Report (0.64)
- Industry:
- Leisure & Entertainment > Sports
- Football (1.00)
- Media > News (1.00)
- Technology: