Goto

Collaborating Authors

 false assumption


Identifying and Answering Questions with False Assumptions: An Interpretable Approach

Wang, Zijie, Blanco, Eduardo

arXiv.org Artificial Intelligence

People often ask questions with false assumptions, a type of question that does not have regular answers. Answering such questions requires first identifying the false assumptions. Large Language Models (LLMs) often generate misleading answers to these questions because of hallucinations. In this paper, we focus on identifying and answering questions with false assumptions in several domains. We first investigate whether the problem reduces to fact verification. Then, we present an approach leveraging external evidence to mitigate hallucinations. Experiments with five LLMs demonstrate that (1) incorporating retrieved evidence is beneficial and (2) generating and validating atomic assumptions yields more improvements and provides an interpretable answer by pinpointing the false assumptions.


Syn-QA2: Evaluating False Assumptions in Long-tail Questions with Synthetic QA Datasets

Daswani, Ashwin, Sawant, Rohan, Kim, Najoung

arXiv.org Artificial Intelligence

Sensitivity to false assumptions (or false premises) in information-seeking questions is critical for robust question-answering (QA) systems. Recent work has shown that false assumptions in naturally occurring questions pose challenges to current models, with low performance on both generative QA and simple detection tasks (Kim et al. 2023). However, the focus of existing work on naturally occurring questions leads to a gap in the analysis of model behavior on the long tail of the distribution of possible questions. To this end, we introduce Syn-(QA)$^2$, a set of two synthetically generated QA datasets: one generated using perturbed relations from Wikidata, and the other by perturbing HotpotQA (Yang et al. 2018). Our findings from evaluating a range of large language models are threefold: (1) false assumptions in QA are challenging, echoing the findings of prior work, (2) the binary detection task is challenging even compared to the difficulty of generative QA itself, possibly due to the linguistic structure of the problem, and (3) the detection task is more challenging with long-tail questions compared to naturally occurring questions, highlighting the utility of our synthetic datasets and generation method.


Open-Domain Conversational Question Answering with Historical Answers

Fang, Hung-Chieh, Hung, Kuo-Han, Huang, Chao-Wei, Chen, Yun-Nung

arXiv.org Artificial Intelligence

Open-domain conversational question answering can be viewed as two tasks: passage retrieval and conversational question answering, where the former relies on selecting candidate passages from a large corpus and the latter requires better understanding of a question with contexts to predict the answers. This paper proposes ConvADR-QA that leverages historical answers to boost retrieval performance and further achieves better answering performance. In our proposed framework, the retrievers use a teacher-student framework to reduce noises from previous turns. Our experiments on the benchmark dataset, OR-QuAC, demonstrate that our model outperforms existing baselines in both extractive and generative reader settings, well justifying the effectiveness of historical answers for open-domain conversational question answering.


Valve denounces third-party gambling sites over Steam use

The Independent - Tech

Nasa has announced that it has found evidence of flowing water on Mars. Scientists have long speculated that Recurring Slope Lineae -- or dark patches -- on Mars were made up of briny water but the new findings prove that those patches are caused by liquid water, which it has established by finding hydrated salts. Several hundred camped outside the London store in Covent Garden. The 6s will have new features like a vastly improved camera and a pressure-sensitive "3D Touch" display


The Age of Intelligence « Kevin Alfred Strom

#artificialintelligence

TECH ENTREPRENEUR Elon Musk has been warning that the Age of the Robots is coming soon -- and it might not be pleasant for us. He may be right and he may be wrong on that, but one thing is sure: One robot certainly gave the anti-Whites a headache just this week. On Wednesday, tech giant Microsoft, the third largest corporation on Earth in terms of market value, launched and then immediately withdrew an Artificial Intelligence robot in the persona of a 19-year-old American girl called "Tay." Tay was a "chatbot," which interacted with real humans on the social media platform Twitter and was designed to learn from its interactions. Tay learned so fast that Microsoft pulled her offline in less than a single day.