AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web
Michael Schlichtkrull, Zhijiang Guo, Andreas Vlachos
arXiv.org Artificial Intelligence
Existing datasets for automated fact-checking have substantial limitations, such as relying on artificial claims, lacking annotations for evidence and intermediate reasoning, or including evidence published after the claim. In this paper we introduce AVeriTeC, a new dataset of 4,568 real-world claims covering fact-checks by 50 different organizations. Each claim is annotated with question-answer pairs supported by evidence available online, as well as textual justifications explaining how the evidence combines to produce a verdict. Through a multi-round annotation process, we avoid common pitfalls including context dependence, evidence insufficiency, and temporal leakage, and reach a substantial inter-annotator agreement of $\kappa=0.619$ on verdicts. We develop a baseline as well as an evaluation scheme for verifying claims through several question-answering steps against the open web.
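The annotation structure described in the abstract (a claim, question-answer pairs backed by online evidence, a textual justification, and a verdict) could be represented roughly as below. This is only an illustrative sketch: the class names, field names, the `leaking_evidence` helper, and the sample record are hypothetical and are not taken from the AVeriTeC release. The helper shows one simple way to flag temporal leakage, i.e. evidence dated after the claim.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional


@dataclass
class QAPair:
    question: str
    answer: str
    source_url: str                   # hypothetical field: where the evidence was found
    published: Optional[date] = None  # hypothetical field: evidence date, if known


@dataclass
class AnnotatedClaim:
    claim: str
    claim_date: date
    qa_pairs: List[QAPair] = field(default_factory=list)
    justification: str = ""
    verdict: str = ""  # free-text label; the abstract does not list the label set


def leaking_evidence(record: AnnotatedClaim) -> List[QAPair]:
    """Return the QA pairs whose evidence is dated after the claim itself."""
    return [
        qa for qa in record.qa_pairs
        if qa.published is not None and qa.published > record.claim_date
    ]


# Invented sample record, for illustration only.
example = AnnotatedClaim(
    claim="City X opened a new bridge in 2020.",
    claim_date=date(2021, 3, 1),
    qa_pairs=[
        QAPair("When did the bridge open?", "In June 2020.",
               "https://example.org/a", date(2020, 6, 15)),
        QAPair("Was the opening confirmed later?", "Yes.",
               "https://example.org/b", date(2022, 1, 10)),
    ],
    justification="The first answer confirms the opening date.",
    verdict="supported",
)

flagged = leaking_evidence(example)
```

Evidence with an unknown publication date is not flagged in this sketch; a stricter check might treat undated evidence as suspect instead.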
Nov-8-2023
- Country:
  - Africa (1.00)
  - Asia > Middle East (0.67)
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.14)
  - North America > United States
    - Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
  - Research Report > Experimental Study (0.67)
- Industry:
  - Energy > Oil & Gas (0.67)
  - Government
  - Health & Medicine > Epidemiology (0.68)
  - Law > Criminal Law (0.66)
  - Media > News (1.00)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning (1.00)
      - Natural Language
        - Information Retrieval (0.46)
        - Large Language Model (0.46)
        - Question Answering (0.67)
      - Representation & Reasoning (1.00)
    - Communications > Social Media (1.00)
    - Information Management > Search (1.00)