End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Models

Yao, Barry Menglong, Shah, Aditya, Sun, Lichao, Cho, Jin-Hee, Huang, Lifu

Jul-6-2023–arXiv.org Artificial Intelligence

We propose end-to-end multimodal fact-checking and explanation generation, where the input is a claim and a large collection of web sources, including articles, images, videos, and tweets, and the goal is to assess the truthfulness of the claim by retrieving relevant evidence and predicting a truthfulness label (e.g., support, refute or not enough information), and to generate a statement to summarize and explain the reasoning and ruling process. To support this research, we construct Mocheg, a large-scale dataset consisting of 15,601 claims where each claim is annotated with a truthfulness label and a ruling statement, and 33,880 textual paragraphs and 12,112 images in total as evidence. To establish baseline performances on Mocheg, we experiment with several state-of-the-art neural architectures on the three pipelined subtasks: multimodal evidence retrieval, claim verification, and explanation generation, and demonstrate that the performance of the state-of-the-art end-to-end multimodal fact-checking does not provide satisfactory outcomes. To the best of our knowledge, we are the first to build the benchmark dataset and solutions for end-to-end multimodal fact-checking and explanation generation. The dataset, source code and model checkpoints are available at https://github.com/VT-NLP/Mocheg.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jul-6-2023

arXiv.org PDF

Add feedback

Country:
- South America > Brazil (0.04)
- Oceania > Australia (0.04)
- North America > United States
  - Virginia (0.04)
  - Massachusetts (0.04)
  - Hawaii (0.04)
  - New York > New York County
    - New York City (0.04)
  - California > San Francisco County
    - San Francisco (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
- Asia
  - Afghanistan (0.14)
  - Pakistan (0.14)
  - Taiwan > Taiwan Province
    - Taipei (0.05)

Genre:
- Research Report (1.00)

Industry:
- Education (1.00)
- Media > News (0.96)
- Health & Medicine > Therapeutic Area (0.94)
- Information Technology (0.67)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Law (0.67)
- Government
  - Military (0.68)
  - Regional Government > North America Government
    - United States Government (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Expert Systems (1.00)
  - Natural Language > Explanation & Argumentation (1.00)
  - Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found