Surprising Efficacy of Fine-Tuned Transformers for Fact-Checking over Larger Language Models
arXiv.org Artificial Intelligence
In this paper, we explore the challenges associated with establishing an end-to-end fact-checking pipeline in a real-world context, covering over 90 languages. Our real-world experimental benchmarks demonstrate that fine-tuning Transformer models specifically for fact-checking tasks, such as claim detection and veracity prediction, provides superior performance over large language models (LLMs) like GPT-4, GPT-3.5-Turbo, and Mistral-7b. However, we illustrate that LLMs excel in generative tasks such as question decomposition for evidence retrieval. Through extensive evaluation, we show the efficacy of fine-tuned models for fact-checking in a multilingual setting and on complex claims that include numerical quantities.
Apr-30-2024
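The claim-detection task the abstract describes is a supervised classification problem: given a sentence, decide whether it is a check-worthy claim. The paper fine-tunes multilingual Transformer encoders for this; as a minimal, hedged sketch of the same setup, the toy stand-in below trains a bag-of-words logistic-regression classifier on a handful of hypothetical labeled examples (all data, names, and thresholds here are illustrative, not the authors' pipeline or results).

```python
# Illustrative sketch only: the paper fine-tunes multilingual Transformer
# encoders for claim detection; this tiny bag-of-words logistic regression
# stands in for the same binary setup (check-worthy claim vs. not).
import math
from collections import defaultdict

# Hypothetical toy data; the real pipeline uses labeled multilingual claims.
TRAIN = [
    ("the unemployment rate rose by 3 percent last year", 1),
    ("inflation hit 9.1 percent in june", 1),
    ("vaccines reduced hospitalizations by half", 1),
    ("good morning everyone", 0),
    ("i really enjoyed the concert", 0),
    ("what a lovely day it is", 0),
]

def featurize(text):
    """Lowercased token counts as sparse features."""
    counts = defaultdict(float)
    for tok in text.lower().split():
        counts[tok] += 1.0
    return counts

class ClaimDetector:
    """Logistic regression over bag-of-words features, trained by SGD."""

    def __init__(self, lr=0.5, epochs=200):
        self.w = defaultdict(float)  # per-token weights
        self.b = 0.0                 # bias term
        self.lr, self.epochs = lr, epochs

    def _prob(self, feats):
        z = self.b + sum(self.w[t] * c for t, c in feats.items())
        return 1.0 / (1.0 + math.exp(-z))  # sigmoid

    def fit(self, data):
        for _ in range(self.epochs):
            for text, y in data:
                feats = featurize(text)
                err = y - self._prob(feats)  # gradient of the log-loss
                self.b += self.lr * err
                for t, c in feats.items():
                    self.w[t] += self.lr * err * c

    def predict(self, text):
        return int(self._prob(featurize(text)) >= 0.5)

detector = ClaimDetector()
detector.fit(TRAIN)
print(detector.predict("gdp grew by 2 percent"))
```

A fine-tuned Transformer replaces the bag-of-words features and linear layer with contextual encodings, which is what lets it generalize across the 90+ languages and the numerical claims the paper evaluates.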