Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction
arXiv.org Artificial Intelligence
Fact-checking based on commercial LLMs has become mainstream. Although these methods offer high explainability, they fall short of traditional fine-tuning approaches in accuracy, and data security is also a significant concern. In this paper, we propose a self-instruction-based fine-tuning approach for fact-checking that balances accuracy and explainability. Our method consists of Data Augmentation and Improved DPO fine-tuning. The former first instructs the model to generate both positive and negative explanations from claim-evidence pairs and labels, then samples the dataset according to our customized difficulty standards. The latter employs our proposed improved DPO to fine-tune the model using the generated samples. We fine-tune the smallest-scale LLaMA-7B model and evaluate it on the challenging fact-checking datasets FEVEROUS and HOVER, using four fine-tuning methods and three few-shot learning methods for comparison. The experiments demonstrate that our approach not only achieves accuracy comparable to, or even surpassing, that of traditional fine-tuning methods, but also generates fluent explanation text. Moreover, it exhibits high generalization performance. Our method is the first to leverage self-supervised learning for fact-checking, and it innovatively combines contrastive learning and improved DPO in fine-tuning LLMs, as shown in the experiments.
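The two stages described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the improved DPO objective or the difficulty-based sampling, so the code below uses the standard DPO loss as a stand-in and omits sampling entirely. The names `PreferencePair`, `build_pair`, and `dpo_loss`, the prompt template, and the value `beta=0.1` are all hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class PreferencePair:
    """One preference-tuning example: a prompt with a preferred and a rejected answer."""
    prompt: str
    chosen: str    # positive explanation (consistent with the gold label)
    rejected: str  # negative explanation (inconsistent with the gold label)


def build_pair(claim: str, evidence: str,
               positive_expl: str, negative_expl: str) -> PreferencePair:
    # Hypothetical prompt template; the paper's actual instruction format is not given here.
    prompt = f"Claim: {claim}\nEvidence: {evidence}\nIs the claim supported? Explain."
    return PreferencePair(prompt=prompt, chosen=positive_expl, rejected=negative_expl)


def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for one pair (NOT the paper's improved variant).

    Inputs are summed token log-probabilities of each response under the
    policy being trained and under the frozen reference model.
    """
    margin = beta * ((policy_chosen_logp - ref_chosen_logp)
                     - (policy_rejected_logp - ref_rejected_logp))
    # -log(sigmoid(margin)) written as a numerically stable softplus(-margin).
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
```

When the policy and the reference model assign identical log-probabilities, the loss equals log 2. It falls as the policy raises the likelihood of the positive explanation relative to the negative one.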
May-23-2024
- Country:
  - Asia
    - Indonesia > Bali (0.04)
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
    - Middle East
      - Jordan (0.04)
      - UAE > Abu Dhabi Emirate
        - Abu Dhabi (0.04)
    - Russia (0.04)
    - Singapore (0.04)
    - Taiwan > Taiwan Province
      - Taipei (0.04)
  - Europe
    - Belgium > Brussels-Capital Region
      - Brussels (0.04)
    - France (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
    - Russia (0.04)
    - United Kingdom (0.14)
  - North America
    - Canada > Ontario
      - Toronto (0.04)
    - Dominican Republic (0.04)
    - United States
      - California > Orange County
        - Orange (0.04)
      - New York > New York County
        - New York City (0.04)
      - Virginia > Arlington County
        - Arlington (0.04)
- Genre:
  - Research Report > New Finding (0.68)
- Industry:
  - Government (1.00)
  - Information Technology > Security & Privacy (1.00)
- Technology: