FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
Chern, I-Chun, Chern, Steffi, Chen, Shiqi, Yuan, Weizhe, Feng, Kehua, Zhou, Chunting, He, Junxian, Neubig, Graham, Liu, Pengfei
–arXiv.org Artificial Intelligence
The emergence of generative pre-trained models has facilitated the synthesis of high-quality text, but it has also posed challenges in identifying factual errors in the generated text. In particular: (1) A wider range of tasks now face an increasing risk of containing factual errors when handled by generative models. (2) Generated texts tend to be lengthy and lack a clearly defined granularity for individual facts. (3) There is a scarcity of explicit evidence available during the process of fact checking. With the above challenges in mind, in this paper, we propose FacTool, a task and domain agnostic framework for detecting factual errors of texts generated by large language models (e.g., ChatGPT). Experiments on four different tasks (knowledge-based QA, code generation, mathematical reasoning, and scientific literature review) show the efficacy of the proposed method. We release the code of FacTool associated with ChatGPT plugin interface at https://github.com/GAIR-NLP/factool .
arXiv.org Artificial Intelligence
Jul-26-2023
- Country:
- Asia (0.67)
- Europe (0.67)
- North America > United States (1.00)
- Genre:
- Research Report
- Experimental Study (0.46)
- New Finding (0.48)
- Research Report
- Industry:
- Government > Regional Government
- Information Technology > Services (0.68)
- Law (1.00)
- Leisure & Entertainment > Sports (1.00)
- Technology: