Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Wang, Cunxiang, Liu, Xiaoze, Yue, Yuanhao, Tang, Xiangru, Zhang, Tianhang, Jiayang, Cheng, Yao, Yunzhi, Gao, Wenyang, Hu, Xuming, Qi, Zehan, Wang, Yidong, Yang, Linyi, Wang, Jindong, Xie, Xing, Zhang, Zheng, Zhang, Yue
–arXiv.org Artificial Intelligence
This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability that LLMs produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential consequences and challenges posed by factual errors in LLM outputs. Subsequently, we analyze the mechanisms through which LLMs store and process facts, seeking the primary causes of factual errors. Our discussion then transitions to methodologies for evaluating LLM factuality, emphasizing key metrics, benchmarks, and studies. We further explore strategies for enhancing LLM factuality, including approaches tailored for specific domains. We focus on two primary LLM configurations, standalone LLMs and Retrieval-Augmented LLMs that utilize external data, and detail their unique challenges and potential enhancements. Our survey offers a structured guide for researchers aiming to fortify the factual reliability of LLMs.
Dec-16-2023
- Country:
  - Oceania > Australia (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - Virginia (0.04)
      - Pennsylvania (0.04)
      - New Hampshire (0.04)
      - Washington > King County
        - Seattle (0.14)
      - Texas > Travis County
        - Austin (0.04)
      - New York > New York County
        - New York City (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - California
        - Santa Clara County > Palo Alto (0.04)
        - Los Angeles County > Long Beach (0.04)
    - Canada
      - Ontario > Toronto (0.04)
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.04)
  - Europe
    - Russia (0.14)
    - Austria > Vienna (0.14)
    - Germany > Berlin (0.04)
    - Switzerland (0.04)
    - Belgium > Flanders (0.04)
    - Romania > Sud - Muntenia Development Region
      - Giurgiu County > Giurgiu (0.04)
    - Italy
      - Tuscany > Florence (0.04)
      - Calabria > Catanzaro Province
        - Catanzaro (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - Ukraine > Kyiv Oblast
      - Kyiv (0.14)
    - France > Auvergne-Rhône-Alpes
    - Croatia > Dubrovnik-Neretva County
      - Dubrovnik (0.04)
    - Ireland > Leinster
      - County Dublin > Dublin (0.04)
  - Asia
    - Russia (0.14)
    - Myanmar > Tanintharyi Region
      - Dawei (0.04)
    - Middle East
      - Jordan (0.04)
      - UAE > Abu Dhabi Emirate
        - Abu Dhabi (0.04)
    - Japan > Kyūshū & Okinawa
      - Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
    - China
- Genre:
  - Overview (1.00)
  - Research Report
    - New Finding (1.00)
    - Promising Solution (0.92)
    - Experimental Study (0.67)
- Industry:
  - Media (0.67)
  - Leisure & Entertainment (0.67)
  - Law > Criminal Law (0.45)
  - Education > Educational Setting (0.45)
  - Health & Medicine
    - Pharmaceuticals & Biotechnology (0.67)
    - Diagnostic Medicine (0.67)
  - Government > Regional Government
- Technology: