Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Wang, Cunxiang; Liu, Xiaoze; Yue, Yuanhao; Tang, Xiangru; Zhang, Tianhang; Jiayang, Cheng; Yao, Yunzhi; Gao, Wenyang; Hu, Xuming; Qi, Zehan; Wang, Yidong; Yang, Linyi; Wang, Jindong; Xie, Xing; Zhang, Zheng; Zhang, Yue

Computer Science > Computation and Language

arXiv:2310.07521 (cs)

[Submitted on 11 Oct 2023 (v1), last revised 16 Dec 2023 (this version, v3)]

Title:Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Authors:Cunxiang Wang, Xiaoze Liu, Yuanhao Yue, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Wenyang Gao, Xuming Hu, Zehan Qi, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang

View PDF HTML (experimental)

Abstract:This survey addresses the crucial issue of factuality in Large Language Models (LLMs). As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability of LLMs to produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential consequences and challenges posed by factual errors in LLM outputs. Subsequently, we analyze the mechanisms through which LLMs store and process facts, seeking the primary causes of factual errors. Our discussion then transitions to methodologies for evaluating LLM factuality, emphasizing key metrics, benchmarks, and studies. We further explore strategies for enhancing LLM factuality, including approaches tailored for specific domains. We focus two primary LLM configurations standalone LLMs and Retrieval-Augmented LLMs that utilizes external data, we detail their unique challenges and potential enhancements. Our survey offers a structured guide for researchers aiming to fortify the factual reliability of LLMs.

Comments:	62 pages; 300+ references
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.07521 [cs.CL]
	(or arXiv:2310.07521v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.07521

Submission history

From: Cunxiang Wang [view email]
[v1] Wed, 11 Oct 2023 14:18:03 UTC (182 KB)
[v2] Wed, 18 Oct 2023 14:09:19 UTC (187 KB)
[v3] Sat, 16 Dec 2023 12:47:19 UTC (808 KB)

Computer Science > Computation and Language

Title:Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators