Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Zichao Lin; Shuyan Guan; Wending Zhang; Huiyan Zhang; Yugang Li; Huaping Zhang

doi:10.1007/s10462-024-10896-y

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

Zichao Lin, Shuyan Guan, Wending Zhang, Huiyan Zhang, Yugang Li, Huaping Zhang^*

^*此作品的通讯作者

计算机学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

16 引用（Scopus）

摘要

Recently, large language models (LLMs) have attracted considerable attention due to their remarkable capabilities. However, LLMs’ generation of biased or hallucinatory content raised significant concerns, posing major challenges for their practical application. Many studies have dedicated efforts to address these critical issues, adopting various approaches to mitigate bias and hallucinations in LLM-generated content. Remarkably, no review papers have synthesized insights on these two primary problems. Addressing this gap, this paper aims to conduct a simultaneous and dual-focused review of the current landscape of research. The discussions encompass widely used and newly proposed benchmarks and evaluation methods on bias and hallucination in LLMs. This paper also investigates advanced mitigation methods and present a taxonomy based on different mitigation strategies. Moreover, a comparative analysis of the sources, mitigation methods, and evaluation methods for bias and hallucination is included. In the end, this paper provides a synthesis of current research trends and suggests potential directions for future research to address bias and hallucination in LLMs, considering the ongoing challenges in this field.

源语言	英语
文章编号	243
期刊	Artificial Intelligence Review
卷	57
期	9
DOI	https://doi.org/10.1007/s10462-024-10896-y
出版状态	已出版 - 9月 2024

访问文件

10.1007/s10462-024-10896-y

其它文件与链接

链接到 Scopus 的出版物

引用此

Lin, Z., Guan, S., Zhang, W., Zhang, H., Li, Y., & Zhang, H. (2024). Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models. Artificial Intelligence Review, 57(9), 文章 243. https://doi.org/10.1007/s10462-024-10896-y

@article{4eff25f0652e4556b3d94869881208e5,

title = "Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models",

abstract = "Recently, large language models (LLMs) have attracted considerable attention due to their remarkable capabilities. However, LLMs{\textquoteright} generation of biased or hallucinatory content raised significant concerns, posing major challenges for their practical application. Many studies have dedicated efforts to address these critical issues, adopting various approaches to mitigate bias and hallucinations in LLM-generated content. Remarkably, no review papers have synthesized insights on these two primary problems. Addressing this gap, this paper aims to conduct a simultaneous and dual-focused review of the current landscape of research. The discussions encompass widely used and newly proposed benchmarks and evaluation methods on bias and hallucination in LLMs. This paper also investigates advanced mitigation methods and present a taxonomy based on different mitigation strategies. Moreover, a comparative analysis of the sources, mitigation methods, and evaluation methods for bias and hallucination is included. In the end, this paper provides a synthesis of current research trends and suggests potential directions for future research to address bias and hallucination in LLMs, considering the ongoing challenges in this field.",

keywords = "Debias, Hallucination, Large language models, Survey",

author = "Zichao Lin and Shuyan Guan and Wending Zhang and Huiyan Zhang and Yugang Li and Huaping Zhang",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",

year = "2024",

month = sep,

doi = "10.1007/s10462-024-10896-y",

language = "English",

volume = "57",

journal = "Artificial Intelligence Review",

issn = "0269-2821",

publisher = "Springer Netherlands",

number = "9",

}

TY - JOUR

T1 - Towards trustworthy LLMs

T2 - a review on debiasing and dehallucinating in large language models

AU - Lin, Zichao

AU - Guan, Shuyan

AU - Zhang, Wending

AU - Zhang, Huiyan

AU - Li, Yugang

AU - Zhang, Huaping

PY - 2024/9

Y1 - 2024/9

N2 - Recently, large language models (LLMs) have attracted considerable attention due to their remarkable capabilities. However, LLMs’ generation of biased or hallucinatory content raised significant concerns, posing major challenges for their practical application. Many studies have dedicated efforts to address these critical issues, adopting various approaches to mitigate bias and hallucinations in LLM-generated content. Remarkably, no review papers have synthesized insights on these two primary problems. Addressing this gap, this paper aims to conduct a simultaneous and dual-focused review of the current landscape of research. The discussions encompass widely used and newly proposed benchmarks and evaluation methods on bias and hallucination in LLMs. This paper also investigates advanced mitigation methods and present a taxonomy based on different mitigation strategies. Moreover, a comparative analysis of the sources, mitigation methods, and evaluation methods for bias and hallucination is included. In the end, this paper provides a synthesis of current research trends and suggests potential directions for future research to address bias and hallucination in LLMs, considering the ongoing challenges in this field.

AB - Recently, large language models (LLMs) have attracted considerable attention due to their remarkable capabilities. However, LLMs’ generation of biased or hallucinatory content raised significant concerns, posing major challenges for their practical application. Many studies have dedicated efforts to address these critical issues, adopting various approaches to mitigate bias and hallucinations in LLM-generated content. Remarkably, no review papers have synthesized insights on these two primary problems. Addressing this gap, this paper aims to conduct a simultaneous and dual-focused review of the current landscape of research. The discussions encompass widely used and newly proposed benchmarks and evaluation methods on bias and hallucination in LLMs. This paper also investigates advanced mitigation methods and present a taxonomy based on different mitigation strategies. Moreover, a comparative analysis of the sources, mitigation methods, and evaluation methods for bias and hallucination is included. In the end, this paper provides a synthesis of current research trends and suggests potential directions for future research to address bias and hallucination in LLMs, considering the ongoing challenges in this field.

KW - Debias

KW - Hallucination

KW - Large language models

KW - Survey

UR - http://www.scopus.com/inward/record.url?scp=85201219661&partnerID=8YFLogxK

U2 - 10.1007/s10462-024-10896-y

DO - 10.1007/s10462-024-10896-y

M3 - Article

AN - SCOPUS:85201219661

SN - 0269-2821

VL - 57

JO - Artificial Intelligence Review

JF - Artificial Intelligence Review

IS - 9

M1 - 243

ER -

Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models

摘要

访问文件

其它文件与链接

指纹

引用此