Skip to main navigation Skip to search Skip to main content

Do Retrieval Augmented Language Models Know When They Don’t Know?

  • Youchao Zhou
  • , Heyan Huang*
  • , Yicheng Liu
  • , Rui Dai
  • , Xinglin Wang
  • , Xingchen Zhang
  • , Shumin Shi
  • , Yang Deng
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Singapore Management University

Research output: Contribution to journalConference articlepeer-review

Abstract

Existing large language models (LLMs) occasionally generate plausible yet factually incorrect responses, known as hallucinations. Two main approaches have been proposed to mitigate hallucinations: retrieval-augmented language models (RALMs) and refusal post-training. However, current research predominantly focuses on their individual effectiveness while overlooking the evaluation of the refusal capability of RALMs. Ideally, if RALMs know when they do not know, they should refuse to answer. In this study, we ask the fundamental question: Do RALMs know when they don’t know? Specifically, we investigate three questions. First, are RALMs well calibrated with respect to different internal and external knowledge states?We examine the influence of various factors. Contrary to expectations, when all retrieved documents are irrelevant, RALMs still tend to refuse questions they could have answered correctly. Next, given the model’s pronounced over-refusal behavior, we raise a second question: How does a RALM’s refusal ability align with its calibration quality? Our results show that the over-refusal problem can be mitigated through in-context fine-tuning. However, we observe that improved refusal behavior does not necessarily imply better calibration or higher overall accuracy. Finally, we ask: Can we combine refusal-aware RALMs with uncertainty-based answer abstention to mitigate overrefusal? We develop a simple yet effective refusal mechanism for refusal-post-trained RALMs that improves their overall answer quality by balancing refusal and correct answers. Our study provides a more comprehensive understanding of the factors influencing RALM behavior. Meanwhile, we emphasize that uncertainty estimation for RALMs remains an open problem deserving deeper investigation.

Original languageEnglish
Pages (from-to)35158-35166
Number of pages9
JournalProceedings of the AAAI Conference on Artificial Intelligence
Volume40
Issue number41
DOIs
Publication statusPublished - 2026
Event40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore
Duration: 20 Jan 202627 Jan 2026

Fingerprint

Dive into the research topics of 'Do Retrieval Augmented Language Models Know When They Don’t Know?'. Together they form a unique fingerprint.

Cite this