Skip to main navigation Skip to search Skip to main content

FlowRAG: Continual Learning for Dynamic Retriever in Retrieval-Augmented Generation

  • Senlei Zhang*
  • , Tongjun Shi
  • , Dandan Song
  • , Luan Zhang
  • , Shuhao Zhang*
  • , Xiaofei Liao
  • , Hai Jin
  • *Corresponding author for this work
  • Huazhong University of Science and Technology
  • National Engineering Research Center for Big Data Technology and System
  • Futu Holdings Limited
  • Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by leveraging external knowledge, where retrieval accuracy directly affects generation quality. However, dense retrievers, commonly employed in RAG, suffer degraded performance in evolving corpora where new documents arrive continuously and distribution shifts accumulate over time. In such settings, continually updating retrievers is crucial, yet conventional retraining is computationally expensive and often impractical. To address this challenge, we propose FlowRAG, a lightweight and effective method for continual retriever adaptation in evolving corpora. FlowRAG augments the encoder with Layer-wise Prompt Embeddings and introduces a Cross-Layer Fusion mechanism to capture hierarchical semantic representations. In addition, a novel Generator-Guided Loss aligns retriever scores and intermediate representations with the LLM's generation likelihoods, encouraging retrieval decisions that are both semantically relevant and beneficial for generation. Experiments on datasets spanning four domains demonstrate that FlowRAG, which updates only about 0.64% of the total model parameters, consistently outperforms strong baselines in retrieval accuracy, generation quality, and robustness to forgetting in non-stationary settings.

Original languageEnglish
Title of host publicationWWW 2026 - Proceedings of the ACM Web Conference 2026
PublisherAssociation for Computing Machinery, Inc
Pages2160-2170
Number of pages11
ISBN (Electronic)9798400723070
DOIs
Publication statusPublished - 12 Apr 2026
Event35th ACM Web Conference, WWW 2026 - Dubai, United Arab Emirates
Duration: 29 Jun 20263 Jul 2026

Publication series

NameWWW 2026 - Proceedings of the ACM Web Conference 2026

Conference

Conference35th ACM Web Conference, WWW 2026
Country/TerritoryUnited Arab Emirates
CityDubai
Period29/06/263/07/26

Keywords

  • continual learning
  • large language model
  • retrieval-augmented generation

Fingerprint

Dive into the research topics of 'FlowRAG: Continual Learning for Dynamic Retriever in Retrieval-Augmented Generation'. Together they form a unique fingerprint.

Cite this