从 RAG 到 SAGE: 现状与展望

Translated title of the contribution: From Retrieval-augmented Generation to SAGE: The State of the Art and Prospects

Yong Lin Tian, Yu Tong Wang, Xing Xia Wang, Jing Yang, Tian Yu Shen, Jian Gong Wang, Li Li Fan, Chao Guo, Shou Wen Wang, Yong Zhao, Wan Sen Wu, Fei Yue Wang*

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract

The emergence of large model technologies has significantly enhanced the efficiency with which humans acquire and utilize knowledge. However, in practical applications, they still confront challenges such as constrained knowledge, transfer obstacles, and hallucinations, which impede the construction of trustworthy and reliable artificial intelligence systems. Retrieval-augmented generation (RAG), by leveraging external knowledge bases and query-related retrieval, has effectively strengthened capability of large models and offers strong support for large models to master real-time, industry-specific, and private knowledge, thereby facilitating the rapid promotion and implementation of large model technologies across diverse scenarios. This paper focuses on RAG, detailing its basic principles, current development status, as well as exemplary applications, and analyzing its advantages and the challenges it faces. Based on RAG, we propose the extended framework of search-augmented generation and extension by incorporating the search module and multi-level cache management module, aiming to create a more flexible and efficient knowledge toolchain for large models.

Translated title of the contributionFrom Retrieval-augmented Generation to SAGE: The State of the Art and Prospects
Original languageChinese (Traditional)
Pages (from-to)1145-1169
Number of pages25
JournalZidonghua Xuebao/Acta Automatica Sinica
Volume51
Issue number6
DOIs
Publication statusPublished - Jun 2025
Externally publishedYes

Fingerprint

Dive into the research topics of 'From Retrieval-augmented Generation to SAGE: The State of the Art and Prospects'. Together they form a unique fingerprint.

Cite this