
Dynamic token halting for efficient abstractive summarization with importance-aware regularization

  • Heyan Huang
  • Yu Bai
  • Yang Gao*
  • Minpeng Liao
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications
  • Southeast Academy of Information Technology

Research output: Contribution to journal, Article, peer-reviewed

Abstract

Text summarization involves distilling key information from a lengthy document and presenting it as a concise summary. Unlike extractive summarization, which directly selects phrases or sentences from the source text, abstractive summarization requires comprehending the entire document and generating a summary word by word. Current state-of-the-art abstractive summarization systems rely on large pretrained models, which often suffer from inefficiencies caused by overprocessing irrelevant information. A significant portion of unimportant data is unnecessarily encoded, leading to excessive computational costs. In this paper, we address these inefficiencies by introducing a method that incrementally discards redundant hidden states throughout the encoding process, achieved by leveraging the Adaptive Computation Time (ACT) mechanism. Additionally, we propose a novel Importance-aware Prior Regularization technique that helps the model identify and prioritize crucial parts of the document, ensuring they are processed more thoroughly in deeper layers of the encoder. Our approach reduces the computational demands of pretrained encoders by 25%–35% in terms of floating-point operations (FLOPs) while maintaining performance levels comparable to strong baseline models. Extensive experiments demonstrate that our method is particularly effective at reducing computational costs for longer documents. The code and data will be made publicly available upon acceptance of this paper.
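The abstract describes discarding hidden states layer by layer using an ACT-style halting mechanism, but does not give the exact rule. The following is only a minimal sketch of that general idea under assumptions of ours, not the authors' implementation: each token accumulates a halting probability after every layer and stops being processed once the running sum reaches `1 - eps`. The names `encode_with_halting`, `flops_saved`, `halt_w`, `halt_b`, and `eps` are hypothetical, the "layers" are toy per-token `tanh` projections rather than Transformer blocks, and the Importance-aware Prior Regularization is not modeled.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def encode_with_halting(hidden, layer_weights, halt_w, halt_b, eps=0.01):
    """Toy ACT-style token halting (illustrative assumption, not the paper's code).

    hidden:        (n_tokens, d) initial token states
    layer_weights: list of (d, d) matrices, one per toy encoder layer
    halt_w, halt_b: parameters of a hypothetical per-token halting unit
    Returns the final states and the number of layers applied to each token.
    """
    hidden = np.array(hidden, dtype=float)  # work on a float copy
    n_tokens = hidden.shape[0]
    cum_halt = np.zeros(n_tokens)            # accumulated halting probability
    active = np.ones(n_tokens, dtype=bool)   # tokens still being processed
    layers_used = np.zeros(n_tokens, dtype=int)

    for W in layer_weights:
        if not active.any():
            break
        # Only active tokens are updated; halted tokens keep their last state.
        hidden[active] = np.tanh(hidden[active] @ W)
        layers_used[active] += 1
        # Each active token emits a halting probability for this layer.
        cum_halt[active] += sigmoid(hidden[active] @ halt_w + halt_b)
        # A token halts once its accumulated probability reaches 1 - eps.
        active &= cum_halt < 1.0 - eps
    return hidden, layers_used


def flops_saved(layers_used, n_layers):
    """Fraction of token-layer updates skipped (a crude proxy for FLOPs)."""
    return 1.0 - layers_used.sum() / (layers_used.size * n_layers)


rng = np.random.default_rng(0)
tokens = rng.normal(size=(8, 4))
weights = [rng.normal(size=(4, 4)) for _ in range(6)]
_, used = encode_with_halting(tokens, weights, rng.normal(size=4), 0.0)
print("layers per token:", used, "skipped fraction:", flops_saved(used, 6))
```

In this toy setting the saving comes entirely from skipping per-token updates. In a real Transformer encoder, halted tokens would also shrink the attention computation, which may be why longer documents are reported to benefit more.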

Original language: English
Article number: 116105
Journal: Knowledge-Based Systems
Volume: 346
DOI
Publication status: Published - 8 Jul 2026
Externally published
