Selecting the best rather than ranking correctly: A multi-metrics ranker for summarization

Jianfei Zhao, Feng Zhang, Xin Sun*, Chong Feng*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Abstractive summarization models generally face the challenge of underutilizing the search space, meaning that a larger beam size tends to decrease the model's performance. Most works address this issue by proposing novel models with improved results, but the potential of existing models is always overlooked. In this work, we aim to enhance established models by proposing a ranking model that selects the best candidate in the search paths to improve performance directly. Ranking candidates is a wildly used method for performance improvement, but the primary goal is to identify the best candidate rather than achieve perfect ranking accuracy. To achieve this, we adjust the ranking granularity based on candidate similarity and distribute ranking margins according to evaluated scores. This method aligns the ranking objective more closely with the primary goal and reduces CUDA memory usage during training by 30%. Furthermore, we design our model as a Mixture-of-Experts system, where each expert specializes in a specific criterion, enabling the provision of diverse ranking services. We evaluate our method on three wildly used datasets: CNN/DM, XSUM, and NYT. Experimental results demonstrate that our method achieves higher Top-1 accuracy compared to other ranking models and effectively enhances existing summarization models. Further analyses demonstrate that our method possesses strong generalization capabilities, allowing it to perform ranking tasks on unlearned metrics and untrained datasets.

Original languageEnglish
Article number127144
JournalExpert Systems with Applications
Volume276
DOIs
Publication statusPublished - 1 Jun 2025

Keywords

  • Abstractive summarization
  • Contrastive learning
  • Mixture of experts
  • Ranking model

Fingerprint

Dive into the research topics of 'Selecting the best rather than ranking correctly: A multi-metrics ranker for summarization'. Together they form a unique fingerprint.

Cite this