跳到主要导航 跳到搜索 跳到主要内容

BRIEF: Bi-level Coreset Selection for Efficient Instruction Tuning in LLMs

  • Beijing Institute of Technology
  • University of Arizona
  • Massachusetts Institute of Technology

科研成果: 期刊稿件会议文章同行评审

摘要

Instruction tuning is a key step in adapting large language models (LLMs) to effectively understand and follow human instructions. It enables LLMs to transform general knowledge into task-specific responses that align with user intent. Although many high-quality instruction tuning datasets have been released, efficiently utilizing these data sources during supervised fine-tuning (SFT) is important, as training on the full high-quality corpus can be computationally expensive. To address this inefficiency, we explore whether a compact, high-quality subset of instruction data can achieve comparable performance to full-dataset SFT, thereby reducing training cost without sacrificing effectiveness. To this end, this work proposes to select such a subset (a.k.a., coreset) of instruction examples that maintains comparable downstream performance while improving training efficiency. The key idea is inspired by our discovered decomposition that in instruction tuning, the training loss can be decomposed into two components that effectively quantify the contribution of an instruction to the two fundamental capabilities of LLMs, namely knowledge-related capability and instruction following capability. We then revisit the objective of the classical coreset approaches to balance the two capabilities when selecting instruction examples. Based on a bi-level formulation and a composite gradient distance that makes the objective submodular, we design an effective algorithm to achieve a bounded approximation error. Experiments on 4 datasets across 9 downstream tasks demonstrate that BRIEF reduces computational costs by 3× while improving accuracy by 5% on Llama-3.1-8B, Qwen3-4B and Mistral-Nemo-12B.

源语言英语
页(从-至)1264-1277
页数14
期刊Proceedings of the VLDB Endowment
19
6
DOI
出版状态已出版 - 2026
活动52nd International Conference on Very Large Data Bases, VLDB 2026 - Boston, 美国
期限: 31 8月 20264 9月 2026

指纹

探究 'BRIEF: Bi-level Coreset Selection for Efficient Instruction Tuning in LLMs' 的科研主题。它们共同构成独一无二的指纹。

引用此