
BRIEF: Bi-level Coreset Selection for Efficient Instruction Tuning in LLMs

  • Beijing Institute of Technology
  • University of Arizona
  • Massachusetts Institute of Technology

Research output: Contribution to journal › Conference article › peer-review

Abstract

Instruction tuning is a key step in adapting large language models (LLMs) to understand and follow human instructions. It enables LLMs to transform general knowledge into task-specific responses that align with user intent. Although many high-quality instruction tuning datasets have been released, using them efficiently during supervised fine-tuning (SFT) is important, as training on the full high-quality corpus can be computationally expensive. To address this inefficiency, we explore whether a compact, high-quality subset of instruction data can achieve performance comparable to full-dataset SFT, thereby reducing training cost without sacrificing effectiveness. To this end, this work proposes to select such a subset (a.k.a. a coreset) of instruction examples that maintains comparable downstream performance while improving training efficiency. The key idea builds on a decomposition we identify: in instruction tuning, the training loss can be decomposed into two components that quantify the contribution of an instruction to the two fundamental capabilities of LLMs, namely knowledge-related capability and instruction-following capability. We then revisit the objective of classical coreset approaches to balance the two capabilities when selecting instruction examples. Based on a bi-level formulation and a composite gradient distance that makes the objective submodular, we design an effective algorithm that achieves a bounded approximation error. Experiments on 4 datasets across 9 downstream tasks demonstrate that BRIEF reduces computational costs by 3× while improving accuracy by 5% on Llama-3.1-8B, Qwen3-4B and Mistral-Nemo-12B.
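
For readers unfamiliar with the classical coreset objective the abstract refers to, the following is a minimal sketch of greedy facility-location selection over per-example gradient embeddings. It is not BRIEF's bi-level algorithm. The names `g_know`, `g_instr`, `alpha`, `pairwise_dist`, and `greedy_coreset` are hypothetical, and the weighted sum of two Euclidean distances is only an assumed stand-in for a composite gradient distance.

```python
import numpy as np

def pairwise_dist(g):
    """Euclidean distance between every pair of rows of g (shape n x d)."""
    diff = g[:, None, :] - g[None, :, :]
    return np.linalg.norm(diff, axis=-1)

def greedy_coreset(g_know, g_instr, k, alpha=0.5):
    """Greedily pick k examples that maximize a facility-location objective.

    g_know, g_instr: hypothetical per-example gradient embeddings for the
    knowledge-related and instruction-following loss components.
    alpha: assumed weight that mixes the two distances.
    """
    # Assumed composite distance: a convex combination of the two distances.
    dist = alpha * pairwise_dist(g_know) + (1 - alpha) * pairwise_dist(g_instr)
    # Turn distances into non-negative similarities.
    sim = dist.max() - dist
    n = sim.shape[0]
    chosen = np.zeros(n, dtype=bool)
    best = np.zeros(n)  # best similarity of each example to the selected set
    selected = []
    for _ in range(min(k, n)):
        # Marginal gain of adding candidate j: improvement in total coverage.
        gains = np.maximum(sim, best[None, :]).sum(axis=1) - best.sum()
        gains[chosen] = -np.inf  # never re-select an example
        j = int(np.argmax(gains))
        selected.append(j)
        chosen[j] = True
        best = np.maximum(best, sim[j])
    return selected

# Usage on random stand-in embeddings (illustrative data only).
rng = np.random.default_rng(0)
subset = greedy_coreset(rng.normal(size=(20, 8)), rng.normal(size=(20, 8)), k=5)
print(subset)
```

Because the facility-location objective with non-negative similarities is monotone and submodular, this greedy loop carries the standard (1 - 1/e) approximation guarantee; the bound the paper establishes for its own objective may differ.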

Original language: English
Pages (from-to): 1264-1277
Number of pages: 14
Journal: Proceedings of the VLDB Endowment
Volume: 19
Issue number: 6
DOIs
Publication status: Published - 2026
Event: 52nd International Conference on Very Large Data Bases, VLDB 2026 - Boston, United States
Duration: 31 Aug 2026 to 4 Sept 2026
