L2T-DFM: Learning to Teach with Dynamic Fused Metric

Zhaoyang Hai; Liyuan Pan; Xiabi Liu; Mengqiao Han

doi:10.1016/j.patcog.2024.111124

L2T-DFM: Learning to Teach with Dynamic Fused Metric

Zhaoyang Hai, Liyuan Pan, Xiabi Liu^*, Mengqiao Han

^*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

The loss function plays a crucial role in the construction of machine learning algorithms. Employing a teacher model to set loss functions dynamically for student models has attracted attention. In existing works, (1) the characterization of the dynamic loss suffers from some inherent limitations, ie, the computational cost of loss networks and the restricted similarity measurement handcrafted loss functions; and (2) the states of the student model are provided to the teacher model directly without integration, causing the teacher model to underperform when trained on insufficient amounts of data. To alleviate the above-mentioned issues, in this paper, we select and weigh a set of similarity metrics by a confidence-based selection algorithm and a temporal teacher model to enhance the dynamic loss functions. Subsequently, to integrate the states of the student model, we employ statistics to quantify the information loss of the student model. Extensive experiments demonstrate that our approach can enhance student learning and improve the performance of various deep models on real-world tasks, including classification, object detection, and semantic segmentation scenarios.

Original language	English
Article number	111124
Journal	Pattern Recognition
Volume	159
DOIs	https://doi.org/10.1016/j.patcog.2024.111124
Publication status	Published - Mar 2025

Keywords

Dynamic loss function
Learning to teach
Optimization

Access to Document

10.1016/j.patcog.2024.111124

Cite this

Hai, Z., Pan, L., Liu, X., & Han, M. (2025). L2T-DFM: Learning to Teach with Dynamic Fused Metric. Pattern Recognition, 159, Article 111124. https://doi.org/10.1016/j.patcog.2024.111124

@article{b2c2651113044fda9deace320a4fdaa0,

title = "L2T-DFM: Learning to Teach with Dynamic Fused Metric",

abstract = "The loss function plays a crucial role in the construction of machine learning algorithms. Employing a teacher model to set loss functions dynamically for student models has attracted attention. In existing works, (1) the characterization of the dynamic loss suffers from some inherent limitations, ie, the computational cost of loss networks and the restricted similarity measurement handcrafted loss functions; and (2) the states of the student model are provided to the teacher model directly without integration, causing the teacher model to underperform when trained on insufficient amounts of data. To alleviate the above-mentioned issues, in this paper, we select and weigh a set of similarity metrics by a confidence-based selection algorithm and a temporal teacher model to enhance the dynamic loss functions. Subsequently, to integrate the states of the student model, we employ statistics to quantify the information loss of the student model. Extensive experiments demonstrate that our approach can enhance student learning and improve the performance of various deep models on real-world tasks, including classification, object detection, and semantic segmentation scenarios.",

keywords = "Dynamic loss function, Learning to teach, Optimization",

author = "Zhaoyang Hai and Liyuan Pan and Xiabi Liu and Mengqiao Han",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2025",

month = mar,

doi = "10.1016/j.patcog.2024.111124",

language = "English",

volume = "159",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - L2T-DFM

T2 - Learning to Teach with Dynamic Fused Metric

AU - Hai, Zhaoyang

AU - Pan, Liyuan

AU - Liu, Xiabi

AU - Han, Mengqiao

PY - 2025/3

Y1 - 2025/3

N2 - The loss function plays a crucial role in the construction of machine learning algorithms. Employing a teacher model to set loss functions dynamically for student models has attracted attention. In existing works, (1) the characterization of the dynamic loss suffers from some inherent limitations, ie, the computational cost of loss networks and the restricted similarity measurement handcrafted loss functions; and (2) the states of the student model are provided to the teacher model directly without integration, causing the teacher model to underperform when trained on insufficient amounts of data. To alleviate the above-mentioned issues, in this paper, we select and weigh a set of similarity metrics by a confidence-based selection algorithm and a temporal teacher model to enhance the dynamic loss functions. Subsequently, to integrate the states of the student model, we employ statistics to quantify the information loss of the student model. Extensive experiments demonstrate that our approach can enhance student learning and improve the performance of various deep models on real-world tasks, including classification, object detection, and semantic segmentation scenarios.

AB - The loss function plays a crucial role in the construction of machine learning algorithms. Employing a teacher model to set loss functions dynamically for student models has attracted attention. In existing works, (1) the characterization of the dynamic loss suffers from some inherent limitations, ie, the computational cost of loss networks and the restricted similarity measurement handcrafted loss functions; and (2) the states of the student model are provided to the teacher model directly without integration, causing the teacher model to underperform when trained on insufficient amounts of data. To alleviate the above-mentioned issues, in this paper, we select and weigh a set of similarity metrics by a confidence-based selection algorithm and a temporal teacher model to enhance the dynamic loss functions. Subsequently, to integrate the states of the student model, we employ statistics to quantify the information loss of the student model. Extensive experiments demonstrate that our approach can enhance student learning and improve the performance of various deep models on real-world tasks, including classification, object detection, and semantic segmentation scenarios.

KW - Dynamic loss function

KW - Learning to teach

KW - Optimization

UR - http://www.scopus.com/inward/record.url?scp=85208175866&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.111124

DO - 10.1016/j.patcog.2024.111124

M3 - Article

AN - SCOPUS:85208175866

SN - 0031-3203

VL - 159

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 111124

ER -

L2T-DFM: Learning to Teach with Dynamic Fused Metric

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this