L2T-DLN: Learning to Teach with Dynamic Loss Network

Zhaoyang Hai, Liyuan Pan*, Xiabi Liu*, Zhengzheng Liu, Mirna Yunita

*Corresponding authors of this work

Research output: Contribution to journal › Conference article › Peer-reviewed

1 Citation (Scopus)

Abstract

With the concept of teaching introduced to the machine learning community, a teacher model can use dynamic loss functions to guide the training of a student model. The dynamic loss is intended to adapt to the different phases of the student model's learning. In existing works, the teacher model 1) determines the loss function based only on the present state of the student model, i.e., it disregards the experience of the teacher; and 2) utilizes only the states of the student model, e.g., the training iteration number and the loss/accuracy on the training/validation sets, while ignoring the states of the loss function. In this paper, we first formulate loss adjustment as a temporal task by designing a teacher model with memory units, which enables the student's learning to be guided by the experience of the teacher model. Then, with a dynamic loss network, we additionally use the states of the loss to assist the teacher's learning, enhancing the interaction between the teacher and the student model. Extensive experiments demonstrate that our approach enhances student learning and improves the performance of various deep models on real-world tasks, including classification, object detection, and semantic segmentation scenarios.
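To make the idea concrete, below is a minimal sketch of a memory-based teacher that emits weights for a parameterized student loss. It is not the paper's architecture: the class names (LSTMTeacher, dynamic_loss), the choice of an LSTM cell as the memory unit, the four-dimensional state summary, and the two base loss terms are illustrative assumptions; the actual dynamic loss network and the teacher's own optimization are more involved.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LSTMTeacher(nn.Module):
    """Hypothetical teacher: an LSTM over student/loss states that emits
    mixing weights for a small set of base loss terms."""
    def __init__(self, state_dim=4, hidden_dim=32, num_base_losses=2):
        super().__init__()
        self.rnn = nn.LSTMCell(state_dim, hidden_dim)
        self.head = nn.Linear(hidden_dim, num_base_losses)
        self.hidden = None  # (h, c) memory carried across training phases

    def forward(self, state):
        # state: (1, state_dim) summary of the student and loss status
        if self.hidden is None:
            h = state.new_zeros(state.size(0), self.rnn.hidden_size)
            self.hidden = (h, h.clone())
        self.hidden = self.rnn(state, self.hidden)
        # Softmax keeps the dynamic loss a convex combination of base terms.
        return F.softmax(self.head(self.hidden[0]), dim=-1)

def dynamic_loss(logits, targets, weights):
    """Weighted combination of two base losses, chosen for illustration."""
    ce = F.cross_entropy(logits, targets)
    probs = F.softmax(logits, dim=-1)
    onehot = F.one_hot(targets, probs.size(-1)).float()
    mse = F.mse_loss(probs, onehot)
    return weights[0, 0] * ce + weights[0, 1] * mse

# Toy usage: one student update guided by the teacher's current weights.
student = nn.Linear(16, 10)
teacher = LSTMTeacher()
opt = torch.optim.SGD(student.parameters(), lr=0.1)

x, y = torch.randn(8, 16), torch.randint(0, 10, (8,))
logits = student(x)
# Assumed state features: current loss, placeholders, and a phase indicator.
state = torch.tensor([[F.cross_entropy(logits, y).item(), 0.0, 0.0, 1.0]])
w = teacher(state)                        # loss weights from teacher memory
loss = dynamic_loss(logits, y, w.detach())
opt.zero_grad(); loss.backward(); opt.step()
```

Because the teacher keeps its LSTM hidden state across calls, the weights it produces at a given step depend on the whole history of student/loss states it has seen, which is the "temporal" aspect the abstract refers to.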

Original language: English
Pages (from-to): 43084-43096
Number of pages: 13
Journal: Advances in Neural Information Processing Systems
Volume: 36
Publication status: Published - 2023
Event: 37th Conference on Neural Information Processing Systems, NeurIPS 2023 - New Orleans, United States
Duration: 10 Dec 2023 → 16 Dec 2023
