Towards interpreting deep neural networks via layer behavior understanding

Jiezhang Cao; Jincheng Li; Xiping Hu; Xiangmiao Wu; Mingkui Tan

doi:10.1007/s10994-021-06074-8

Towards interpreting deep neural networks via layer behavior understanding

Jiezhang Cao, Jincheng Li, Xiping Hu, Xiangmiao Wu^*, Mingkui Tan^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Deep neural networks (DNNs) have achieved success in many machine learning tasks. However, how to interpret DNNs is still an open problem. In particular, how do hidden layers behave is not clearly understood. In this paper, relying on a teacher-student paradigm, we seek to understand the layer behaviors of DNNs by “monitoring” the distribution evolution for both across-layer and single-layer along the depth and training epochs, respectively. Relying on the optimal transport theory, we employ the Wasserstein distance (W-distance) to measure the divergence between the layer distribution and the target distribution. Theoretically, we prove that (i) the W-distance between the distribution of any layer and the target distribution tends to decrease along the depth; (ii) for a specific layer, the W-distance between the distribution in an iteration and the target distribution tends to decrease along training epochs; (iii) a deeper layer, however, is not always better than a shallower layer. Relying on these properties, we are able to propose an early-exit inference method to improve the performance of the multi-label classification. Moreover, our results help to analyze the stability of layer distributions and explain why auxiliary losses are helpful in training DNNs. Extensive experiments justify our theoretical findings.

Original language	English
Pages (from-to)	1159-1179
Number of pages	21
Journal	Machine Learning
Volume	111
Issue number	3
DOIs	https://doi.org/10.1007/s10994-021-06074-8
Publication status	Published - Mar 2022
Externally published	Yes

Keywords

Layer behavior
Teacher-student paradigm
Wasserstein distance

Access to Document

10.1007/s10994-021-06074-8

Cite this

@article{541dadd81dc644eb891aeb873e7ace54,

title = "Towards interpreting deep neural networks via layer behavior understanding",

abstract = "Deep neural networks (DNNs) have achieved success in many machine learning tasks. However, how to interpret DNNs is still an open problem. In particular, how do hidden layers behave is not clearly understood. In this paper, relying on a teacher-student paradigm, we seek to understand the layer behaviors of DNNs by “monitoring” the distribution evolution for both across-layer and single-layer along the depth and training epochs, respectively. Relying on the optimal transport theory, we employ the Wasserstein distance (W-distance) to measure the divergence between the layer distribution and the target distribution. Theoretically, we prove that (i) the W-distance between the distribution of any layer and the target distribution tends to decrease along the depth; (ii) for a specific layer, the W-distance between the distribution in an iteration and the target distribution tends to decrease along training epochs; (iii) a deeper layer, however, is not always better than a shallower layer. Relying on these properties, we are able to propose an early-exit inference method to improve the performance of the multi-label classification. Moreover, our results help to analyze the stability of layer distributions and explain why auxiliary losses are helpful in training DNNs. Extensive experiments justify our theoretical findings.",

keywords = "Layer behavior, Teacher-student paradigm, Wasserstein distance",

author = "Jiezhang Cao and Jincheng Li and Xiping Hu and Xiangmiao Wu and Mingkui Tan",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature.",

year = "2022",

month = mar,

doi = "10.1007/s10994-021-06074-8",

language = "English",

volume = "111",

pages = "1159--1179",

journal = "Machine Learning",

issn = "0885-6125",

publisher = "Springer Netherlands",

number = "3",

}

TY - JOUR

T1 - Towards interpreting deep neural networks via layer behavior understanding

AU - Cao, Jiezhang

AU - Li, Jincheng

AU - Hu, Xiping

AU - Wu, Xiangmiao

AU - Tan, Mingkui

PY - 2022/3

Y1 - 2022/3

N2 - Deep neural networks (DNNs) have achieved success in many machine learning tasks. However, how to interpret DNNs is still an open problem. In particular, how do hidden layers behave is not clearly understood. In this paper, relying on a teacher-student paradigm, we seek to understand the layer behaviors of DNNs by “monitoring” the distribution evolution for both across-layer and single-layer along the depth and training epochs, respectively. Relying on the optimal transport theory, we employ the Wasserstein distance (W-distance) to measure the divergence between the layer distribution and the target distribution. Theoretically, we prove that (i) the W-distance between the distribution of any layer and the target distribution tends to decrease along the depth; (ii) for a specific layer, the W-distance between the distribution in an iteration and the target distribution tends to decrease along training epochs; (iii) a deeper layer, however, is not always better than a shallower layer. Relying on these properties, we are able to propose an early-exit inference method to improve the performance of the multi-label classification. Moreover, our results help to analyze the stability of layer distributions and explain why auxiliary losses are helpful in training DNNs. Extensive experiments justify our theoretical findings.

AB - Deep neural networks (DNNs) have achieved success in many machine learning tasks. However, how to interpret DNNs is still an open problem. In particular, how do hidden layers behave is not clearly understood. In this paper, relying on a teacher-student paradigm, we seek to understand the layer behaviors of DNNs by “monitoring” the distribution evolution for both across-layer and single-layer along the depth and training epochs, respectively. Relying on the optimal transport theory, we employ the Wasserstein distance (W-distance) to measure the divergence between the layer distribution and the target distribution. Theoretically, we prove that (i) the W-distance between the distribution of any layer and the target distribution tends to decrease along the depth; (ii) for a specific layer, the W-distance between the distribution in an iteration and the target distribution tends to decrease along training epochs; (iii) a deeper layer, however, is not always better than a shallower layer. Relying on these properties, we are able to propose an early-exit inference method to improve the performance of the multi-label classification. Moreover, our results help to analyze the stability of layer distributions and explain why auxiliary losses are helpful in training DNNs. Extensive experiments justify our theoretical findings.

KW - Layer behavior

KW - Teacher-student paradigm

KW - Wasserstein distance

UR - http://www.scopus.com/inward/record.url?scp=85123949736&partnerID=8YFLogxK

U2 - 10.1007/s10994-021-06074-8

DO - 10.1007/s10994-021-06074-8

M3 - Article

AN - SCOPUS:85123949736

SN - 0885-6125

VL - 111

SP - 1159

EP - 1179

JO - Machine Learning

JF - Machine Learning

IS - 3

ER -

Towards interpreting deep neural networks via layer behavior understanding

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this