TY - JOUR
T1 - Carrying Out CNN Channel Pruning in a White Box
AU - Zhang, Yuxin
AU - Lin, Mingbao
AU - Lin, Chia-Wen
AU - Chen, Jie
AU - Wu, Yongjian
AU - Tian, Yonghong
AU - Ji, Rongrong
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2023/10/1
Y1 - 2023/10/1
N2 - Channel pruning has long been studied as a way to compress convolutional neural networks (CNNs), significantly reducing their overall computation. Prior works implement channel pruning in an unexplainable manner, tending to reduce the final classification error while failing to consider the internal influence of each channel. In this article, we conduct channel pruning in a white box. Through deep visualization of the feature maps activated by different channels, we observe that different channels contribute differently to different categories in image classification. Inspired by this, we choose to preserve the channels that contribute to most categories. Specifically, to model the contribution of each channel to differentiating categories, we develop a class-wise mask for each channel, implemented in a dynamic training manner with respect to the input image's category. On the basis of the learned class-wise masks, we perform a global voting mechanism to remove channels with less category discrimination. Lastly, a fine-tuning process is conducted to recover the performance of the pruned model. To the best of our knowledge, this is the first time that CNN interpretability theory has been used to guide channel pruning. Extensive experiments on representative image classification tasks demonstrate the superiority of our White-Box over many state-of-the-art (SOTA) methods. For instance, on CIFAR-10, it reduces floating-point operations (FLOPs) by 65.23% with even a 0.62% accuracy improvement for ResNet-110. On ILSVRC-2012, White-Box achieves a 45.6% FLOP reduction with only a small loss of 0.83% in top-1 accuracy for ResNet-50. Code is available at https://github.com/zyxxmu/White-Box.
AB - Channel pruning has long been studied as a way to compress convolutional neural networks (CNNs), significantly reducing their overall computation. Prior works implement channel pruning in an unexplainable manner, tending to reduce the final classification error while failing to consider the internal influence of each channel. In this article, we conduct channel pruning in a white box. Through deep visualization of the feature maps activated by different channels, we observe that different channels contribute differently to different categories in image classification. Inspired by this, we choose to preserve the channels that contribute to most categories. Specifically, to model the contribution of each channel to differentiating categories, we develop a class-wise mask for each channel, implemented in a dynamic training manner with respect to the input image's category. On the basis of the learned class-wise masks, we perform a global voting mechanism to remove channels with less category discrimination. Lastly, a fine-tuning process is conducted to recover the performance of the pruned model. To the best of our knowledge, this is the first time that CNN interpretability theory has been used to guide channel pruning. Extensive experiments on representative image classification tasks demonstrate the superiority of our White-Box over many state-of-the-art (SOTA) methods. For instance, on CIFAR-10, it reduces floating-point operations (FLOPs) by 65.23% with even a 0.62% accuracy improvement for ResNet-110. On ILSVRC-2012, White-Box achieves a 45.6% FLOP reduction with only a small loss of 0.83% in top-1 accuracy for ResNet-50. Code is available at https://github.com/zyxxmu/White-Box.
KW - Channel pruning
KW - efficient inference
KW - image classification
KW - network structure
UR - http://www.scopus.com/inward/record.url?scp=85124822539&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2022.3147269
DO - 10.1109/TNNLS.2022.3147269
M3 - Article
AN - SCOPUS:85124822539
SN - 2162-237X
VL - 34
SP - 7946
EP - 7955
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 10
ER -