Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications

Jiaqi Li; Yanan Zhao; Li Gao; Feng Cui

doi:10.1109/ICPR48806.2021.9412687

Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications

Jiaqi Li, Yanan Zhao, Li Gao, Feng Cui

School of Mechanical Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Citations (Scopus)

Abstract

Nowadays, in the area of autonomous driving, the computational power of the object detectors is limited by the embedded devices and the public datasets for autonomous driving are over-idealistic. In this paper, we propose a pipeline combining both block-wise pruning and channel-wise pruning to compress the object detection model iteratively. We enforce the introduced factor of the residual blocks and the scale parameters in Batch Normalization (BN) layers to sparsity to select the less important residual blocks and channels. Moreover, a modified loss function has been proposed to remedy the class-imbalance problem. After removing the unimportant structures iteratively, we get the pruned YOLOv3 trained on our datasets which have more abundant and elaborate classes. Evaluated by our validation sets on the server, the pruned YOLOv3 saves 79.7% floating point operations (FLOPs), 93.8% parameter size, 93.8% model volume and 45.4% inference times with only 4.16% mean of average precision (mAP) loss. Evaluated on the embedded device, the pruned model operates about 13 frames per second with 4.53% mAP loss. These results show that the real-time property and accuracy of the pruned YOLOv3 can meet the needs of the embedded devices in complicated autonomous driving environments.

Original language	English
Title of host publication	Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	5107-5114
Number of pages	8
ISBN (Electronic)	9781728188089
DOIs	https://doi.org/10.1109/ICPR48806.2021.9412687
Publication status	Published - 2020
Event	25th International Conference on Pattern Recognition, ICPR 2020 - Virtual, Milan, Italy Duration: 10 Jan 2021 → 15 Jan 2021

Publication series

Name	Proceedings - International Conference on Pattern Recognition
ISSN (Print)	1051-4651

Conference

Conference	25th International Conference on Pattern Recognition, ICPR 2020
Country/Territory	Italy
City	Virtual, Milan
Period	10/01/21 → 15/01/21

Access to Document

10.1109/ICPR48806.2021.9412687

Cite this

Li, J., Zhao, Y., Gao, L., & Cui, F. (2020). Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications. In Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition (pp. 5107-5114). Article 9412687 (Proceedings - International Conference on Pattern Recognition). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICPR48806.2021.9412687

Li, Jiaqi ; Zhao, Yanan ; Gao, Li et al. / Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications. Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 2020. pp. 5107-5114 (Proceedings - International Conference on Pattern Recognition).

@inproceedings{3865cae1614e4e7b87c20b94ecb71610,

title = "Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications",

abstract = "Nowadays, in the area of autonomous driving, the computational power of the object detectors is limited by the embedded devices and the public datasets for autonomous driving are over-idealistic. In this paper, we propose a pipeline combining both block-wise pruning and channel-wise pruning to compress the object detection model iteratively. We enforce the introduced factor of the residual blocks and the scale parameters in Batch Normalization (BN) layers to sparsity to select the less important residual blocks and channels. Moreover, a modified loss function has been proposed to remedy the class-imbalance problem. After removing the unimportant structures iteratively, we get the pruned YOLOv3 trained on our datasets which have more abundant and elaborate classes. Evaluated by our validation sets on the server, the pruned YOLOv3 saves 79.7% floating point operations (FLOPs), 93.8% parameter size, 93.8% model volume and 45.4% inference times with only 4.16% mean of average precision (mAP) loss. Evaluated on the embedded device, the pruned model operates about 13 frames per second with 4.53% mAP loss. These results show that the real-time property and accuracy of the pruned YOLOv3 can meet the needs of the embedded devices in complicated autonomous driving environments.",

author = "Jiaqi Li and Yanan Zhao and Li Gao and Feng Cui",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE; 25th International Conference on Pattern Recognition, ICPR 2020 ; Conference date: 10-01-2021 Through 15-01-2021",

year = "2020",

doi = "10.1109/ICPR48806.2021.9412687",

language = "English",

series = "Proceedings - International Conference on Pattern Recognition",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "5107--5114",

booktitle = "Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition",

address = "United States",

}

Li, J, Zhao, Y, Gao, L & Cui, F 2020, Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications. in Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition., 9412687, Proceedings - International Conference on Pattern Recognition, Institute of Electrical and Electronics Engineers Inc., pp. 5107-5114, 25th International Conference on Pattern Recognition, ICPR 2020, Virtual, Milan, Italy, 10/01/21. https://doi.org/10.1109/ICPR48806.2021.9412687

Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications. / Li, Jiaqi; Zhao, Yanan; Gao, Li et al.
Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition. Institute of Electrical and Electronics Engineers Inc., 2020. p. 5107-5114 9412687 (Proceedings - International Conference on Pattern Recognition).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications

AU - Li, Jiaqi

AU - Zhao, Yanan

AU - Gao, Li

AU - Cui, Feng

PY - 2020

Y1 - 2020

N2 - Nowadays, in the area of autonomous driving, the computational power of the object detectors is limited by the embedded devices and the public datasets for autonomous driving are over-idealistic. In this paper, we propose a pipeline combining both block-wise pruning and channel-wise pruning to compress the object detection model iteratively. We enforce the introduced factor of the residual blocks and the scale parameters in Batch Normalization (BN) layers to sparsity to select the less important residual blocks and channels. Moreover, a modified loss function has been proposed to remedy the class-imbalance problem. After removing the unimportant structures iteratively, we get the pruned YOLOv3 trained on our datasets which have more abundant and elaborate classes. Evaluated by our validation sets on the server, the pruned YOLOv3 saves 79.7% floating point operations (FLOPs), 93.8% parameter size, 93.8% model volume and 45.4% inference times with only 4.16% mean of average precision (mAP) loss. Evaluated on the embedded device, the pruned model operates about 13 frames per second with 4.53% mAP loss. These results show that the real-time property and accuracy of the pruned YOLOv3 can meet the needs of the embedded devices in complicated autonomous driving environments.

AB - Nowadays, in the area of autonomous driving, the computational power of the object detectors is limited by the embedded devices and the public datasets for autonomous driving are over-idealistic. In this paper, we propose a pipeline combining both block-wise pruning and channel-wise pruning to compress the object detection model iteratively. We enforce the introduced factor of the residual blocks and the scale parameters in Batch Normalization (BN) layers to sparsity to select the less important residual blocks and channels. Moreover, a modified loss function has been proposed to remedy the class-imbalance problem. After removing the unimportant structures iteratively, we get the pruned YOLOv3 trained on our datasets which have more abundant and elaborate classes. Evaluated by our validation sets on the server, the pruned YOLOv3 saves 79.7% floating point operations (FLOPs), 93.8% parameter size, 93.8% model volume and 45.4% inference times with only 4.16% mean of average precision (mAP) loss. Evaluated on the embedded device, the pruned model operates about 13 frames per second with 4.53% mAP loss. These results show that the real-time property and accuracy of the pruned YOLOv3 can meet the needs of the embedded devices in complicated autonomous driving environments.

UR - http://www.scopus.com/inward/record.url?scp=85110412422&partnerID=8YFLogxK

U2 - 10.1109/ICPR48806.2021.9412687

DO - 10.1109/ICPR48806.2021.9412687

M3 - Conference contribution

AN - SCOPUS:85110412422

T3 - Proceedings - International Conference on Pattern Recognition

SP - 5107

EP - 5114

BT - Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 25th International Conference on Pattern Recognition, ICPR 2020

Y2 - 10 January 2021 through 15 January 2021

ER -

Li J, Zhao Y, Gao L, Cui F. Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications. In Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition. Institute of Electrical and Electronics Engineers Inc. 2020. p. 5107-5114. 9412687. (Proceedings - International Conference on Pattern Recognition). doi: 10.1109/ICPR48806.2021.9412687

Compression of YOLOv3 via block-wise and channel-wise pruning for real-time and complicated autonomous driving environment sensing applications

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this