Accelerating Deep Learning Systems via Critical Set Identification and Model Compression

Rui Han, Chi Harold Liu*, Shilin Li, Shilin Wen, Xue Liu

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)

Abstract

Modern distributed engines are increasingly deployed to accelerate large-scale deep learning (DL) training jobs. While the parallelism of distributed workers/nodes promises scalability, the computation and communication overheads of the underlying iterative solving algorithms, e.g., stochastic gradient descent, unfortunately become the bottleneck for distributed DL training jobs. Existing approaches address these limitations by designing more efficient synchronization algorithms and model compression techniques, but do not adequately address the cost of processing massive datasets. In this article, we propose ClipDL, which accelerates deep learning systems by simultaneously decreasing the number of model parameters and restricting computation to critical data only. The core component of ClipDL is the estimation of a critical set, based on the observation that in many prevalent DL algorithms a large proportion of input data has little influence on model parameter updates. We implemented ClipDL on Spark (a popular distributed engine for big data) and BigDL (built on the de facto distributed DL training architecture, the parameter server), and integrated it with representative model compression techniques. Exhaustive experiments on real DL applications and datasets show that ClipDL accelerates model training by an average of 2.32 times while incurring an average accuracy loss of only 1.86 percent.
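The critical-set idea in the abstract can be illustrated with a minimal sketch: samples whose per-sample loss is near zero produce near-zero gradients, so an epoch can skip them and spend computation on the remaining "critical" samples. The function below is a hypothetical illustration of this selection step (the name `select_critical_set` and the `keep_ratio` knob are assumptions for illustration, not the paper's actual algorithm):

```python
import numpy as np

def select_critical_set(losses, keep_ratio=0.5):
    """Return indices of the samples with the largest per-sample losses.

    Rationale: low-loss samples yield small gradients and thus barely
    change the model parameters; dropping them from an iteration reduces
    computation with limited impact on the update direction.
    `keep_ratio` is a hypothetical tuning knob, not from the paper.
    """
    n_keep = max(1, int(len(losses) * keep_ratio))
    # argsort is ascending, so the tail holds the largest losses
    return np.argsort(losses)[-n_keep:]

# Toy example: per-sample losses from one forward pass over 6 samples.
losses = np.array([0.01, 0.9, 0.02, 0.7, 0.001, 0.5])
critical = select_critical_set(losses, keep_ratio=0.5)
# critical now indexes the 3 highest-loss samples (indices 1, 3, 5),
# which would be the only ones used for the backward pass.
```

In a distributed setting such as the Spark/BigDL deployment the abstract describes, each worker could apply this filter to its local partition before computing gradients, shrinking both compute and the gradients exchanged through the parameter server.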

Original language: English
Article number: 8977355
Pages (from-to): 1059-1070
Number of pages: 12
Journal: IEEE Transactions on Computers
Volume: 69
Issue number: 7
Publication status: Published - 1 Jul 2020

Keywords

  • Deep learning
  • Distributed systems
  • Massive datasets
  • Redundant input data
