Accelerating Deep Learning Systems via Critical Set Identification and Model Compression

Rui Han; Chi Harold Liu; Shilin Li; Shilin Wen; Xue Liu

doi:10.1109/TC.2020.2970917

Rui Han, Chi Harold Liu^*, Shilin Li, Shilin Wen, Xue Liu

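^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › peer-review

12 Citations (Scopus)

Abstract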

Modern distributed engines are increasingly deployed to accelerate large-scale deep learning (DL) training jobs. While the parallelism of distributed workers/nodes promises scalability, the computation and communication overheads of the underlying iterative solving algorithms, e.g., stochastic gradient descent, unfortunately become the bottleneck for distributed DL training jobs. Existing approaches address such limitations by designing more efficient synchronization algorithms and model compression techniques, but do not adequately address issues relating to processing massive datasets. In this article, we propose ClipDL, which accelerates deep learning systems by simultaneously decreasing the number of model parameters and restricting computation to critical data only. The core component of ClipDL is the estimation of the critical set, based on the observation that large proportions of input data have little influence on model parameter updating in many prevalent DL algorithms. We implemented ClipDL on Spark (a popular distributed engine for big data) and BigDL (based on the de facto distributed DL training architecture, the parameter server), and integrated it with representative model compression techniques. Exhaustive experiments on real DL applications and datasets show that ClipDL accelerates the model training process by an average of 2.32 times while incurring accuracy losses of only 1.86 percent.

Original language	English
Article number	8977355
Pages (from-to)	1059-1070
Number of pages	12
Journal	IEEE Transactions on Computers
Volume	69
Issue number	7
DOI	https://doi.org/10.1109/TC.2020.2970917
Publication status	Published - 1 Jul 2020

Access to Document

10.1109/TC.2020.2970917

Other files and links

Link to publication in Scopus

Cite this

@article{f645f205cd8e4290af395e1017dd16dc,

title = "Accelerating Deep Learning Systems via Critical Set Identification and Model Compression",

abstract = "Modern distributed engines are increasingly deployed to accelerate large-scale deep learning (DL) training jobs. While the parallelism of distributed workers/nodes promises scalability, the computation and communication overheads of the underlying iterative solving algorithms, e.g., stochastic gradient descent, unfortunately become the bottleneck for distributed DL training jobs. Existing approaches address such limitations by designing more efficient synchronization algorithms and model compression techniques, but do not adequately address issues relating to processing massive datasets. In this article, we propose ClipDL, which accelerates deep learning systems by simultaneously decreasing the number of model parameters and restricting computation to critical data only. The core component of ClipDL is the estimation of the critical set, based on the observation that large proportions of input data have little influence on model parameter updating in many prevalent DL algorithms. We implemented ClipDL on Spark (a popular distributed engine for big data) and BigDL (based on the de facto distributed DL training architecture, the parameter server), and integrated it with representative model compression techniques. Exhaustive experiments on real DL applications and datasets show that ClipDL accelerates the model training process by an average of 2.32 times while incurring accuracy losses of only 1.86 percent.",

keywords = "Deep learning, Distributed systems, Massive datasets, Redundant input data",

author = "Rui Han and Liu, {Chi Harold} and Shilin Li and Shilin Wen and Xue Liu",

note = "Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",

year = "2020",

month = jul,

day = "1",

doi = "10.1109/TC.2020.2970917",

language = "English",

volume = "69",

pages = "1059--1070",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "7",

}

TY - JOUR

T1 - Accelerating Deep Learning Systems via Critical Set Identification and Model Compression

AU - Han, Rui

AU - Liu, Chi Harold

AU - Li, Shilin

AU - Wen, Shilin

AU - Liu, Xue

PY - 2020/7/1

Y1 - 2020/7/1

N2 - Modern distributed engines are increasingly deployed to accelerate large-scale deep learning (DL) training jobs. While the parallelism of distributed workers/nodes promises scalability, the computation and communication overheads of the underlying iterative solving algorithms, e.g., stochastic gradient descent, unfortunately become the bottleneck for distributed DL training jobs. Existing approaches address such limitations by designing more efficient synchronization algorithms and model compression techniques, but do not adequately address issues relating to processing massive datasets. In this article, we propose ClipDL, which accelerates deep learning systems by simultaneously decreasing the number of model parameters and restricting computation to critical data only. The core component of ClipDL is the estimation of the critical set, based on the observation that large proportions of input data have little influence on model parameter updating in many prevalent DL algorithms. We implemented ClipDL on Spark (a popular distributed engine for big data) and BigDL (based on the de facto distributed DL training architecture, the parameter server), and integrated it with representative model compression techniques. Exhaustive experiments on real DL applications and datasets show that ClipDL accelerates the model training process by an average of 2.32 times while incurring accuracy losses of only 1.86 percent.

AB - Modern distributed engines are increasingly deployed to accelerate large-scale deep learning (DL) training jobs. While the parallelism of distributed workers/nodes promises scalability, the computation and communication overheads of the underlying iterative solving algorithms, e.g., stochastic gradient descent, unfortunately become the bottleneck for distributed DL training jobs. Existing approaches address such limitations by designing more efficient synchronization algorithms and model compression techniques, but do not adequately address issues relating to processing massive datasets. In this article, we propose ClipDL, which accelerates deep learning systems by simultaneously decreasing the number of model parameters and restricting computation to critical data only. The core component of ClipDL is the estimation of the critical set, based on the observation that large proportions of input data have little influence on model parameter updating in many prevalent DL algorithms. We implemented ClipDL on Spark (a popular distributed engine for big data) and BigDL (based on the de facto distributed DL training architecture, the parameter server), and integrated it with representative model compression techniques. Exhaustive experiments on real DL applications and datasets show that ClipDL accelerates the model training process by an average of 2.32 times while incurring accuracy losses of only 1.86 percent.

KW - Deep learning

KW - Distributed systems

KW - Massive datasets

KW - Redundant input data

UR - http://www.scopus.com/inward/record.url?scp=85086585852&partnerID=8YFLogxK

U2 - 10.1109/TC.2020.2970917

DO - 10.1109/TC.2020.2970917

M3 - Article

AN - SCOPUS:85086585852

SN - 0018-9340

VL - 69

SP - 1059

EP - 1070

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 7

M1 - 8977355

ER -
