Abstract
Executing deep neural network (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g., background or subpopulation shifts). With limited resources, state-of-the-art domain adaptation (DA) methods either incur high training overhead on large DNN models or suffer significant accuracy losses when adapting small/compressed models online. Inefficient resource scheduling among multiple applications further degrades overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at the edge. Its first key component is master-surrogate DNN models, which dynamically generate a small surrogate DNN by retaining and training the regions of the large master DNN most relevant to the new domain. The second novelty of ElasticDNN is filter-grained resource scheduling, which allocates GPU resources based on the online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaptation time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.
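To picture the master-surrogate idea described above, the following is a minimal PyTorch sketch, not the paper's implementation: the helper `build_surrogate_conv` and its per-filter activation-magnitude relevance score are illustrative assumptions, standing in for ElasticDNN's actual retention criterion and remodeling procedure.

```python
# Illustrative sketch only (assumed helper, not ElasticDNN's actual method):
# derive a small "surrogate" conv layer from a large "master" conv layer by
# keeping the filters that respond most strongly to new-domain data.
import torch
import torch.nn as nn

def build_surrogate_conv(master: nn.Conv2d, domain_batch: torch.Tensor,
                         keep_ratio: float = 0.25) -> nn.Conv2d:
    """Select the top-k filters of `master` by mean activation magnitude
    on a batch of new-domain inputs and copy them into a smaller conv."""
    with torch.no_grad():
        acts = master(domain_batch)                 # (N, C_out, H, W)
        relevance = acts.abs().mean(dim=(0, 2, 3))  # per-filter score (assumed metric)
    k = max(1, int(master.out_channels * keep_ratio))
    keep = relevance.topk(k).indices                # indices of retained filters

    surrogate = nn.Conv2d(master.in_channels, k,
                          kernel_size=master.kernel_size,
                          stride=master.stride,
                          padding=master.padding,
                          bias=master.bias is not None)
    with torch.no_grad():
        surrogate.weight.copy_(master.weight[keep])
        if master.bias is not None:
            surrogate.bias.copy_(master.bias[keep])
    return surrogate

# Usage: only the small surrogate is trained online; the master stays frozen.
master = nn.Conv2d(3, 64, kernel_size=3, padding=1)
new_domain_images = torch.randn(8, 3, 224, 224)     # stand-in for drifted data
surrogate = build_surrogate_conv(master, new_domain_images, keep_ratio=0.25)
optimizer = torch.optim.SGD(surrogate.parameters(), lr=1e-3)
```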
Original language | English
---|---
Pages (from-to) | 1616-1630
Number of pages | 15
Journal | IEEE Transactions on Computers
Volume | 73
Issue number | 6
DOI |
Publication status | Published - 1 Jun 2024