ElasticDNN: On-Device Neural Network Remodeling for Adapting Evolving Vision Domains at Edge

Qinglong Zhang; Rui Han; Chi Harold Liu; Guoren Wang; Lydia Y. Chen

doi:10.1109/TC.2024.3375608

Qinglong Zhang, Rui Han*, Chi Harold Liu, Guoren Wang, Lydia Y. Chen

*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Executing deep neural network (DNN) based vision tasks on edge devices is challenging when data domains shift significantly and continually (e.g., background or subpopulation shift). With limited resources, state-of-the-art domain adaptation (DA) methods either incur high training overheads on large DNN models or suffer significant accuracy losses when adapting small or compressed models online. Inefficient resource scheduling among multiple applications further degrades their overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at the edge. Its first key component is the master-surrogate DNN models, which dynamically generate a small surrogate DNN by retaining and training the regions of the large master DNN most relevant to the new domain. Its second key component is filter-grained resource scheduling, which allocates GPU resources based on online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaptation time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.

Original language	English
Pages (from-to)	1616-1630
Number of pages	15
Journal	IEEE Transactions on Computers
Volume	73
Issue number	6
DOIs	https://doi.org/10.1109/TC.2024.3375608
Publication status	Published - 1 Jun 2024

Keywords

Edge vision
deep neural networks
domain adaptation

Cite this

Zhang, Q., Han, R., Liu, C. H., Wang, G., & Chen, L. Y. (2024). ElasticDNN: On-Device Neural Network Remodeling for Adapting Evolving Vision Domains at Edge. IEEE Transactions on Computers, 73(6), 1616-1630. https://doi.org/10.1109/TC.2024.3375608

@article{550b6f39cf2140ad98f2262fa9e504b7,
  title = "ElasticDNN: On-Device Neural Network Remodeling for Adapting Evolving Vision Domains at Edge",
  abstract = "Executing deep neural networks (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g. background or subpopulation shift). With limited resources, the state-of-the-art domain adaptation (DA) methods either cause high training overheads on large DNN models, or incur significant accuracy losses when adapting small/compressed models in an online fashion. The inefficient resource scheduling among multiple applications further degrades their overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at edge. Its first key component is the master-surrogate DNN models, which can dynamically generate a small surrogate DNN by retaining and training the large master DNN's most relevant regions pertinent to the new domain. The second novelty of ElasticDNN is the filter-grained resource scheduling, which allocates GPU resources based on online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaption time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.",
  keywords = "Edge vision, deep neural networks, domain adaptation",
  author = "Qinglong Zhang and Rui Han and Liu, {Chi Harold} and Guoren Wang and Chen, {Lydia Y.}",
  note = "Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",
  year = "2024",
  month = jun,
  day = "1",
  doi = "10.1109/TC.2024.3375608",
  language = "English",
  volume = "73",
  pages = "1616--1630",
  journal = "IEEE Transactions on Computers",
  issn = "0018-9340",
  publisher = "IEEE Computer Society",
  number = "6",
}

TY  - JOUR
T1  - ElasticDNN
T2  - On-Device Neural Network Remodeling for Adapting Evolving Vision Domains at Edge
AU  - Zhang, Qinglong
AU  - Han, Rui
AU  - Liu, Chi Harold
AU  - Wang, Guoren
AU  - Chen, Lydia Y.
PY  - 2024/6/1
Y1  - 2024/6/1
N2  - Executing deep neural networks (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g. background or subpopulation shift). With limited resources, the state-of-the-art domain adaptation (DA) methods either cause high training overheads on large DNN models, or incur significant accuracy losses when adapting small/compressed models in an online fashion. The inefficient resource scheduling among multiple applications further degrades their overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at edge. Its first key component is the master-surrogate DNN models, which can dynamically generate a small surrogate DNN by retaining and training the large master DNN's most relevant regions pertinent to the new domain. The second novelty of ElasticDNN is the filter-grained resource scheduling, which allocates GPU resources based on online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaption time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.
AB  - Executing deep neural networks (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g. background or subpopulation shift). With limited resources, the state-of-the-art domain adaptation (DA) methods either cause high training overheads on large DNN models, or incur significant accuracy losses when adapting small/compressed models in an online fashion. The inefficient resource scheduling among multiple applications further degrades their overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at edge. Its first key component is the master-surrogate DNN models, which can dynamically generate a small surrogate DNN by retaining and training the large master DNN's most relevant regions pertinent to the new domain. The second novelty of ElasticDNN is the filter-grained resource scheduling, which allocates GPU resources based on online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaption time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.
KW  - Edge vision
KW  - deep neural networks
KW  - domain adaptation
UR  - http://www.scopus.com/inward/record.url?scp=85187999682&partnerID=8YFLogxK
U2  - 10.1109/TC.2024.3375608
DO  - 10.1109/TC.2024.3375608
M3  - Article
AN  - SCOPUS:85187999682
SN  - 0018-9340
VL  - 73
SP  - 1616
EP  - 1630
JO  - IEEE Transactions on Computers
JF  - IEEE Transactions on Computers
IS  - 6
ER  -
