Abstract
Edge intelligence is an emerging technology that integrates edge computing and deep learning to bring AI to the network's edge. It has gained wide attention for its lower network latency and better privacy preservation. However, deep neural network inference is computationally demanding and yields poor real-time performance, making it challenging for resource-constrained edge devices. In this article, we propose a hierarchical deep learning model based on TreeNet to reduce the computational cost for edge devices. Based on the similarity of the classification categories, we decompose a given task into disjoint sub-tasks to reduce the complexity of the required model. We then propose a lightweight binary classifier to evaluate the reliability of each sub-task's inference result. If the inference result of a sub-task is unreliable, our system forwards the input sample to the cloud server for further processing. We also propose a new strategy for finding and sharing common features across sub-tasks to improve training speed and accuracy. Experimental results on several popular datasets demonstrate the effectiveness of our approach in speeding up inference while processing most of the input data with a low error rate.
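The abstract describes an edge-cloud pipeline in which a sub-task model runs on the device, a lightweight binary classifier judges whether its prediction is reliable, and unreliable samples are forwarded to the cloud. The following is a minimal PyTorch sketch of that decision flow; the module names (`SubTaskBranch`, `edge_infer`), the reliability head, and the threshold are illustrative assumptions, not the authors' actual implementation.

```python
# Hypothetical sketch of the edge-side gating described in the abstract.
# All names and design details here are assumptions for illustration only.
import torch
import torch.nn as nn


class SubTaskBranch(nn.Module):
    """A small branch model handling one disjoint sub-task (a subset of classes)."""

    def __init__(self, feature_dim: int, num_classes: int):
        super().__init__()
        self.classifier = nn.Linear(feature_dim, num_classes)
        # Lightweight binary head estimating whether the prediction is reliable.
        self.reliability_head = nn.Linear(feature_dim, 1)

    def forward(self, features: torch.Tensor):
        logits = self.classifier(features)
        reliability = torch.sigmoid(self.reliability_head(features))
        return logits, reliability


def edge_infer(features: torch.Tensor, branch: SubTaskBranch, cloud_fn, threshold: float = 0.5):
    """Infer one sample on the edge; fall back to the cloud if judged unreliable."""
    logits, reliability = branch(features)
    if reliability.item() >= threshold:
        return logits.argmax(dim=-1)  # accept the local (edge) prediction
    return cloud_fn(features)         # forward the sample to the cloud server
```

In this sketch the reliability threshold trades off local processing rate against error rate; the paper's actual gating criterion and classifier design may differ.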
Original language | English
---|---
Pages (from-to) | 2254-2266
Number of pages | 13
Journal | IEEE Transactions on Services Computing
Volume | 16
Issue number | 3
DOI |
Publication status | Published - 1 May 2023