Automatic Operator Performance Tumng in a Machine Learning System on Edge

Peng Xu; Xinyu Chang; Jianxin Zhao; Chi Harold Liu

doi:10.1109/ICPADS56603.2022.00109

Automatic Operator Performance Tumng in a Machine Learning System on Edge

Peng Xu^*, Xinyu Chang, Jianxin Zhao, Chi Harold Liu

^*此作品的通讯作者

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.

源语言	英语
主期刊名	Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022
出版商	IEEE Computer Society
页	802-809
页数	8
ISBN（电子版）	9781665473156
DOI	https://doi.org/10.1109/ICPADS56603.2022.00109
出版状态	已出版 - 2023
活动	28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022 - Nanjing, 中国期限: 10 1月 2023 → 12 1月 2023

出版系列

姓名	Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
卷	2023-January
ISSN（印刷版）	1521-9097

会议

会议	28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022
国家/地区	中国
市	Nanjing
时期	10/01/23 → 12/01/23

访问文件

10.1109/ICPADS56603.2022.00109

其它文件与链接

链接到 Scopus 的出版物

引用此

Xu, P., Chang, X., Zhao, J., & Liu, C. H. (2023). Automatic Operator Performance Tumng in a Machine Learning System on Edge. 在 Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022 (页码 802-809). (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS; 卷 2023-January). IEEE Computer Society. https://doi.org/10.1109/ICPADS56603.2022.00109

@inproceedings{0970b380d89e494599d8a3d0ff84ad36,

title = "Automatic Operator Performance Tumng in a Machine Learning System on Edge",

abstract = "With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.",

keywords = "automatic tuning, convolution, machine learning system, optimization",

author = "Peng Xu and Xinyu Chang and Jianxin Zhao and Liu, {Chi Harold}",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022 ; Conference date: 10-01-2023 Through 12-01-2023",

year = "2023",

doi = "10.1109/ICPADS56603.2022.00109",

language = "English",

series = "Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS",

publisher = "IEEE Computer Society",

pages = "802--809",

booktitle = "Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022",

address = "United States",

}

Xu, P, Chang, X, Zhao, J & Liu, CH 2023, Automatic Operator Performance Tumng in a Machine Learning System on Edge. 在 Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022. Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS, 卷 2023-January, IEEE Computer Society, 页码 802-809, 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022, Nanjing, 中国, 10/01/23. https://doi.org/10.1109/ICPADS56603.2022.00109

Automatic Operator Performance Tumng in a Machine Learning System on Edge. / Xu, Peng; Chang, Xinyu; Zhao, Jianxin 等.
Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022. IEEE Computer Society, 2023. 页码 802-809 (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS; 卷 2023-January).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Automatic Operator Performance Tumng in a Machine Learning System on Edge

AU - Xu, Peng

AU - Chang, Xinyu

AU - Zhao, Jianxin

AU - Liu, Chi Harold

PY - 2023

Y1 - 2023

N2 - With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.

AB - With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.

KW - automatic tuning

KW - convolution

KW - machine learning system

KW - optimization

UR - http://www.scopus.com/inward/record.url?scp=85152928988&partnerID=8YFLogxK

U2 - 10.1109/ICPADS56603.2022.00109

DO - 10.1109/ICPADS56603.2022.00109

M3 - Conference contribution

AN - SCOPUS:85152928988

T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS

SP - 802

EP - 809

BT - Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022

PB - IEEE Computer Society

T2 - 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022

Y2 - 10 January 2023 through 12 January 2023

ER -

Xu P, Chang X, Zhao J, Liu CH. Automatic Operator Performance Tumng in a Machine Learning System on Edge. 在 Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022. IEEE Computer Society. 2023. 页码 802-809. (Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS). doi: 10.1109/ICPADS56603.2022.00109

Automatic Operator Performance Tumng in a Machine Learning System on Edge

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此