Automatic Operator Performance Tumng in a Machine Learning System on Edge

Peng Xu*, Xinyu Chang, Jianxin Zhao, Chi Harold Liu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.

源语言英语
主期刊名Proceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022
出版商IEEE Computer Society
802-809
页数8
ISBN(电子版)9781665473156
DOI
出版状态已出版 - 2023
活动28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022 - Nanjing, 中国
期限: 10 1月 202312 1月 2023

出版系列

姓名Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
2023-January
ISSN(印刷版)1521-9097

会议

会议28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022
国家/地区中国
Nanjing
时期10/01/2312/01/23

指纹

探究 'Automatic Operator Performance Tumng in a Machine Learning System on Edge' 的科研主题。它们共同构成独一无二的指纹。

引用此