Automatic Operator Performance Tumng in a Machine Learning System on Edge

Peng Xu*, Xinyu Chang, Jianxin Zhao, Chi Harold Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the current large scale deployment of machine learning technologies, such as those on cloud servers and edge and IoT hardwares, machine learning systems have been widely prevalence. Practical requirement has driven their performance increase in both academia and industry. However, the application requirement varies greatly across different applications, and directly using off-the-shelf systems might not be sufficient in many cases. In this work, we first propose to implement a series of techniques to optimize performance of convolution operation, one of the most important operations, in constructing deep learning networks. Besides, we also propose to apply the automated empirical optimisation of software approach to improve the performance of operators in machine learning system, most notably across various hardware platforms. Evaluation compared to existing libraries on different hardware devices has proved the efficiency of our proposed method.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE 28th International Conference on Parallel and Distributed Systems, ICPADS 2022
PublisherIEEE Computer Society
Pages802-809
Number of pages8
ISBN (Electronic)9781665473156
DOIs
Publication statusPublished - 2023
Event28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022 - Nanjing, China
Duration: 10 Jan 202312 Jan 2023

Publication series

NameProceedings of the International Conference on Parallel and Distributed Systems - ICPADS
Volume2023-January
ISSN (Print)1521-9097

Conference

Conference28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022
Country/TerritoryChina
CityNanjing
Period10/01/2312/01/23

Keywords

  • automatic tuning
  • convolution
  • machine learning system
  • optimization

Fingerprint

Dive into the research topics of 'Automatic Operator Performance Tumng in a Machine Learning System on Edge'. Together they form a unique fingerprint.

Cite this