Memory Relevant Hyperparameters Optimization for DNN Training at Edge

Yidong Xu*, Rui Han, Junyan Ouyang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The full utilization of the low communication latency and privacy-preserving capabilities of edge computing is contingent upon the deployment of an increasing number of deep learning tasks on edge devices for training and inference. However, there are memory-constrained challenges in implementing training models on edge devices. Existing techniques optimise the activation memory for training with lossless accuracy by freezing network parameters, gradient checkpoint, and gradient accumulation. While these methods can also be effective for memory compression, there are fewer memory-integrated optimization techniques for combining multiple techniques to fully utilize the computing power of edge devices, especially when integrating these techniques for edge-side training tasks. In this paper, we propose a Memory Integration Controller (MIC) based on edge-end training resource optimization, which can integrate three complex memory tuning techniques and perform simple tuning through memory-related hyperparameters. The main feature of MIC is that it can provide a lightweight and simple memory tuning means to adapt to the extreme computing environment at the edge under dynamic resource environment. We implement MIC on prevalent deep neural networks (DNNs) and demonstrate its effectiveness against other state-of-the-art techniques. Its integrated memory optimization reduces memory usage by 25x on average, while reducing deployment memory and response time by 28% and 11.8% respectively.

Original languageEnglish
Title of host publication2024 IEEE 4th International Conference on Electronic Technology, Communication and Information, ICETCI 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages194-198
Number of pages5
ISBN (Electronic)9798350361643
DOIs
Publication statusPublished - 2024
Event4th IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2024 - Changchun, China
Duration: 24 May 202426 May 2024

Publication series

Name2024 IEEE 4th International Conference on Electronic Technology, Communication and Information, ICETCI 2024

Conference

Conference4th IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2024
Country/TerritoryChina
CityChangchun
Period24/05/2426/05/24

Keywords

  • deep neural networks (DNNs)
  • edge computing
  • memory integration control
  • memory optimization

Fingerprint

Dive into the research topics of 'Memory Relevant Hyperparameters Optimization for DNN Training at Edge'. Together they form a unique fingerprint.

Cite this