Lightweight NPU Method Based on Hyper-Threading Technology

Ao Zhang*, Yongrui Li, Wenlong Li, Yizhuang Xie, Zhihan Zhang, Zhu Yang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

With the rapid development of deep learning, the processing power of Neural Processing Units (NPUs) continues to improve. This growth, however, increases the logic resources an NPU requires, which makes deployment on a Field-Programmable Gate Array (FPGA) challenging. To enable the deployment of high-computational-power NPUs on FPGAs, this paper proposes a lightweight NPU method based on hyper-threading technology that takes the characteristics of the NPU hardware architecture into account. The method reduces the NPU's logic resource usage at the cost of only a small decrease in computational power. Experiments were carried out on the Virtex UltraScale+ HBM VCU128 FPGA platform. In testing, an NPU that previously could not be deployed was deployed successfully after lightweight processing: the LUT usage ratio decreased by about 10%, while computational power decreased by only 4%. The approach offers a reference for the lightweight design of NPUs.

Original language: English
Title of host publication: IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798331515669
Publication status: Published - 2024
Event: 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 - Zhuhai, China
Duration: 22 Nov 2024 to 24 Nov 2024

Publication series

Name: IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

Conference

Conference: 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Country/Territory: China
City: Zhuhai
Period: 22/11/24 to 24/11/24

Keywords

  • FPGA
  • Hyper-threading Technology
  • Lightweighting
  • NPU
