面向端到端目标检测神经网络的高效硬件加速系统设计

Translated title of the contribution: Efficient Hardware Acceleration System Design for End-to-End Object Detection Neural Network

Shiwei Ren, Chaojia Liu, Jianzheng Li, Rongkun Jiang, Xiaohua Wang, Chengbo Xue*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

To solve the problem of limited hardware resources and sensitive power consumption in the application of neural network object detection system for edge computing devices, a YOLOv3-Tiny neural network object detection hardware acceleration system was proposed based on field programmable gate array (FPGA). The scale of YOLOv3-Tiny network was reduced by using network structure reorganization, inter layer fusion and dynamic numerical quantization. Based on channel parallel and weight resident hardware acceleration algorithm, tight pipeline processing flow and hardware operation unit reuse, the utilization efficiency of hardware resources was improved. The designed end-to-end object detection acceleration system was deployed on UltraScale+ XCZU9EG FPGA. The result shows that it can achieve 96.6 GOPS throughput, 17.3 FPS detection frame rate and 4.12 W power consumption. The hardware resource utilization efficiency is 0.32 GOPS/DSP and 2.68 GOPS/kLUT. Maintaining efficient and accurate object detection capability, the utilization efficiency of hardware resources is better than other existing YOLOv3-Tiny object detection hardware accelerators.

Translated title of the contributionEfficient Hardware Acceleration System Design for End-to-End Object Detection Neural Network
Original languageChinese (Traditional)
Pages (from-to)1312-1320
Number of pages9
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume42
Issue number12
DOIs
Publication statusPublished - Dec 2022

Fingerprint

Dive into the research topics of 'Efficient Hardware Acceleration System Design for End-to-End Object Detection Neural Network'. Together they form a unique fingerprint.

Cite this