TY - JOUR
T1 - Hardware accelerator for high accuracy sign language recognition with residual network based on FPGAs
AU - Yang, Dong
AU - Li, Jianwu
AU - Hao, Guocun
AU - Chen, Qirui
AU - Wei, Xi
AU - Dai, Zirui
AU - Hou, Zixian
AU - Zhang, Lei
AU - Li, Xiaoran
N1 - Publisher Copyright:
Copyright © 2024 The Institute of Electronics, Information and Communication Engineers.
PY - 2024/2
Y1 - 2024/2
N2 - The ResNet series of networks has demonstrated powerful capabilities in the fields of object detection and image classification, garnering increasing attention from researchers. However, due to their deep network architectures, accelerator development based on FPGA faces challenges associated with limited on-chip resources and lengthy design cycles. This paper presents a versatile hardware acceleration system based on FPGA, achieving optimization through both hardware implementation and software inference architecture. The system reduces network complexity by employing techniques such as inter-layer fusion and dynamic quantization, while enhancing hardware resource utilization through channel parallelism and tightly-pipelined hardware design principles. By configuring and reusing computation units, the forward inference process of ResNet series networks can be rapidly deployed on FPGA, shortening the development and validation cycles. The proposed system is validated using the ResNet-18 model on a PYNQ-Z2 development board within a gesture recognition application scenario. The overall power consumption of the system is 2.136 W, with hardware inference accuracy reaching 98.87%.
AB - The ResNet series of networks has demonstrated powerful capabilities in the fields of object detection and image classification, garnering increasing attention from researchers. However, due to their deep network architectures, accelerator development based on FPGA faces challenges associated with limited on-chip resources and lengthy design cycles. This paper presents a versatile hardware acceleration system based on FPGA, achieving optimization through both hardware implementation and software inference architecture. The system reduces network complexity by employing techniques such as inter-layer fusion and dynamic quantization, while enhancing hardware resource utilization through channel parallelism and tightly-pipelined hardware design principles. By configuring and reusing computation units, the forward inference process of ResNet series networks can be rapidly deployed on FPGA, shortening the development and validation cycles. The proposed system is validated using the ResNet-18 model on a PYNQ-Z2 development board within a gesture recognition application scenario. The overall power consumption of the system is 2.136 W, with hardware inference accuracy reaching 98.87%.
KW - convolutional neural network (CNN)
KW - field programmable gate array (FPGA)
KW - hardware acceleration
KW - high-level synthesis (HLS)
KW - residual network
UR - http://www.scopus.com/inward/record.url?scp=85186476518&partnerID=8YFLogxK
U2 - 10.1587/elex.21.20230579
DO - 10.1587/elex.21.20230579
M3 - Article
AN - SCOPUS:85186476518
SN - 1349-2543
VL - 21
JO - IEICE Electronics Express
JF - IEICE Electronics Express
IS - 4
M1 - 20230579
ER -