Design and Implementation of CNN Accelerator Based on FPGA

Xin Guan, Zhanqing Wang*, Hao Fang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

As one of the important research directions in the field of computer vision, CNN based object detection methods are constantly being updated, which puts higher requirements on the computational performance of hardware acceleration platforms. FPGA achieves a balance between acceleration performance, power consumption, and configurability, making it a good compromise for hardware accelerator platform options. Based on this, this paper implements a FPGA-based CNN accelerator with certain universality by designing universal operators. This paper mainly uses ping-pong buffer mechanism to optimize the loading of parameters and data and at the same time uses pipeline to optimize the calculation. Based on the concept of reverse positioning and sequence calculation, the operator is designed to achieve weight reuse and output feature map calculation. YOLOv4-Tiny is built on ZYNQ7020 for verification, which has an acceleration ratio of 1162 compared to using only the ARM processor and has an energy efficiency ratio of 2.847GOPS/W compared to GPU's 2.311GOPS/W.

Original languageEnglish
Title of host publicationProceedings of the 43rd Chinese Control Conference, CCC 2024
EditorsJing Na, Jian Sun
PublisherIEEE Computer Society
Pages8969-8974
Number of pages6
ISBN (Electronic)9789887581581
DOIs
Publication statusPublished - 2024
Event43rd Chinese Control Conference, CCC 2024 - Kunming, China
Duration: 28 Jul 202431 Jul 2024

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference43rd Chinese Control Conference, CCC 2024
Country/TerritoryChina
CityKunming
Period28/07/2431/07/24

Keywords

  • CNN Accelerator
  • FPGA
  • Universal Operators
  • YOLOv4-Tiny

Fingerprint

Dive into the research topics of 'Design and Implementation of CNN Accelerator Based on FPGA'. Together they form a unique fingerprint.

Cite this