ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs

Yi Wan, Xianzhong Xie, Junfan Chen, Kunpeng Xie, Dezhi Yi, Ye Lu*, Keke Gai*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Lightweight convolutional neural networks (CNNs) enable lower inference latency and data traffic, facilitating deployment on resource-constrained edge devices such as field-programmable gate arrays (FPGAs). However, CNNs inference requires access to off-chip synchronous dynamic random-access memory (SDRAM), which significantly degrades inference speed and system power efficiency. In this paper, we propose an adaptive dataflow scheduling method for lightweight CNN accelerator on FPGAs named ADS-CNN. The key idea of ADS-CNN is to efficiently utilize on-chip resources and reduce the amount of SDRAM access. To achieve the reuse of logical resources, we design a time division multiplexing calculation engine to be integrated in ADS-CNN. We implement a configurable module for the convolution controller to adapt to the data reuse of different convolution layers, thus reducing the off-chip access. Furthermore, we exploit on-chip memory blocks as buffers based on the configuration of different layers in lightweight CNNs. On the resource-constrained Intel CycloneV SoC 5CSEBA6 FPGA platform, we evaluated six common lightweight CNN models to demonstrate the performance advantages of ADS-CNN. The evaluation results indicate that, compared with accelerators that use traditional tiling strategy dataflow, our ADS-CNN can achieve up to 1.29× speedup with the overall dataflow scale compression of 23.7%.

Original languageEnglish
Pages (from-to)138-149
Number of pages12
JournalFuture Generation Computer Systems
Volume158
DOIs
Publication statusPublished - Sept 2024

Keywords

  • Accelerator
  • Adaptive dataflow
  • FPGA
  • Lightweight convolutional neural networks
  • Tiling strategy
  • Unified computing engine

Fingerprint

Dive into the research topics of 'ADS-CNN: Adaptive Dataflow Scheduling for lightweight CNN accelerator on FPGAs'. Together they form a unique fingerprint.

Cite this