High-Throughput and Energy-Efficient FPGA-Based Accelerator for All Adder Neural Networks

Ning Zhang, Shuo Ni, Liang Chen, Tong Wang, He Chen*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

Neural networks have been extensively applied across various Internet of Things (IoT) applications, such as drone- and satellite-based remote sensing and autonomous driving. As the resolution and volume of data captured by sensors grow, the demand for real-time response in IoT applications is increasing markedly. However, existing convolutional neural network (CNN) accelerators for IoT applications on field-programmable gate array (FPGA) platforms struggle to achieve high throughput because of the inherently dense multiplication operations of CNNs, memory bandwidth limitations, and inefficient mapping mechanisms. In this article, a high-throughput and energy-efficient all adder neural network (A2NN) accelerator for IoT applications on an FPGA platform is proposed to solve this problem. First, a series of hardware-oriented algorithm optimization methods is proposed to simplify the processing flow of A2NN and further minimize its deployment overhead. Second, a novel hardware architecture based on the idea of near-memory computation (NMC) is proposed to eliminate off-chip memory access completely and accelerate the reconstructed A2NN in a pipelined manner. Third, a set of quantitative analysis methods for the proposed accelerator is presented to balance throughput and energy consumption, allowing the accelerator to adapt to the varying demands of different IoT application scenarios. Extensive experimental results on the AMD-Xilinx VC709 board demonstrate that the proposed accelerator achieves state-of-the-art performance in terms of throughput, energy efficiency, and throughput efficiency. Moreover, experiments on the AMD-Xilinx KV260 board highlight the architecture’s scalability and energy efficiency, enabling a balance between speed and power consumption tailored to the specific requirements of IoT application scenarios.
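To make the contrast with multiplication-dense CNNs concrete, the following minimal sketch compares a conventional multiply-accumulate response with an adder-style response that uses only subtraction, absolute value, and addition. The negative L1-distance form is an assumption borrowed from adder networks in general, not a formula taken from this article, and the function names and sample values are hypothetical.

```python
# Illustrative only: contrasts a multiply-accumulate (CNN-style) response with
# an adder-style response built from subtraction, absolute value and addition.
# ASSUMPTION: the negative L1-distance form comes from adder networks in
# general; it is not taken from the A2NN article itself.


def mac_response(patch, kernel):
    """Conventional convolution response: one multiplication per weight."""
    return sum(x * w for x, w in zip(patch, kernel))


def adder_response(patch, kernel):
    """Adder-style response: negative L1 distance, no multiplications."""
    return -sum(abs(x - w) for x, w in zip(patch, kernel))


if __name__ == "__main__":
    patch = [3, -1, 4, 0]   # hypothetical flattened input patch
    kernel = [2, -1, 1, 5]  # hypothetical flattened filter weights
    print("multiply-accumulate:", mac_response(patch, kernel))
    print("adder-style:", adder_response(patch, kernel))
```

In hardware terms, the second function needs only adders and comparators per weight, which is the kind of saving that motivates multiplication-free networks on FPGAs.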

Original language: English
Pages (from-to): 20357-20376
Number of pages: 20
Journal: IEEE Internet of Things Journal
Volume: 12
Issue number: 12
Publication status: Published - 2025

Keywords

  • All adder neural network
  • accelerator
  • energy efficient
  • field-programmable gate array (FPGA)
  • high throughput
  • low power
