High-Throughput Energy-Efficient Accelerator With Collaborative-Trainable Sparse-Quantization Method for On-Board Remote Sensing Processing

  • Tong Wang
  • , He Chen
  • , Ning Zhang*
  • , Shuo Ni
  • , Xi Zhang
  • , Liang Chen
  • , Wei Li
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Convolutional neural networks (CNNs) have achieved remarkable breakthroughs on remote sensing tasks in recent years. However, deploying CNNs for real-time remote sensing on-board processing still remains a challenge due to power consumption, real-time, and other limitations. Therefore, in this article, a satellite-based real-time remote sensing accelerator is proposed, where algorithm and hardware approaches are proposed to jointly optimize CNNs’ deployment on edge-side aerospace devices. First, a collaborative-trainable sparse-quantization (CTSQ) method is proposed to reduce the model’s storage overhead. In the CTSQ method, analysis of the errors is performed for the sparsity-quantization composition. Besides, the interchannel correlations among parameters are leveraged, where the structured sparsity and quantization are performed with fine-grained units. Second, a modular-system co-optimized (MoSyC) architecture is proposed. A hardware-mapped sparse access (HMSA) strategy is proposed to effectively filter out zero elements in sparse parameters. Moreover, a high-throughput architecture is designed for parallel and pipelined data flow control. Finally, extensive experiments are conducted on both scene classification and object detection tasks with ResNet and YOLOv5 models. The results show that the proposed CTSQ method achieves the compression ratio of more than 13.81× , and the proposed MoSyC architecture achieves the throughput of more than 1815 giga operations per second (GOPS), demonstrating the effectiveness of the proposed accelerator.

Original languageEnglish
Article number5646518
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume63
DOIs
Publication statusPublished - 2025
Externally publishedYes

Keywords

  • Convolutional neural network (CNN)
  • field programmable gate array (FPGA)
  • quantization
  • real-time
  • remote sensing
  • sparsity

Fingerprint

Dive into the research topics of 'High-Throughput Energy-Efficient Accelerator With Collaborative-Trainable Sparse-Quantization Method for On-Board Remote Sensing Processing'. Together they form a unique fingerprint.

Cite this