An efficient FPGA-based implementation for quantized remote sensing image scene classification network

Xiaoli Zhang; Xin Wei; Qianbo Sang; He Chen; Yizhuang Xie

doi:10.3390/electronics9091344

An efficient FPGA-based implementation for quantized remote sensing image scene classification network

Xiaoli Zhang, Xin Wei, Qianbo Sang, He Chen, Yizhuang Xie^*

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

18 引用（Scopus）

摘要

Deep Convolutional Neural Network (DCNN)-based image scene classification models play an important role in a wide variety of remote sensing applications and achieve great success. However, the large-scale remote sensing images and the intensive computations make the deployment of these DCNN-based models on low-power processing systems (e.g., spaceborne or airborne) a challenging problem. To solve this problem, this paper proposes a high-performance Field-Programmable Gate Array (FPGA)-based DCNN accelerator by combining an efficient network compression scheme and reasonable hardware architecture. Firstly, this paper applies the network quantization to a high-accuracy remote sensing scene classification network, an improved oriented response network (IORN). The volume of the parameters and feature maps in the network is greatly reduced. Secondly, an efficient hardware architecture for network implementation is proposed. The architecture employs dual-channel Double Data Rate Synchronous Dynamic Random-Access Memory (DDR) access mode, rational on-chip data processing scheme and efficient processing engine design. Finally, we implement the quantized IORN (Q-IORN) with the proposed architecture on a Xilinx VC709 development board. The experimental results show that the proposed accelerator has 88.31% top-1 classification accuracy and achieves a throughput of 209.60 Giga-Operations Per Second (GOP/s) with a 6.32 W on-chip power consumption at 200 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.

源语言	英语
文章编号	1344
页（从-至）	1-20
页数	20
期刊	Electronics (Switzerland)
卷	9
期	9
DOI	https://doi.org/10.3390/electronics9091344
出版状态	已出版 - 9月 2020

联合国可持续发展目标

此成果有助于实现下列可持续发展目标：

访问文件

10.3390/electronics9091344

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, X., Wei, X., Sang, Q., Chen, H., & Xie, Y. (2020). An efficient FPGA-based implementation for quantized remote sensing image scene classification network. Electronics (Switzerland), 9(9), 1-20. 文章 1344. https://doi.org/10.3390/electronics9091344

@article{bf9752544213464fae3437635eb46069,

title = "An efficient FPGA-based implementation for quantized remote sensing image scene classification network",

abstract = "Deep Convolutional Neural Network (DCNN)-based image scene classification models play an important role in a wide variety of remote sensing applications and achieve great success. However, the large-scale remote sensing images and the intensive computations make the deployment of these DCNN-based models on low-power processing systems (e.g., spaceborne or airborne) a challenging problem. To solve this problem, this paper proposes a high-performance Field-Programmable Gate Array (FPGA)-based DCNN accelerator by combining an efficient network compression scheme and reasonable hardware architecture. Firstly, this paper applies the network quantization to a high-accuracy remote sensing scene classification network, an improved oriented response network (IORN). The volume of the parameters and feature maps in the network is greatly reduced. Secondly, an efficient hardware architecture for network implementation is proposed. The architecture employs dual-channel Double Data Rate Synchronous Dynamic Random-Access Memory (DDR) access mode, rational on-chip data processing scheme and efficient processing engine design. Finally, we implement the quantized IORN (Q-IORN) with the proposed architecture on a Xilinx VC709 development board. The experimental results show that the proposed accelerator has 88.31% top-1 classification accuracy and achieves a throughput of 209.60 Giga-Operations Per Second (GOP/s) with a 6.32 W on-chip power consumption at 200 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.",

keywords = "Accelerator, DCNN, FPGA, Quantization, Remote sensing image, Scene classification",

author = "Xiaoli Zhang and Xin Wei and Qianbo Sang and He Chen and Yizhuang Xie",

note = "Publisher Copyright: {\textcopyright} 2020 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2020",

month = sep,

doi = "10.3390/electronics9091344",

language = "English",

volume = "9",

pages = "1--20",

journal = "Electronics (Switzerland)",

issn = "2079-9292",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "9",

}

TY - JOUR

T1 - An efficient FPGA-based implementation for quantized remote sensing image scene classification network

AU - Zhang, Xiaoli

AU - Wei, Xin

AU - Sang, Qianbo

AU - Chen, He

AU - Xie, Yizhuang

PY - 2020/9

Y1 - 2020/9

N2 - Deep Convolutional Neural Network (DCNN)-based image scene classification models play an important role in a wide variety of remote sensing applications and achieve great success. However, the large-scale remote sensing images and the intensive computations make the deployment of these DCNN-based models on low-power processing systems (e.g., spaceborne or airborne) a challenging problem. To solve this problem, this paper proposes a high-performance Field-Programmable Gate Array (FPGA)-based DCNN accelerator by combining an efficient network compression scheme and reasonable hardware architecture. Firstly, this paper applies the network quantization to a high-accuracy remote sensing scene classification network, an improved oriented response network (IORN). The volume of the parameters and feature maps in the network is greatly reduced. Secondly, an efficient hardware architecture for network implementation is proposed. The architecture employs dual-channel Double Data Rate Synchronous Dynamic Random-Access Memory (DDR) access mode, rational on-chip data processing scheme and efficient processing engine design. Finally, we implement the quantized IORN (Q-IORN) with the proposed architecture on a Xilinx VC709 development board. The experimental results show that the proposed accelerator has 88.31% top-1 classification accuracy and achieves a throughput of 209.60 Giga-Operations Per Second (GOP/s) with a 6.32 W on-chip power consumption at 200 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.

AB - Deep Convolutional Neural Network (DCNN)-based image scene classification models play an important role in a wide variety of remote sensing applications and achieve great success. However, the large-scale remote sensing images and the intensive computations make the deployment of these DCNN-based models on low-power processing systems (e.g., spaceborne or airborne) a challenging problem. To solve this problem, this paper proposes a high-performance Field-Programmable Gate Array (FPGA)-based DCNN accelerator by combining an efficient network compression scheme and reasonable hardware architecture. Firstly, this paper applies the network quantization to a high-accuracy remote sensing scene classification network, an improved oriented response network (IORN). The volume of the parameters and feature maps in the network is greatly reduced. Secondly, an efficient hardware architecture for network implementation is proposed. The architecture employs dual-channel Double Data Rate Synchronous Dynamic Random-Access Memory (DDR) access mode, rational on-chip data processing scheme and efficient processing engine design. Finally, we implement the quantized IORN (Q-IORN) with the proposed architecture on a Xilinx VC709 development board. The experimental results show that the proposed accelerator has 88.31% top-1 classification accuracy and achieves a throughput of 209.60 Giga-Operations Per Second (GOP/s) with a 6.32 W on-chip power consumption at 200 MHz. The comparison results with off-the-shelf devices and recent state-of-the-art implementations illustrate that the proposed accelerator has obvious advantages in terms of energy efficiency.

KW - Accelerator

KW - DCNN

KW - FPGA

KW - Quantization

KW - Remote sensing image

KW - Scene classification

UR - http://www.scopus.com/inward/record.url?scp=85089655904&partnerID=8YFLogxK

U2 - 10.3390/electronics9091344

DO - 10.3390/electronics9091344

M3 - Article

AN - SCOPUS:85089655904

SN - 2079-9292

VL - 9

SP - 1

EP - 20

JO - Electronics (Switzerland)

JF - Electronics (Switzerland)

IS - 9

M1 - 1344

ER -

An efficient FPGA-based implementation for quantized remote sensing image scene classification network

摘要

联合国可持续发展目标

访问文件

其它文件与链接

指纹

引用此