Acceleration of radar echo coherent accumulation system based on half-precision format and tensor core

Luming Wang, Defeng Chen, Dongliang Wang, Chao Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The processing speed of radar echo coherent accumulation system is an important factor affecting the real-time performance of space target detection. In this paper, based on GPU V100, adopting the concept of half-precision and tensor core, we design the radar echo coherent accumulation system and achieve the acceleration effect. The design of the system includes optimizing the process of coherent accumulation system, designing the scaling coefficient and using tcFFT library to realize FFT with the method of WMMA. We use FP32, FPl6 and FP16tensor core to compare the speed of coherent accumulation system. In FP32 and FP16, we use CUFFT library to realize FFT operation, and in FP16tensor core, we call tcFFT library to realize FFT operation. Nsight Compute is used to test the speed. The test results show that: (a) The time of creating FFT plan in tcFFT is less than CUFFT. (b) In the case of single batch, FP16 achieves 1.18X-1.39X acceleration effect compared with FP32 in the whole coherent accumulation process; In the case of multiple batches, the parallel batch processing method is proposed, and in two-dimensional FFT, compared with FP16, FP16tensor core can achieve 2.23X-3.17X acceleration effect, in the whole phase-coherent accumulation process, it can achieve 1.54X-1.77X acceleration effect.

Original languageEnglish
Title of host publicationIMCEC 2022 - IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference
EditorsBing Xu, Bing Xu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages990-995
Number of pages6
ISBN (Electronic)9781665479677
DOIs
Publication statusPublished - 2022
Event5th IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, IMCEC 2022 - Chongqing, China
Duration: 16 Dec 202218 Dec 2022

Publication series

NameIMCEC 2022 - IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference

Conference

Conference5th IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, IMCEC 2022
Country/TerritoryChina
CityChongqing
Period16/12/2218/12/22

Keywords

  • GPU
  • coherent accumulation system
  • half precision
  • tensor core

Fingerprint

Dive into the research topics of 'Acceleration of radar echo coherent accumulation system based on half-precision format and tensor core'. Together they form a unique fingerprint.

Cite this