An Asynchronous Parallel Implementation of Multilevel Fast Multipole Algorithm on GPU Cluster for 3D Electromagnetic Scattering Problems

Rong Ping Xi, We Jia He, Ming Lin Yang, Xin Qing Sheng

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents a CPU/GPU asynchronous computing pattern based improved parallel multilevel fast multipole algorithm (MLFMA) for 3D electromagnetic scattering problems on GPU Cluster. In the presented parallel implementation, the matrix assembly process of the MLFMA is decomposed into CPU execution and GPU execution two parts. The former is performed on CPU using OpenMP multi-threading programming model, while the latter is performed on GPU with CUDA programming model. The execution time between the two parts is overlapped by using the feature of asynchronous execution between CPU and GPU. The performance of the proposed parallel implementation is investigated in terms of accuracy and efficiency. Numerical results show that, with the proposed parallel approach, over 10% speed-up can be attained, compared with the original parallel implementation.

Original languageEnglish
Title of host publication2021 International Applied Computational Electromagnetics Society Symposium, ACES-China 2021, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781733509619
DOIs
Publication statusPublished - 28 Jul 2021
Event4th International Applied Computational Electromagnetics Society Symposium in China, ACES-China 2021 - Chengdu, China
Duration: 28 Jul 202131 Jul 2021

Publication series

Name2021 International Applied Computational Electromagnetics Society Symposium, ACES-China 2021, Proceedings

Conference

Conference4th International Applied Computational Electromagnetics Society Symposium in China, ACES-China 2021
Country/TerritoryChina
CityChengdu
Period28/07/2131/07/21

Keywords

  • Asynchronous Computing
  • CUDA
  • Multilevel fast multipole algorithm
  • OpenMP
  • scattering

Fingerprint

Dive into the research topics of 'An Asynchronous Parallel Implementation of Multilevel Fast Multipole Algorithm on GPU Cluster for 3D Electromagnetic Scattering Problems'. Together they form a unique fingerprint.

Cite this