Abstract
A parallel scheme that combines the OpenMP and the vector arithmetic logic unit (VALU) hardware acceleration is presented to speed up the multilevel fast multipole algorithm (MLFMA) on shared-memory computers. Performance of the hybrid parallel OpenMP-VALU MLFMA is investigated and several strategies are employed to improve the overall speedup and parallel efficiency. Effectiveness of the hybrid parallel scheme is verified by numerical results of the electromagnetic (EM) scattering examples, and related numerical stability issue is discussed as well.
Original language | English |
---|---|
Article number | 6808505 |
Pages (from-to) | 3884-3889 |
Number of pages | 6 |
Journal | IEEE Transactions on Antennas and Propagation |
Volume | 62 |
Issue number | 7 |
DOIs | |
Publication status | Published - Jul 2014 |
Keywords
- Electromagnetic scattering
- OpenMP
- multilevel fast multipole algorithm
- parallel computing
- vector arithmetic logic unit (VALU)