Algorithm parallelization and optimization of CoSMPs system based on the HPM model

Yu Gang Li*, Xiang Zhen Qiao, Zhi Yong Liu

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

Abstract

First, the architectural characteristics of clusters of SMPs (CoSMPs) are investigated on the basis of the parallel computing model HPM. Focusing on two aspects of the CoSMPs architecture, parallelism and locality (storage and communication), the main factors that influence the performance of parallel applications are analyzed, and the problems of how to parallelize and optimize applications are examined. The merits and demerits of two programming modes, the MPI mode and the MPI+SMP (OMP) directive mode, are compared. Then, techniques for parallelizing and optimizing applications on CoSMPs are discussed in detail. Finally, the performance of two communication modes (the loop-exchange mode and the border-exchange mode) is evaluated on an instance of a cluster of SMPs, the Dawning 3000 supercomputer. The methods are tested on two examples, matrix multiplication for the loop-exchange mode and the five-point algorithm for the border-exchange mode, and the results are consistent with the theoretical conclusions.

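Original language: English
Pages (from-to): 621-629
Number of pages: 9
Journal: Jisuanji Yanjiu yu Fazhan/Computer Research and Development
Volume: 41
Issue number: 4
Publication status: Published - Apr 2004
Externally published: Yes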
