Load Balancing Optimizations for Distributed GMRES Algorithm

Yuxiang Zhang, Shuaizhe Guo, Jianhua Gao*, Weixing Ji, Yizhuo Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The Generalized Minimal Residual Method (GMRES) is one of the most important iterative algorithms for solving large-scale sparse linear systems, which arise widely in fields such as computational fluid dynamics and computational electromagnetics. As problem sizes grow, multi-node distributed systems have become one of the most popular execution environments. Communication efficiency is usually the primary performance bottleneck for distributed GMRES. Prior work reduces communication load mainly by balancing the computation load, but balancing the communication load itself is also important. This paper proposes a rule-based algorithm and a reinforcement learning (RL)-based algorithm to balance the communication load. The rule-based algorithm optimizes the partitioning of sparse matrices, improving the balance of both computation and communication loads among devices; experimental results show a speedup of up to 1.34x. The RL-based algorithm improves the efficiency of the iterative algorithm by optimizing the task allocation of the partitioned sub-matrices; experimental results show a speedup of up to 1.30x.
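The distinction the abstract draws between computation load and communication load can be made concrete with a small sketch. This is not the paper's rule-based or RL-based algorithm; it only measures the two kinds of load for a given partition. It assumes a 1D row-block partition, counts computation as the number of nonzeros each part multiplies in a sparse matrix-vector product, and counts communication as the number of distinct vector entries a part must fetch from other parts. The function names `owner`, `partition_loads`, and `imbalance` are hypothetical.

```python
from bisect import bisect_right

def owner(index, starts):
    """Return the part owning a row/vector index, given each part's first index."""
    return bisect_right(starts, index) - 1

def partition_loads(nonzeros, starts):
    """Per-part (computation, communication) loads for a row-block partition.

    nonzeros: iterable of (row, col) positions of the sparse matrix.
    starts:   sorted first row index of each part, e.g. [0, 3] for two parts.
    Assumption: the vector is partitioned the same way as the rows.
    """
    parts = len(starts)
    compute = [0] * parts                    # nonzeros multiplied by each part
    remote = [set() for _ in range(parts)]   # vector entries owned by other parts
    for row, col in nonzeros:
        p = owner(row, starts)
        compute[p] += 1
        if owner(col, starts) != p:
            remote[p].add(col)
    return compute, [len(s) for s in remote]

def imbalance(loads):
    """Max load divided by mean load; 1.0 means perfectly balanced."""
    mean = sum(loads) / len(loads)
    return max(loads) / mean if mean else 1.0

# A made-up 6x6 sparsity pattern, split across two parts in two different ways.
nonzeros = [(0, 0), (0, 1), (1, 1), (1, 4), (2, 2), (2, 5),
            (3, 3), (4, 4), (5, 5), (5, 0)]
for starts in ([0, 3], [0, 2]):
    compute, comm = partition_loads(nonzeros, starts)
    print(starts, compute, comm, imbalance(compute), imbalance(comm))
```

In this toy pattern, splitting at row 3 gives loads of 6 and 4 nonzeros with 2 and 1 remote entries, while splitting at row 2 gives 4 and 6 nonzeros with 1 remote entry each. Moving the boundary changes which part does more computation and evens out the communication, which shows that the two balances are separate quantities and that a partitioner can trade one against the other.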

Original language: English
Title of host publication: Algorithms and Architectures for Parallel Processing - 24th International Conference, ICA3PP 2024, Macau, China, October 29–31, 2024, Proceedings
Editors: Tianqing Zhu, Jin Li, Aniello Castiglione
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 47-56
Number of pages: 10
ISBN (Print): 9789819615506
DOI: https://doi.org/10.1007/978-981-96-1551-3_5
Publication status: Published - 2025
Event: 24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024 - Macau, China
Duration: 29 Oct 2024 to 31 Oct 2024

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15256 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024
Country/Territory: China
City: Macau
Period: 29/10/24 to 31/10/24

Cite this

Zhang, Y., Guo, S., Gao, J., Ji, W., & Wang, Y. (2025). Load Balancing Optimizations for Distributed GMRES Algorithm. In T. Zhu, J. Li, & A. Castiglione (Eds.), Algorithms and Architectures for Parallel Processing - 24th International Conference, ICA3PP 2024, Macau, China, October 29–31, 2024, Proceedings (pp. 47-56). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15256 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-96-1551-3_5