TY - JOUR
T1 - Predicting optimal sparse general matrix-matrix multiplication algorithm on GPUs
AU - Wei, Bingxin
AU - Wang, Yizhuo
AU - Chang, Fangli
AU - Gao, Jianhua
AU - Ji, Weixing
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/5
Y1 - 2024/5
N2 - Sparse General Matrix-Matrix Multiplication (SpGEMM) plays an important role in many applications. Many efficient algorithms have been proposed to improve the performance of SpGEMM on GPUs; however, the performance of each algorithm varies considerably across matrices with different structures, and no single algorithm achieves optimal SpGEMM performance on all matrices. In this article, we design a machine learning-based approach for predicting the optimal SpGEMM algorithm for a given input matrix. By extracting features from input matrices, we use LightGBM and XGBoost to train lightweight models that predict the best-performing algorithm with low inference overhead and high accuracy. We also investigate the impact of tree depth on model accuracy and inference overhead. Our evaluation shows that prediction accuracy increases with tree depth, reaching a maximum of approximately 85%, while inference overhead grows to approximately 10 µs. Compared with state-of-the-art algorithms on three GPU platforms, our method achieves better overall performance.
AB - Sparse General Matrix-Matrix Multiplication (SpGEMM) plays an important role in many applications. Many efficient algorithms have been proposed to improve the performance of SpGEMM on GPUs; however, the performance of each algorithm varies considerably across matrices with different structures, and no single algorithm achieves optimal SpGEMM performance on all matrices. In this article, we design a machine learning-based approach for predicting the optimal SpGEMM algorithm for a given input matrix. By extracting features from input matrices, we use LightGBM and XGBoost to train lightweight models that predict the best-performing algorithm with low inference overhead and high accuracy. We also investigate the impact of tree depth on model accuracy and inference overhead. Our evaluation shows that prediction accuracy increases with tree depth, reaching a maximum of approximately 85%, while inference overhead grows to approximately 10 µs. Compared with state-of-the-art algorithms on three GPU platforms, our method achieves better overall performance.
KW - Sparse general matrix-matrix multiplication
KW - gradient boosting decision tree
KW - graphics processing unit
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85184234341&partnerID=8YFLogxK
U2 - 10.1177/10943420241231928
DO - 10.1177/10943420241231928
M3 - Article
AN - SCOPUS:85184234341
SN - 1094-3420
VL - 38
SP - 245
EP - 259
JO - International Journal of High Performance Computing Applications
JF - International Journal of High Performance Computing Applications
IS - 3
ER -