TY - GEN
T1 - Machine learning approach for the predicting performance of SpMV on GPU
AU - Benatia, Akrem
AU - Ji, Weixing
AU - Wang, Yizhuo
AU - Shi, Feng
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/7/2
Y1 - 2016/7/2
N2 - The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats have recently been proposed to optimize this kernel on GPUs. Since SpMV performance varies significantly with the sparsity characteristics of the input matrix and the hardware features, developing an accurate performance model for this kernel is a challenging task. The traditional approach of building such models analytically is difficult in practice and requires a thorough understanding of the interaction between the GPU hardware and the sparse code. In this paper, we propose a machine learning approach to predict the performance of the SpMV kernel for several sparse formats (COO, CSR, ELL, and HYB) on GPU. We use two popular machine learning algorithms, Support Vector Regression (SVR) and the Multilayer Perceptron neural network (MLP). Our experimental results on two different GPUs (Fermi GTX 512 and Maxwell GTX 980 Ti) show that the SVR models deliver the best accuracy, with an average prediction error ranging between 7% and 14%.
AB - The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats have recently been proposed to optimize this kernel on GPUs. Since SpMV performance varies significantly with the sparsity characteristics of the input matrix and the hardware features, developing an accurate performance model for this kernel is a challenging task. The traditional approach of building such models analytically is difficult in practice and requires a thorough understanding of the interaction between the GPU hardware and the sparse code. In this paper, we propose a machine learning approach to predict the performance of the SpMV kernel for several sparse formats (COO, CSR, ELL, and HYB) on GPU. We use two popular machine learning algorithms, Support Vector Regression (SVR) and the Multilayer Perceptron neural network (MLP). Our experimental results on two different GPUs (Fermi GTX 512 and Maxwell GTX 980 Ti) show that the SVR models deliver the best accuracy, with an average prediction error ranging between 7% and 14%.
KW - GPU computing
KW - Multilayer Perceptron (MLP)
KW - Performance modeling
KW - Sparse Matrix-Vector multiplication (SpMV)
KW - Support Vector Regression (SVR)
UR - http://www.scopus.com/inward/record.url?scp=85018508629&partnerID=8YFLogxK
U2 - 10.1109/ICPADS.2016.0120
DO - 10.1109/ICPADS.2016.0120
M3 - Conference contribution
AN - SCOPUS:85018508629
T3 - Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
SP - 894
EP - 901
BT - Proceedings - 22nd IEEE International Conference on Parallel and Distributed Systems, ICPADS 2016
A2 - Liao, Xiaofei
A2 - Lovas, Robert
A2 - Shen, Xipeng
A2 - Zheng, Ran
PB - IEEE Computer Society
T2 - 22nd IEEE International Conference on Parallel and Distributed Systems, ICPADS 2016
Y2 - 13 December 2016 through 16 December 2016
ER -