BestSF: A sparse meta-format for optimizing SpMV on GPU

Akrem Benatia; Weixing Ji; Yizhuo Wang; Feng Shi

doi:10.1145/3226228

BestSF: A sparse meta-format for optimizing SpMV on GPU

Akrem Benatia, Weixing Ji, Yizhuo Wang, Feng Shi

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

24 Citations (Scopus)

Abstract

The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats were proposed to improve this kernel on the recent GPU architectures. However, it has been widely observed that there is no “best-for-all” sparse format for the SpMV kernel on GPU. Indeed, serious performance degradation of an order of magnitude can be observed without a careful selection of the sparse format to use. To address this problem, we propose in this article BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. To do so, BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures using a large number of real-world sparse matrices show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experimental investigations revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To prove its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.

Original language	English
Article number	29
Journal	Transactions on Architecture and Code Optimization
Volume	15
Issue number	3
DOIs	https://doi.org/10.1145/3226228
Publication status	Published - Aug 2018

Keywords

Energy efficiency
GPU computing
Iterative solvers
Performance modeling
Sparse matrix-vector multiplication (SpMV)

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1145/3226228

Cite this

@article{2ded737436a9475489b93053c1b30eb4,

title = "BestSF: A sparse meta-format for optimizing SpMV on GPU",

abstract = "The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats were proposed to improve this kernel on the recent GPU architectures. However, it has been widely observed that there is no “best-for-all” sparse format for the SpMV kernel on GPU. Indeed, serious performance degradation of an order of magnitude can be observed without a careful selection of the sparse format to use. To address this problem, we propose in this article BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. To do so, BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures using a large number of real-world sparse matrices show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experimental investigations revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To prove its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.",

keywords = "Energy efficiency, GPU computing, Iterative solvers, Performance modeling, Sparse matrix-vector multiplication (SpMV)",

author = "Akrem Benatia and Weixing Ji and Yizhuo Wang and Feng Shi",

note = "Publisher Copyright: {\textcopyright} 2018 Association for Computing Machinery.",

year = "2018",

month = aug,

doi = "10.1145/3226228",

language = "English",

volume = "15",

journal = "Transactions on Architecture and Code Optimization",

issn = "1544-3566",

publisher = "Association for Computing Machinery (ACM)",

number = "3",

}

TY - JOUR

T1 - BestSF

T2 - A sparse meta-format for optimizing SpMV on GPU

AU - Benatia, Akrem

AU - Ji, Weixing

AU - Wang, Yizhuo

AU - Shi, Feng

PY - 2018/8

Y1 - 2018/8

N2 - The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats were proposed to improve this kernel on the recent GPU architectures. However, it has been widely observed that there is no “best-for-all” sparse format for the SpMV kernel on GPU. Indeed, serious performance degradation of an order of magnitude can be observed without a careful selection of the sparse format to use. To address this problem, we propose in this article BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. To do so, BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures using a large number of real-world sparse matrices show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experimental investigations revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To prove its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.

AB - The Sparse Matrix-Vector Multiplication (SpMV) kernel dominates the computing cost in numerous scientific applications. Many implementations based on different sparse formats were proposed to improve this kernel on the recent GPU architectures. However, it has been widely observed that there is no “best-for-all” sparse format for the SpMV kernel on GPU. Indeed, serious performance degradation of an order of magnitude can be observed without a careful selection of the sparse format to use. To address this problem, we propose in this article BestSF (Best Sparse Format), a new learning-based sparse meta-format that automatically selects the most appropriate sparse format for a given input matrix. To do so, BestSF relies on a cost-sensitive classification system trained using Weighted Support Vector Machines (WSVMs) to predict the best sparse format for each input sparse matrix. Our experimental results on two different NVIDIA GPU architectures using a large number of real-world sparse matrices show that BestSF achieved a noticeable overall performance improvement over using a single sparse format. While BestSF is trained to select the best sparse format in terms of performance (GFLOPS), our further experimental investigations revealed that using BestSF also led, in most of the test cases, to the best energy efficiency (MFLOPS/W). To prove its practical effectiveness, we also evaluate the performance and energy efficiency improvement achieved when using BestSF as a building block in a GPU-based Preconditioned Conjugate Gradient (PCG) iterative solver.

KW - Energy efficiency

KW - GPU computing

KW - Iterative solvers

KW - Performance modeling

KW - Sparse matrix-vector multiplication (SpMV)

UR - http://www.scopus.com/inward/record.url?scp=85053543948&partnerID=8YFLogxK

U2 - 10.1145/3226228

DO - 10.1145/3226228

M3 - Article

AN - SCOPUS:85053543948

SN - 1544-3566

VL - 15

JO - Transactions on Architecture and Code Optimization

JF - Transactions on Architecture and Code Optimization

IS - 3

M1 - 29

ER -

BestSF: A sparse meta-format for optimizing SpMV on GPU

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this