Efficient and Structural Gradient Compression with Principal Component Analysis for Distributed Training

Jiaxin Tan, Chao Yao, Zehua Guo*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Distributed machine learning is a promising approach for both academia and industry: it builds a machine learning model from training data dispersed across many nodes via iterative, distributed training. To speed up this training process, it is essential to reduce the communication load among training nodes. In this paper, we propose a layer-wise gradient compression scheme based on principal component analysis (PCA) and error accumulation. The key to our solution is to exploit the gradient characteristics and the architecture of neural networks, combining the compression ability of PCA with the feedback ability of error accumulation. Preliminary results on an image classification task show that our scheme achieves good performance while reducing gradient transmission by 97%.
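The abstract describes per-layer PCA compression combined with an error-feedback buffer. Below is a minimal, illustrative sketch (not the authors' implementation) of how such a scheme could look: each layer's gradient is approximated by a truncated SVD as the PCA step, and the part the approximation misses is accumulated and added back in the next round. Names such as `PCACompressor` and the `rank` parameter are assumptions for illustration only.

```python
# Hypothetical sketch of layer-wise PCA gradient compression with error
# accumulation; details (rank choice, layer handling) are assumptions,
# not taken from the paper.
import numpy as np

class PCACompressor:
    """Per-layer compressor that keeps a residual (error) buffer per layer."""

    def __init__(self, rank=4):
        self.rank = rank            # number of principal components to keep
        self.residuals = {}         # layer name -> accumulated compression error

    def compress(self, name, grad):
        """Compress a 2-D gradient matrix; return its low-rank factors."""
        # Error feedback: add back what was lost in the previous round.
        g = grad + self.residuals.get(name, np.zeros_like(grad))
        # Truncated SVD as the PCA step: keep the top-`rank` components.
        u, s, vt = np.linalg.svd(g, full_matrices=False)
        k = min(self.rank, len(s))
        u_k, s_k, vt_k = u[:, :k], s[:k], vt[:k, :]
        # Accumulate the part the low-rank approximation failed to capture.
        self.residuals[name] = g - (u_k * s_k) @ vt_k
        return u_k, s_k, vt_k       # only these factors are transmitted

    @staticmethod
    def decompress(u_k, s_k, vt_k):
        """Reconstruct the approximate gradient on the receiving side."""
        return (u_k * s_k) @ vt_k


# Usage: compress the gradient of a 256 x 512 fully connected layer.
compressor = PCACompressor(rank=4)
grad = np.random.randn(256, 512).astype(np.float32)
u, s, vt = compressor.compress("fc1.weight", grad)
sent = u.size + s.size + vt.size
print(f"fraction of values transmitted: {sent / grad.size:.3f}")  # ~(m+n)k/(m*n)
recovered = PCACompressor.decompress(u, s, vt)
```

In this sketch, transmitting the rank-k factors of an m x n gradient costs roughly (m + n)k values instead of mn, which is where the large reduction in communication comes from; the residual buffer plays the role of the error accumulation described in the abstract.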

Original language: English
Title of host publication: Proceedings of the 7th Asia-Pacific Workshop on Networking, APNET 2023
Publisher: Association for Computing Machinery, Inc
Pages: 217-218
Number of pages: 2
ISBN (Electronic): 9798400707827
DOIs
Publication status: Published - 29 Jun 2023
Event: 7th Asia-Pacific Workshop on Networking, APNET 2023 - Hong Kong, China
Duration: 29 Jun 2023 – 30 Jun 2023

Publication series

Name: Proceedings of the 7th Asia-Pacific Workshop on Networking, APNET 2023

Conference

Conference: 7th Asia-Pacific Workshop on Networking, APNET 2023
Country/Territory: China
City: Hong Kong
Period: 29/06/23 – 30/06/23
