Abstract
A wireless federated learning system is investigated in which a server and multiple workers exchange uncoded information via orthogonal wireless channels. Since the workers frequently upload local gradients to the server over band-limited channels, the uplink transmission from the workers to the server becomes a communication bottleneck. Therefore, one-shot distributed principal component analysis (PCA) is leveraged to reduce the dimension of the uploaded gradients and relieve the communication bottleneck. A PCA-based wireless federated learning (PCA-WFL) algorithm and its accelerated version (i.e., PCA-AWFL) are proposed based on the low-dimensional gradients and Nesterov's momentum. For non-convex empirical risks, a finite-time analysis is performed to quantify the impacts of system hyper-parameters on the convergence of the PCA-WFL and PCA-AWFL algorithms. The PCA-AWFL algorithm is theoretically certified to converge faster than the PCA-WFL algorithm. Moreover, the convergence rates of the PCA-WFL and PCA-AWFL algorithms quantitatively reveal a linear speedup with respect to the number of workers over the vanilla gradient descent algorithm. Numerical results demonstrate the improved convergence rates of the proposed PCA-WFL and PCA-AWFL algorithms over the benchmarks.
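The abstract outlines the core mechanism: each worker projects its local gradient onto a low-dimensional PCA subspace before the band-limited uplink, and the server aggregates the compressed gradients and applies a Nesterov-accelerated update. Below is a minimal Python/NumPy sketch of this idea. It is not the paper's PCA-WFL/PCA-AWFL implementation: the quadratic per-worker losses, the rank `k`, the step size `lr`, and the momentum factor `beta` are all illustrative assumptions.

```python
import numpy as np

n_workers, dim, k = 8, 100, 5    # workers, gradient dimension, PCA rank (assumed)
lr, beta = 0.1, 0.9              # step size and momentum factor (assumed)

def local_gradient(w, worker):
    # Stand-in quadratic loss per worker; a real deployment computes this on local data.
    target = np.full(dim, float(worker))
    return w - target

# One-shot distributed PCA (sketch): fit a shared projection once from the
# workers' initial gradients instead of re-fitting it every round.
G0 = np.stack([local_gradient(np.zeros(dim), i) for i in range(n_workers)])
_, _, Vt = np.linalg.svd(G0 - G0.mean(axis=0), full_matrices=False)
P = Vt[:k].T                     # dim x k projection shared by server and workers

w = np.zeros(dim)                # global model at the server
v = np.zeros(dim)                # momentum buffer

for t in range(100):
    # Workers evaluate gradients at the look-ahead point and project them to
    # k dimensions before the band-limited uplink.
    uplink = [P.T @ local_gradient(w + beta * v, i) for i in range(n_workers)]
    # Server averages the low-dimensional gradients and lifts them back to dim.
    g_hat = P @ np.mean(uplink, axis=0)
    # Nesterov-style momentum update (the accelerated variant in this sketch).
    v = beta * v - lr * g_hat
    w = w + v

# The average of the workers' targets minimizes the aggregate quadratic loss.
print("distance to optimum:", np.linalg.norm(w - np.full(dim, (n_workers - 1) / 2)))
```

In this sketch the projection is fitted once ("one-shot") and then reused, which is what keeps the per-round uplink cost at k rather than dim values per worker.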
Original language | English
---|---
Pages (from-to) | 1
Number of pages | 1
Journal | IEEE Transactions on Wireless Communications
Publication status | Accepted/In press - 2023
Keywords
- Federated learning
- distributed principal component analysis
- momentum acceleration