Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning

Jude Tchaye-Kondi; Yanlong Zhai; Jun Shen; Akbar Telikani; Liehuang Zhu

doi:10.1109/TMC.2024.3416312

Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning

Jude Tchaye-Kondi, Yanlong Zhai^*, Jun Shen, Akbar Telikani, Liehuang Zhu

^*Corresponding author for this work

School of Cyberspace Science and Technology

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

Federated Learning is particularly challenging in IoT environments, where edge and cloud nodes have imbalanced computation capacity and networking bandwidth. The main scalability barrier in distributed stochastic gradient descent-based machine learning frameworks is the communication overhead from frequent model parameter exchanges between workers and the central server. One way to reduce this overhead is by employing constant and periodic averaging, which sends model parameters to the server after a few iterations of local updates from workers. However, investigations have shown that the optimal communication period for balancing communication and convergence is not constant. Although some studies have explored the effectiveness of federated learning with a constant period, dynamically adjusting the period for optimal convergence remains under-explored. To address this, we investigate the impact of the period on global model convergence and propose an adaptive period control mechanism (AdaPC). This mechanism adaptively adjusts the aggregation period of the federated learning framework to achieve fast convergence with minimal communication. Our theoretical and empirical findings demonstrate that our proposed solution achieves faster convergence, lower final training loss, and minimized communication overhead compared to the constant period averaging strategy and other existing solutions.

Original language	English
Pages (from-to)	12572-12586
Number of pages	15
Journal	IEEE Transactions on Mobile Computing
Volume	23
Issue number	12
DOIs	https://doi.org/10.1109/TMC.2024.3416312
Publication status	Published - 2024

Keywords

Adaptive communication
Internet of Things
distributed SGD
edge AI
federated learning
sparse averaging

Access to Document

10.1109/TMC.2024.3416312

Cite this

Tchaye-Kondi, J., Zhai, Y., Shen, J., Telikani, A., & Zhu, L. (2024). Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning. IEEE Transactions on Mobile Computing, 23(12), 12572-12586. https://doi.org/10.1109/TMC.2024.3416312

@article{15a234f9abae4f44b0f7147fc117cccd,

title = "Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning",

abstract = "Federated Learning is particularly challenging in IoT environments, where edge and cloud nodes have imbalanced computation capacity and networking bandwidth. The main scalability barrier in distributed stochastic gradient descent-based machine learning frameworks is the communication overhead from frequent model parameter exchanges between workers and the central server. One way to reduce this overhead is by employing constant and periodic averaging, which sends model parameters to the server after a few iterations of local updates from workers. However, investigations have shown that the optimal communication period for balancing communication and convergence is not constant. Although some studies have explored the effectiveness of federated learning with a constant period, dynamically adjusting the period for optimal convergence remains under-explored. To address this, we investigate the impact of the period on global model convergence and propose an adaptive period control mechanism (AdaPC). This mechanism adaptively adjusts the aggregation period of the federated learning framework to achieve fast convergence with minimal communication. Our theoretical and empirical findings demonstrate that our proposed solution achieves faster convergence, lower final training loss, and minimized communication overhead compared to the constant period averaging strategy and other existing solutions.",

keywords = "Adaptive communication, Internet of Things, distributed SGD, edge AI, federated learning, sparse averaging",

author = "Jude Tchaye-Kondi and Yanlong Zhai and Jun Shen and Akbar Telikani and Liehuang Zhu",

note = "Publisher Copyright: {\textcopyright} 2002-2012 IEEE.",

year = "2024",

doi = "10.1109/TMC.2024.3416312",

language = "English",

volume = "23",

pages = "12572--12586",

journal = "IEEE Transactions on Mobile Computing",

issn = "1536-1233",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning

AU - Tchaye-Kondi, Jude

AU - Zhai, Yanlong

AU - Shen, Jun

AU - Telikani, Akbar

AU - Zhu, Liehuang

PY - 2024

Y1 - 2024

N2 - Federated Learning is particularly challenging in IoT environments, where edge and cloud nodes have imbalanced computation capacity and networking bandwidth. The main scalability barrier in distributed stochastic gradient descent-based machine learning frameworks is the communication overhead from frequent model parameter exchanges between workers and the central server. One way to reduce this overhead is by employing constant and periodic averaging, which sends model parameters to the server after a few iterations of local updates from workers. However, investigations have shown that the optimal communication period for balancing communication and convergence is not constant. Although some studies have explored the effectiveness of federated learning with a constant period, dynamically adjusting the period for optimal convergence remains under-explored. To address this, we investigate the impact of the period on global model convergence and propose an adaptive period control mechanism (AdaPC). This mechanism adaptively adjusts the aggregation period of the federated learning framework to achieve fast convergence with minimal communication. Our theoretical and empirical findings demonstrate that our proposed solution achieves faster convergence, lower final training loss, and minimized communication overhead compared to the constant period averaging strategy and other existing solutions.

AB - Federated Learning is particularly challenging in IoT environments, where edge and cloud nodes have imbalanced computation capacity and networking bandwidth. The main scalability barrier in distributed stochastic gradient descent-based machine learning frameworks is the communication overhead from frequent model parameter exchanges between workers and the central server. One way to reduce this overhead is by employing constant and periodic averaging, which sends model parameters to the server after a few iterations of local updates from workers. However, investigations have shown that the optimal communication period for balancing communication and convergence is not constant. Although some studies have explored the effectiveness of federated learning with a constant period, dynamically adjusting the period for optimal convergence remains under-explored. To address this, we investigate the impact of the period on global model convergence and propose an adaptive period control mechanism (AdaPC). This mechanism adaptively adjusts the aggregation period of the federated learning framework to achieve fast convergence with minimal communication. Our theoretical and empirical findings demonstrate that our proposed solution achieves faster convergence, lower final training loss, and minimized communication overhead compared to the constant period averaging strategy and other existing solutions.

KW - Adaptive communication

KW - Internet of Things

KW - distributed SGD

KW - edge AI

KW - federated learning

KW - sparse averaging

UR - http://www.scopus.com/inward/record.url?scp=85196718897&partnerID=8YFLogxK

U2 - 10.1109/TMC.2024.3416312

DO - 10.1109/TMC.2024.3416312

M3 - Article

AN - SCOPUS:85196718897

SN - 1536-1233

VL - 23

SP - 12572

EP - 12586

JO - IEEE Transactions on Mobile Computing

JF - IEEE Transactions on Mobile Computing

IS - 12

ER -

Adaptive Period Control for Communication Efficient and Fast Convergent Federated Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this