TY - JOUR
T1 - AsyFed
T2 - Accelerated Federated Learning with Asynchronous Communication Mechanism
AU - Li, Zhixin
AU - Huang, Chunpu
AU - Gai, Keke
AU - Lu, Zhihui
AU - Wu, Jie
AU - Chen, Lulu
AU - Xu, Yangchuan
AU - Choo, Kim-Kwang Raymond
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2023/5/15
Y1 - 2023/5/15
N2 - As a distributed machine learning (ML) framework designed for privacy protection, federated learning (FL) enables a large number of Internet of Things (IoT) devices (e.g., mobile phones and tablets) to participate in the collaborative training of an ML model. FL protects the data privacy of IoT devices because their raw data are never exposed. However, the heterogeneity of IoT devices can degrade the overall training process due to the straggler issue. To tackle this problem, we propose a gear-based asynchronous FL (AsyFed) architecture. It adds a gear layer between the clients and the FL server as a mediator that stores the model parameters. The key insight is to group clients with similar training capabilities into the same gear: clients within a gear train synchronously, while the gears communicate with the global FL server asynchronously. In addition, we propose a T-step mechanism that reduces the weight of slow gears when they communicate with the FL server. Extensive experimental evaluations indicate that AsyFed outperforms FedAvg (the baseline synchronous FL scheme) and several state-of-the-art asynchronous FL methods in terms of training accuracy or speed under different data distributions. The only (negligible) overhead is the extra gear layer used to store part of the model parameters.
KW - Federated learning (FL)
KW - asynchronous update
KW - communication overhead
KW - device heterogeneity
UR - http://www.scopus.com/inward/record.url?scp=85146217231&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2022.3231913
DO - 10.1109/JIOT.2022.3231913
M3 - Article
AN - SCOPUS:85146217231
SN - 2327-4662
VL - 10
SP - 8670
EP - 8683
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 10
ER -