Adaptive Federated Learning on Non-IID Data With Resource Constraint

Jie Zhang; Song Guo; Zhihao Qu; Deze Zeng; Yufeng Zhan; Qifeng Liu; Rajendra Akerkar

doi:10.1109/TC.2021.3099723

Corresponding author: Zhihao Qu

School of Automation

Research output: Contribution to journal › Article › peer-review

86 Citations (Scopus)

Abstract

Federated learning (FL) is widely recognized as a promising approach because it enables individual end devices to cooperatively train a global model without exposing their own data. One of the key challenges in FL is the non-independent and identically distributed (Non-IID) data across clients, which reduces the efficiency of the stochastic gradient descent (SGD) based training process. Moreover, clients with different data distributions may bias the global model update, degrading model accuracy. To tackle the Non-IID problem in FL, we aim to optimize the local training process and global aggregation simultaneously. For local training, we analyze the effect of hyperparameters (e.g., the batch size and the number of local updates) on the training performance of FL. Guided by a toy example and theoretical analysis, we mitigate the negative impact of Non-IID data by selecting a subset of participants and adaptively adjusting their batch sizes. We propose a deep reinforcement learning based approach to adaptively control the training of local models and the phase of global aggregation. Extensive experiments on different datasets show that our method can improve model accuracy by up to 30 percent compared with state-of-the-art approaches.
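
As a rough illustration of the global aggregation step mentioned in the abstract, the following minimal Python sketch averages the parameters of a selected subset of clients, weighted by each client's sample count (a common FedAvg-style rule). It is not the authors' method: the function name `aggregate_selected`, the client identifiers, and the flat-list model representation are assumptions made for illustration, and the deep reinforcement learning controller that chooses participants and batch sizes is not shown.

```python
from typing import Dict, List, Sequence


def aggregate_selected(
    updates: Dict[str, List[float]],
    num_samples: Dict[str, int],
    selected: Sequence[str],
) -> List[float]:
    """Sample-count-weighted average of the selected clients' parameters.

    Hypothetical helper: `updates` maps a client id to its local model,
    flattened to a list of floats; `num_samples` maps a client id to the
    number of local training samples it holds.
    """
    if not selected:
        raise ValueError("at least one client must be selected")

    total = sum(num_samples[c] for c in selected)
    dim = len(updates[selected[0]])
    global_model = [0.0] * dim

    for client in selected:
        weight = num_samples[client] / total  # clients with more data count more
        for i, value in enumerate(updates[client]):
            global_model[i] += weight * value
    return global_model


# Illustrative values only: client "c" is excluded from this round.
updates = {"a": [1.0, 2.0], "b": [3.0, 4.0], "c": [100.0, 100.0]}
num_samples = {"a": 1, "b": 3, "c": 5}
print(aggregate_selected(updates, num_samples, ["a", "b"]))
```

Leaving a client out of `selected` removes its influence on the global model for that round, which is the lever a participant-selection policy would act on.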

Original language	English
Pages (from-to)	1655-1667
Number of pages	13
Journal	IEEE Transactions on Computers
Volume	71
Issue number	7
DOIs	https://doi.org/10.1109/TC.2021.3099723
Publication status	Published - 1 Jul 2022

Keywords

Federated learning
batch size adaption
deep reinforcement learning
non-IID data

Cite this

Zhang, J., Guo, S., Qu, Z., Zeng, D., Zhan, Y., Liu, Q., & Akerkar, R. (2022). Adaptive Federated Learning on Non-IID Data With Resource Constraint. IEEE Transactions on Computers, 71(7), 1655-1667. https://doi.org/10.1109/TC.2021.3099723

@article{65230b72837b440e8f581681f7680de9,

title = "Adaptive Federated Learning on Non-IID Data With Resource Constraint",

keywords = "Federated learning, batch size adaption, deep reinforcement learning, non-IID data",

author = "Jie Zhang and Song Guo and Zhihao Qu and Deze Zeng and Yufeng Zhan and Qifeng Liu and Rajendra Akerkar",

note = "Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",

year = "2022",

month = jul,

day = "1",

doi = "10.1109/TC.2021.3099723",

language = "English",

volume = "71",

pages = "1655--1667",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "7",

}

TY - JOUR

T1 - Adaptive Federated Learning on Non-IID Data With Resource Constraint

AU - Zhang, Jie

AU - Guo, Song

AU - Qu, Zhihao

AU - Zeng, Deze

AU - Zhan, Yufeng

AU - Liu, Qifeng

AU - Akerkar, Rajendra

PY - 2022/7/1

Y1 - 2022/7/1

KW - Federated learning

KW - batch size adaption

KW - deep reinforcement learning

KW - non-IID data

UR - http://www.scopus.com/inward/record.url?scp=85112664813&partnerID=8YFLogxK

U2 - 10.1109/TC.2021.3099723

DO - 10.1109/TC.2021.3099723

M3 - Article

AN - SCOPUS:85112664813

SN - 0018-9340

VL - 71

SP - 1655

EP - 1667

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 7

ER -
