Abstract
Non-independent and identically distributed (Non-IID) data and model heterogeneity pose a great challenge for federated learning in cloud-based and edge-based systems. They easily lead to inconsistent gradient updates during the training stage and mismatched gradient dimensions during the aggregation stage, degrading the global model's performance and consuming a large amount of training time. To solve these problems, this paper proposes a Heterogeneous Hierarchical Federated Mutual Learning (HFML) method for edge-based systems. We design a model assignment mechanism in which clients and edge servers individually fork global models of different structures, and the untrained local models learn mutually with the edge models through deep mutual learning. We use partial periodic aggregation to approximate global aggregation and achieve fast convergence. Our experiments show that HFML outperforms three state-of-the-art approaches on common datasets such as CIFAR-10/100, improving accuracy by up to 2.9% and reducing training time by 30% under both homogeneous and heterogeneous models.
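The abstract does not spell out the training objective, but the deep mutual learning it cites is an established technique (Zhang et al., CVPR 2018) in which each model minimizes its own cross-entropy loss plus a KL-divergence term toward its peer's predictions. Below is a minimal PyTorch sketch of that objective for one client/edge pair, assuming the paper follows the standard formulation; the function name and the `kl_weight` parameter are illustrative and not taken from the paper.

```python
import torch
import torch.nn.functional as F

def mutual_learning_losses(local_logits: torch.Tensor,
                           edge_logits: torch.Tensor,
                           targets: torch.Tensor,
                           kl_weight: float = 1.0):
    """Per-batch deep-mutual-learning losses for a client (local) model
    and an edge model. The two models may have different architectures,
    as long as both emit logits over the same set of classes.

    Illustrative sketch of the standard formulation; not the paper's code.
    """
    # Supervised cross-entropy for each model on the shared labels.
    ce_local = F.cross_entropy(local_logits, targets)
    ce_edge = F.cross_entropy(edge_logits, targets)

    # KL(peer || self): each model mimics the other's soft predictions.
    # The peer's distribution is detached so gradients flow only into
    # the model currently being updated.
    kl_local = F.kl_div(F.log_softmax(local_logits, dim=1),
                        F.softmax(edge_logits, dim=1).detach(),
                        reduction="batchmean")
    kl_edge = F.kl_div(F.log_softmax(edge_logits, dim=1),
                       F.softmax(local_logits, dim=1).detach(),
                       reduction="batchmean")

    loss_local = ce_local + kl_weight * kl_local
    loss_edge = ce_edge + kl_weight * kl_edge
    return loss_local, loss_edge
```

Because the KL terms compare output distributions rather than parameters, the two models need not share a structure, which is what allows heterogeneous client and edge models to train together without the gradient-dimension mismatch the abstract describes.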
| Original language | English |
| --- | --- |
| Journal | Annals of Operations Research |
| DOIs | |
| Publication status | Accepted/In press - 2023 |
Keywords
- Deep mutual learning
- Federated learning
- Heterogeneous models
- Non-independent and identically distributed (Non-IID)