Federated deep long-tailed learning: A survey

Kan Li; Yang Li; Ji Zhang; Xin Liu; Zhichao Ma

doi:10.1016/j.neucom.2024.127906

Corresponding author: Kan Li

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Contribution to journal › Short survey › peer-review

1 Citation (Scopus)

Abstract

The federated learning privacy-preserving framework has achieved fruitful results in training deep models across clients. This survey aims to provide a systematic overview of federated deep long-tailed learning. We analyze the problems of federated deep long-tailed learning of class imbalance/missing, different long-tailed distributions, and biased training, and summarize the current approaches that fall into the following three categories: information enhancement, model component optimization, and algorithm-based calibration. Meanwhile, we also sort out the representative open-source datasets for different tasks. We conduct abundant experiments on CIFAR-10/100-LT using LeNet-5/ResNet-8/ResNet-34 and evaluate the model performance with multiple metrics. We also consider a text classification task and evaluate the performance of multiple methods using LSTM on the 20NewsGroups-LT. We discuss the challenges posed by data heterogeneity, model heterogeneity, fairness, and security, and identify future research directions for the follow-up studies.

Original language	English
Article number	127906
Journal	Neurocomputing
Volume	595
DOIs	https://doi.org/10.1016/j.neucom.2024.127906
Publication status	Published - 28 Aug 2024

Keywords

Agnostic distribution
Deep learning
Federated learning
Long-tailed distribution

Cite this

@article{3e675113b42243b980e46e8b3c8e524c,
title = "Federated deep long-tailed learning: A survey",
abstract = "The federated learning privacy-preserving framework has achieved fruitful results in training deep models across clients. This survey aims to provide a systematic overview of federated deep long-tailed learning. We analyze the problems of federated deep long-tailed learning of class imbalance/missing, different long-tailed distributions, and biased training, and summarize the current approaches that fall into the following three categories: information enhancement, model component optimization, and algorithm-based calibration. Meanwhile, we also sort out the representative open-source datasets for different tasks. We conduct abundant experiments on CIFAR-10/100-LT using LeNet-5/ResNet-8/ResNet-34 and evaluate the model performance with multiple metrics. We also consider a text classification task and evaluate the performance of multiple methods using LSTM on the 20NewsGroups-LT. We discuss the challenges posed by data heterogeneity, model heterogeneity, fairness, and security, and identify future research directions for the follow-up studies.",
keywords = "Agnostic distribution, Deep learning, Federated learning, Long-tailed distribution",
author = "Kan Li and Yang Li and Ji Zhang and Xin Liu and Zhichao Ma",
note = "Publisher Copyright: {\textcopyright} 2024 Elsevier B.V.",
year = "2024",
month = aug,
day = "28",
doi = "10.1016/j.neucom.2024.127906",
language = "English",
volume = "595",
journal = "Neurocomputing",
issn = "0925-2312",
publisher = "Elsevier B.V.",
}

TY  - JOUR
T1  - Federated deep long-tailed learning
T2  - A survey
AU  - Li, Kan
AU  - Li, Yang
AU  - Zhang, Ji
AU  - Liu, Xin
AU  - Ma, Zhichao
PY  - 2024/8/28
Y1  - 2024/8/28
N2  - The federated learning privacy-preserving framework has achieved fruitful results in training deep models across clients. This survey aims to provide a systematic overview of federated deep long-tailed learning. We analyze the problems of federated deep long-tailed learning of class imbalance/missing, different long-tailed distributions, and biased training, and summarize the current approaches that fall into the following three categories: information enhancement, model component optimization, and algorithm-based calibration. Meanwhile, we also sort out the representative open-source datasets for different tasks. We conduct abundant experiments on CIFAR-10/100-LT using LeNet-5/ResNet-8/ResNet-34 and evaluate the model performance with multiple metrics. We also consider a text classification task and evaluate the performance of multiple methods using LSTM on the 20NewsGroups-LT. We discuss the challenges posed by data heterogeneity, model heterogeneity, fairness, and security, and identify future research directions for the follow-up studies.
AB  - The federated learning privacy-preserving framework has achieved fruitful results in training deep models across clients. This survey aims to provide a systematic overview of federated deep long-tailed learning. We analyze the problems of federated deep long-tailed learning of class imbalance/missing, different long-tailed distributions, and biased training, and summarize the current approaches that fall into the following three categories: information enhancement, model component optimization, and algorithm-based calibration. Meanwhile, we also sort out the representative open-source datasets for different tasks. We conduct abundant experiments on CIFAR-10/100-LT using LeNet-5/ResNet-8/ResNet-34 and evaluate the model performance with multiple metrics. We also consider a text classification task and evaluate the performance of multiple methods using LSTM on the 20NewsGroups-LT. We discuss the challenges posed by data heterogeneity, model heterogeneity, fairness, and security, and identify future research directions for the follow-up studies.
KW  - Agnostic distribution
KW  - Deep learning
KW  - Federated learning
KW  - Long-tailed distribution
UR  - http://www.scopus.com/inward/record.url?scp=85194149542&partnerID=8YFLogxK
U2  - 10.1016/j.neucom.2024.127906
DO  - 10.1016/j.neucom.2024.127906
M3  - Short survey
AN  - SCOPUS:85194149542
SN  - 0925-2312
VL  - 595
JO  - Neurocomputing
JF  - Neurocomputing
M1  - 127906
ER  - 
