Abstract
As a privacy-preserving framework, federated learning has achieved notable success in training deep models across clients. This survey provides a systematic overview of federated deep long-tailed learning. We analyze its core problems, namely class imbalance and missing classes, divergent long-tailed distributions across clients, and biased training, and summarize current approaches under three categories: information enhancement, model component optimization, and algorithm-based calibration. We also catalog representative open-source datasets for different tasks. We conduct extensive experiments on CIFAR-10/100-LT using LeNet-5, ResNet-8, and ResNet-34, evaluating model performance with multiple metrics, and further assess several methods on a text classification task using an LSTM on 20NewsGroups-LT. Finally, we discuss the challenges posed by data heterogeneity, model heterogeneity, fairness, and security, and identify directions for future research.
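The long-tailed benchmarks mentioned above (CIFAR-10/100-LT) are commonly constructed by exponentially subsampling the balanced training sets. The sketch below illustrates that standard recipe; the function names, the random seed, and the imbalance factor of 100 are illustrative assumptions, not the survey's exact experimental protocol.

```python
import numpy as np

def long_tailed_counts(num_classes: int, max_count: int, imbalance_factor: float):
    """Per-class sample counts decaying exponentially from max_count down to
    max_count / imbalance_factor -- the common recipe for CIFAR-10/100-LT."""
    return [
        int(max_count * imbalance_factor ** (-c / (num_classes - 1)))
        for c in range(num_classes)
    ]

def make_long_tailed_indices(labels, counts, seed: int = 0):
    """Subsample a balanced dataset's indices to match the target counts."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    keep = []
    for c, n in enumerate(counts):
        class_idx = np.flatnonzero(labels == c)  # indices of class c
        keep.extend(rng.choice(class_idx, size=n, replace=False))
    return np.asarray(keep)

# CIFAR-10-LT with imbalance factor 100: counts decay from 5000 to 50 per class.
print(long_tailed_counts(num_classes=10, max_count=5000, imbalance_factor=100))
```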
| Original language | English |
|---|---|
| Article number | 127906 |
| Journal | Neurocomputing |
| Volume | 595 |
| DOI | |
| Publication status | Published - 28 Aug 2024 |