TY - GEN
T1 - DaFKD: Domain-aware Federated Knowledge Distillation
T2 - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
AU - Wang, Haozhao
AU - Li, Yichen
AU - Xu, Wenchao
AU - Li, Ruixuan
AU - Zhan, Yufeng
AU - Zeng, Zhigang
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
AB - Federated Distillation (FD) has recently attracted increasing attention for its efficiency in aggregating multiple diverse local models trained on the statistically heterogeneous data of distributed clients. Existing FD methods generally treat these models equally, simply averaging their output soft predictions for a given distillation sample. This ignores the diversity across local models and degrades the performance of the aggregated model, especially when some local models have learned little about the sample. In this paper, we propose a new perspective that treats the local data of each client as a specific domain, and we design a novel domain-knowledge-aware federated distillation method, dubbed DaFKD, which discerns the importance of each model to a given distillation sample and can therefore optimize the ensemble of soft predictions from diverse models. Specifically, we employ a domain discriminator for each client, trained to identify the correlation factor between a sample and the corresponding domain. To facilitate the training of the domain discriminator while saving communication costs, we further propose sharing some of its parameters with the classification model. Extensive experiments on various datasets and settings show that the proposed method improves model accuracy by up to 6.02% over state-of-the-art baselines.
UR - http://www.scopus.com/inward/record.url?scp=85172428367&partnerID=8YFLogxK
DO - 10.1109/CVPR52729.2023.01955
M3 - Conference contribution
AN - SCOPUS:85172428367
T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
SP - 20412
EP - 20421
BT - Proceedings - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023
PB - IEEE Computer Society
Y2 - 18 June 2023 through 22 June 2023
ER -