TY - JOUR
T1 - CDKT-FL: Cross-device knowledge transfer using proxy dataset in federated learning
T2 - Engineering Applications of Artificial Intelligence
AU - Le, Huy Q.
AU - Nguyen, Minh N.H.
AU - Pandey, Shashi Raj
AU - Zhang, Chaoning
AU - Hong, Choong Seon
N1 - Publisher Copyright:
© 2024
PY - 2024/7
Y1 - 2024/7
AB - In practical settings, enabling robust Federated Learning (FL) systems, in terms of both generalization and personalization, is an important research question. It is challenging because of the non-i.i.d. properties of clients' data, often referred to as statistical heterogeneity, and the small local data samples drawn from diverse data distributions. To develop robust generalized global and personalized models, conventional FL methods therefore need to redesign knowledge aggregation from biased local models while accounting for the large divergence of learning parameters caused by skewed client data. In this work, we demonstrate that a knowledge transfer mechanism achieves these objectives, and we develop a novel knowledge distillation-based approach to study the extent of knowledge transfer between the global model and local models. Specifically, our method considers the suitability of transferring the output distribution and/or the embedding vector of the representation from trained models during cross-device knowledge transfer over a small proxy dataset in heterogeneous FL. Accordingly, we alternately perform cross-device knowledge transfer following two general formulations: (1) global knowledge transfer and (2) on-device knowledge transfer. Through simulations on three federated datasets, we show that the proposed method achieves significant speedups and high personalized performance for local models. Furthermore, the proposed approach trains more stably than the other baselines, with minimal communication load, since only the trained models' outputs and representations are exchanged.
KW - Federated learning
KW - Knowledge distillation
KW - Representation learning
UR - http://www.scopus.com/inward/record.url?scp=85186443771&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2024.108093
DO - 10.1016/j.engappai.2024.108093
M3 - Journal article
AN - SCOPUS:85186443771
SN - 0952-1976
VL - 133
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
M1 - 108093
ER -