Scholars@Duke publication: GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs

GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs

Publication , Conference

Wang, B; Li, A; Pang, M; Li, H; Chen, Y

Published in: Proceedings - IEEE International Conference on Data Mining, ICDM

January 1, 2022

Graph-based semi-supervised node classification (GraphSSC) has wide applications, ranging from networking and security to data mining and machine learning, etc. However, existing centralized GraphSSC methods are impractical to solve many real-world graph-based problems, as collecting the entire graph and labeling a reasonable number of labels is time-consuming and costly, and data privacy may be also violated. Federated learning (FL) is an emerging learning paradigm that enables collaborative learning among multiple clients, which can mitigate the issue of label scarcity and protect data privacy as well. Therefore, performing GraphSSC under the FL setting is a promising solution to solve real-world graph-based problems. However, existing FL methods 1) perform poorly when data across clients are non independent identically distributed (nonIID), 2) cannot handle data with new label domains, and 3) cannot leverage unlabeled data, while all these issues naturally happen in real-world graph-based problems. To address the above issues, we propose the first FL framework, namely GraphFL, for semi-supervised node classification on graphs. Our framework is motivated by meta-learning methods. Specifically, we propose two GraphFL methods to respectively address the non-IID issue in graph data and handle the tasks with new label domains. Furthermore, we design a self-training method to leverage unlabeled graph data. We adopt representative graph neural networks as GraphSSC methods and evaluate GraphFL on multiple graph datasets. Experimental results on various benchmark datasets demonstrate that GraphFL significantly outperforms the compared FL baseline, GraphFL can handle data with new label domains, and GraphFL with selftraining can obtain better performance. Source code is available at https://github.com/binghuiivang/GraphFL.

Duke Scholars

Author Yiran Chen Electrical and Computer Engineering

Published In

Proceedings - IEEE International Conference on Data Mining, ICDM

DOI

10.1109/ICDM54844.2022.00060

ISSN

1550-4786

Publication Date

January 1, 2022

Volume

2022-November

Start / End Page

498 / 507

Citation

APA

Chicago

ICMJE

MLA

NLM

Wang, B., Li, A., Pang, M., Li, H., & Chen, Y. (2022). GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs. In Proceedings - IEEE International Conference on Data Mining, ICDM (Vol. 2022-November, pp. 498–507). https://doi.org/10.1109/ICDM54844.2022.00060

Wang, B., A. Li, M. Pang, H. Li, and Y. Chen. “GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs.” In Proceedings - IEEE International Conference on Data Mining, ICDM, 2022-November:498–507, 2022. https://doi.org/10.1109/ICDM54844.2022.00060.

Wang B, Li A, Pang M, Li H, Chen Y. GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs. In: Proceedings - IEEE International Conference on Data Mining, ICDM. 2022. p. 498–507.

Wang, B., et al. “GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs.” Proceedings - IEEE International Conference on Data Mining, ICDM, vol. 2022-November, 2022, pp. 498–507. Scopus, doi:10.1109/ICDM54844.2022.00060.

Wang B, Li A, Pang M, Li H, Chen Y. GraphFL: A Federated Learning Framework for Semi-Supervised Node Classification on Graphs. Proceedings - IEEE International Conference on Data Mining, ICDM. 2022. p. 498–507.

Published In

Proceedings - IEEE International Conference on Data Mining, ICDM

DOI

10.1109/ICDM54844.2022.00060

ISSN

1550-4786

Publication Date

January 1, 2022

Volume

2022-November

Start / End Page

498 / 507