Communication-Efficient Federated Learning for Decision Trees
The increasing concerns about data privacy and security have driven the emergence of federated learning, which preserves privacy by collaborative learning across multiple clients without sharing their raw data. In this paper, we propose a communication-efficient federated learning algorithm for decision trees, referred to as FL-DT. The key idea is to exchange the statistics of a small number of features among the server and all clients, enabling identification of the optimal feature to split each decision tree node without compromising privacy. To efficiently find the splitting feature based on the partially available information at each decision tree node, a novel formulation is derived to estimate the lower and upper bounds of Gini indexes of all features by solving a sequence of mixed-integer convex programming problems. Our experimental results based on various public datasets demonstrate that FL-DT can reduce the communication overhead substantially without surrendering any classification accuracy, compared to other conventional methods.
Duke Scholars
Published In
DOI
EISSN
Publication Date
Related Subject Headings
- 4611 Machine learning
- 4603 Computer vision and multimedia computation
- 4602 Artificial intelligence
Citation
Published In
DOI
EISSN
Publication Date
Related Subject Headings
- 4611 Machine learning
- 4603 Computer vision and multimedia computation
- 4602 Artificial intelligence