Skip to main content

Graph edge partitioning via neighborhood heuristic

Publication ,  Conference
Zhang, C; Wei, F; Liu, Q; Tang, ZG; Li, Z
Published in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 13, 2017

We consider the edge partitioning problem that partitions the edges of an input graph into multiple balanced components, while minimizing the total number of vertices replicated (one vertex might appear in more than one partition). This problem is critical in minimizing communication costs and running time for several large-scale distributed graph computation platforms (e.g., PowerGraph, Spark GraphX). We first prove that this problem is NP-hard, and then present a new partitioning heuristic with polynomial running time. We provide a worst-case upper bound of replication factor for our heuristic on general graphs. To our knowledge, we are the first to provide such bound for edge partitioning algorithms on general graphs. Applying this bound to random power-law graphs greatly improves the previous bounds of expected replication factor. Extensive experiments demonstrated that our partitioning algorithm consistently produces much smaller replication factors on various benchmark data sets than the state-of-the-art. When deployed in the production graph engine, PowerGraph, in average it reduces replication factor, communication, and running time by 54%, 66%, and 21%, respectively.

Duke Scholars

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

August 13, 2017

Volume

Part F129685

Start / End Page

605 / 614
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhang, C., Wei, F., Liu, Q., Tang, Z. G., & Li, Z. (2017). Graph edge partitioning via neighborhood heuristic. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Vol. Part F129685, pp. 605–614). https://doi.org/10.1145/3097983.3098033
Zhang, C., F. Wei, Q. Liu, Z. G. Tang, and Z. Li. “Graph edge partitioning via neighborhood heuristic.” In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Part F129685:605–14, 2017. https://doi.org/10.1145/3097983.3098033.
Zhang C, Wei F, Liu Q, Tang ZG, Li Z. Graph edge partitioning via neighborhood heuristic. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2017. p. 605–14.
Zhang, C., et al. “Graph edge partitioning via neighborhood heuristic.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, vol. Part F129685, 2017, pp. 605–14. Scopus, doi:10.1145/3097983.3098033.
Zhang C, Wei F, Liu Q, Tang ZG, Li Z. Graph edge partitioning via neighborhood heuristic. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2017. p. 605–614.

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

August 13, 2017

Volume

Part F129685

Start / End Page

605 / 614