Skip to main content

On mining cross-graph quasi-cliques

Publication ,  Conference
Pei, J; Jiang, D; Zhang, A
Published in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
December 1, 2005

Joint mining of multiple data sets can often discover interesting, novel, and reliable patterns which cannot be obtained solely from any single source. For example, in cross-market customer segmentation, a group of customers who behave similarly in multiple markets should be considered as a more coherent and more reliable cluster than clusters found in a single market. As another example, in bioinformatics, by joint mining of gene expression data and protein interaction data, we can find clusters of genes which show coherent expression patterns and also produce interacting proteins. Such clusters may be potential pathways. In this paper, we investigate a novel data mining problem, mining cross-graph quasi-cliques, which is generalized from several interesting applications such as cross-market customer segmentation and joint mining of gene expression data and protein interaction data. We build a general model for mining cross-graph quasi-cliques, show why the complete set of cross-graph quasi-cliques cannot be found by previous data mining methods, and study the complexity of the problem. While the problem is difficult, we develop an efficient algorithm, Crochet, which exploits several interesting and effective techniques and heuristics to efficaciously mine cross-graph quasi-cliques. A systematic performance study is reported on both synthetic and real data sets. We demonstrate some interesting and meaningful cross-graph quasi-cliques in bioinformatics. The experimental results also show that algorithm Crochet is efficient and scalable. Copyright 2005 ACM.

Duke Scholars

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

December 1, 2005

Start / End Page

228 / 238
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Pei, J., Jiang, D., & Zhang, A. (2005). On mining cross-graph quasi-cliques. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 228–238). https://doi.org/10.1145/1081870.1081898
Pei, J., D. Jiang, and A. Zhang. “On mining cross-graph quasi-cliques.” In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 228–38, 2005. https://doi.org/10.1145/1081870.1081898.
Pei J, Jiang D, Zhang A. On mining cross-graph quasi-cliques. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2005. p. 228–38.
Pei, J., et al. “On mining cross-graph quasi-cliques.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005, pp. 228–38. Scopus, doi:10.1145/1081870.1081898.
Pei J, Jiang D, Zhang A. On mining cross-graph quasi-cliques. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2005. p. 228–238.

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

December 1, 2005

Start / End Page

228 / 238