Scholars@Duke publication: Mining phenotypes and informative genes from gene expression data

Mining phenotypes and informative genes from gene expression data

Publication, Conference

Tang, C; Zhang, A; Pei, J

Published in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

December 1, 2003

Mining microarray gene expression data is an important research topic in bioinformatics with broad applications. While most previous studies focus on clustering either genes or samples, it is interesting to ask whether we can partition the complete set of samples into exclusive groups (called phenotypes) and find a set of informative genes that manifest the phenotype structure. In this paper, we propose a new problem of simultaneously mining phenotypes and informative genes from gene expression data. Statistics-based metrics are proposed to measure the quality of the mining results. Two algorithms are developed: a heuristic search method and a mutual reinforcing adjustment method. We present an extensive performance study on both real-world and synthetic data sets. The mining results from the two proposed methods are clearly better than those from previous methods, and the methods are ready for real-world applications. Between the two, the mutual reinforcing adjustment method is in general more scalable and more effective, and it produces mining results of better quality. Copyright 2003 ACM.

Duke Scholars

Author: Jian Pei, Computer Science

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

10.1145/956750.956835

Publication Date

December 1, 2003

Start / End Page

655 / 660

Citation

APA

Tang, C., Zhang, A., & Pei, J. (2003). Mining phenotypes and informative genes from gene expression data. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 655–660). https://doi.org/10.1145/956750.956835

Chicago

Tang, C., A. Zhang, and J. Pei. “Mining phenotypes and informative genes from gene expression data.” In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 655–60, 2003. https://doi.org/10.1145/956750.956835.

ICMJE

Tang C, Zhang A, Pei J. Mining phenotypes and informative genes from gene expression data. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003. p. 655–60.

MLA

Tang, C., et al. “Mining phenotypes and informative genes from gene expression data.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2003, pp. 655–60. Scopus, doi:10.1145/956750.956835.

NLM

Tang C, Zhang A, Pei J. Mining phenotypes and informative genes from gene expression data. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003. p. 655–660.
