Scholars@Duke publication: Joint classifier and feature optimization for cancer diagnosis using gene expression data

Joint classifier and feature optimization for cancer diagnosis using gene expression data

Publication , Conference

Krishnapuram, B; Carin, L; Hartemink, AJ

Published in: Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB

January 1, 2003

Recent research has demonstrated quite convincingly that accurate cancer diagnosis can be achieved by constructing classifiers that arc designed to compare the gene expression profile of a tissue of unknown cancer status to a database of stored expression profiles from tissues of known cancer status. This paper introduces the JCFO, a novel algorithm that uses a sparse Bayesian approach to jointly identify both the optimal nonlinear classifier for diagnosis and the optimal set of genes on which to base that diagnosis. We show that the diagnostic classification accuracy of the proposed algorithm is superior to a number of current state-of-the-art methods in a full leave-one-out cross-validation study of two widely used benchmark datasets. In addition to its superior classification accuracy, the algorithm is designed to automatically identify a small subset of genes (typically around twenty in our experiments) that are capable of providing complete discriminatory information for diagnosis. Focusing attention on a small subset of genes is not only useful because it produces a classifier with good generalization capacity, but also because this set of genes may provide insights into the mechanisms responsible for the disease itself. A number of the genes identified by the JCFO in our experiments are already in use as clinical markers for cancer diagnosis; some of the remaining genes may be excellent candidates for further clinical investigation. If it is possible to identify a small set of genes that is indeed capable of providing complete discrimination, inexpensive diagnostic assays might be widely deployable in clinical settings.

Duke Scholars

Author Lawrence Carin Electrical and Computer Engineering

Author Alexander J. Hartemink Computer Science

Published In

Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB

DOI

10.1145/640075.640097

Publication Date

January 1, 2003

Start / End Page

167 / 175

Citation

APA

Chicago

ICMJE

MLA

NLM

Krishnapuram, B., Carin, L., & Hartemink, A. J. (2003). Joint classifier and feature optimization for cancer diagnosis using gene expression data. In Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB (pp. 167–175). https://doi.org/10.1145/640075.640097

Krishnapuram, B., L. Carin, and A. J. Hartemink. “Joint classifier and feature optimization for cancer diagnosis using gene expression data.” In Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB, 167–75, 2003. https://doi.org/10.1145/640075.640097.

Krishnapuram B, Carin L, Hartemink AJ. Joint classifier and feature optimization for cancer diagnosis using gene expression data. In: Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB. 2003. p. 167–75.

Krishnapuram, B., et al. “Joint classifier and feature optimization for cancer diagnosis using gene expression data.” Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB, 2003, pp. 167–75. Scopus, doi:10.1145/640075.640097.

Krishnapuram B, Carin L, Hartemink AJ. Joint classifier and feature optimization for cancer diagnosis using gene expression data. Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB. 2003. p. 167–175.

Published In

Proceedings of the Annual International Conference on Computational Molecular Biology RECOMB

DOI

10.1145/640075.640097

Publication Date

January 1, 2003

Start / End Page

167 / 175