Discriminative variable subsets in Bayesian classification with mixture models, with application in flow cytometry studies.

Publication, Journal Article
Lin, L; Chan, C; West, M
Published in: Biostatistics
January 2016

We discuss the evaluation of subsets of variables for the discriminative evidence they provide in multivariate mixture modeling for classification. The novel development of Bayesian classification analysis presented is partly motivated by problems of design and selection of variables in biomolecular studies, particularly involving widely used assays of large-scale single-cell data generated using flow cytometry technology. For such studies and for mixture modeling generally, we define discriminative analysis that overlays fitted mixture models using a natural measure of concordance between mixture component densities, and define an effective and computationally feasible method for assessing and prioritizing subsets of variables according to their roles in discrimination of one or more mixture components. We relate the new discriminative information measures to Bayesian classification probabilities and error rates, and exemplify their use in Bayesian analysis of Dirichlet process mixture models fitted via Markov chain Monte Carlo methods as well as using a novel Bayesian expectation-maximization algorithm. We present a series of theoretical and simulated data examples to fix concepts and exhibit the utility of the approach, and compare with prior approaches. We demonstrate application in the context of automatic classification and discriminative variable selection in high-throughput systems biology using large flow cytometry datasets.

Duke Scholars

Published In

Biostatistics

DOI

10.1093/biostatistics/kxv021

EISSN

1468-4357

Publication Date

January 2016

Volume

17

Issue

1

Start / End Page

40 / 53

Location

England

Related Subject Headings

  • T-Lymphocytes, Regulatory
  • Statistics & Probability
  • Models, Statistical
  • Humans
  • Flow Cytometry
  • Bayes Theorem
  • 4905 Statistics
  • 0604 Genetics
  • 0104 Statistics
 

Citation

APA: Lin, L., Chan, C., & West, M. (2016). Discriminative variable subsets in Bayesian classification with mixture models, with application in flow cytometry studies. Biostatistics, 17(1), 40–53. https://doi.org/10.1093/biostatistics/kxv021
Chicago: Lin, Lin, Cliburn Chan, and Mike West. “Discriminative variable subsets in Bayesian classification with mixture models, with application in flow cytometry studies.” Biostatistics 17, no. 1 (January 2016): 40–53. https://doi.org/10.1093/biostatistics/kxv021.
MLA: Lin, Lin, et al. “Discriminative variable subsets in Bayesian classification with mixture models, with application in flow cytometry studies.” Biostatistics, vol. 17, no. 1, Jan. 2016, pp. 40–53. Pubmed, doi:10.1093/biostatistics/kxv021.