Skip to main content
Journal cover image

Unsupervised Bayesian linear unmixing of gene expression microarrays.

Publication ,  Journal Article
Bazot, C; Dobigeon, N; Tourneret, J-Y; Zaas, AK; Ginsburg, GS; Hero, AO
Published in: BMC Bioinformatics
March 19, 2013

BACKGROUND: This paper introduces a new constrained model and the corresponding algorithm, called unsupervised Bayesian linear unmixing (uBLU), to identify biological signatures from high dimensional assays like gene expression microarrays. The basis for uBLU is a Bayesian model for the data samples which are represented as an additive mixture of random positive gene signatures, called factors, with random positive mixing coefficients, called factor scores, that specify the relative contribution of each signature to a specific sample. The particularity of the proposed method is that uBLU constrains the factor loadings to be non-negative and the factor scores to be probability distributions over the factors. Furthermore, it also provides estimates of the number of factors. A Gibbs sampling strategy is adopted here to generate random samples according to the posterior distribution of the factors, factor scores, and number of factors. These samples are then used to estimate all the unknown parameters. RESULTS: Firstly, the proposed uBLU method is applied to several simulated datasets with known ground truth and compared with previous factor decomposition methods, such as principal component analysis (PCA), non negative matrix factorization (NMF), Bayesian factor regression modeling (BFRM), and the gradient-based algorithm for general matrix factorization (GB-GMF). Secondly, we illustrate the application of uBLU on a real time-evolving gene expression dataset from a recent viral challenge study in which individuals have been inoculated with influenza A/H3N2/Wisconsin. We show that the uBLU method significantly outperforms the other methods on the simulated and real data sets considered here. CONCLUSIONS: The results obtained on synthetic and real data illustrate the accuracy of the proposed uBLU method when compared to other factor decomposition methods from the literature (PCA, NMF, BFRM, and GB-GMF). The uBLU method identifies an inflammatory component closely associated with clinical symptom scores collected during the study. Using a constrained model allows recovery of all the inflammatory genes in a single factor.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

BMC Bioinformatics

DOI

EISSN

1471-2105

Publication Date

March 19, 2013

Volume

14

Start / End Page

99

Location

England

Related Subject Headings

  • Microarray Analysis
  • Male
  • Influenza, Human
  • Influenza A Virus, H3N2 Subtype
  • Humans
  • Gene Expression Profiling
  • Bioinformatics
  • Bayes Theorem
  • Algorithms
  • 49 Mathematical sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Bazot, C., Dobigeon, N., Tourneret, J.-Y., Zaas, A. K., Ginsburg, G. S., & Hero, A. O. (2013). Unsupervised Bayesian linear unmixing of gene expression microarrays. BMC Bioinformatics, 14, 99. https://doi.org/10.1186/1471-2105-14-99
Bazot, Cécile, Nicolas Dobigeon, Jean-Yves Tourneret, Aimee K. Zaas, Geoffrey S. Ginsburg, and Alfred O. Hero. “Unsupervised Bayesian linear unmixing of gene expression microarrays.BMC Bioinformatics 14 (March 19, 2013): 99. https://doi.org/10.1186/1471-2105-14-99.
Bazot C, Dobigeon N, Tourneret J-Y, Zaas AK, Ginsburg GS, Hero AO. Unsupervised Bayesian linear unmixing of gene expression microarrays. BMC Bioinformatics. 2013 Mar 19;14:99.
Bazot, Cécile, et al. “Unsupervised Bayesian linear unmixing of gene expression microarrays.BMC Bioinformatics, vol. 14, Mar. 2013, p. 99. Pubmed, doi:10.1186/1471-2105-14-99.
Bazot C, Dobigeon N, Tourneret J-Y, Zaas AK, Ginsburg GS, Hero AO. Unsupervised Bayesian linear unmixing of gene expression microarrays. BMC Bioinformatics. 2013 Mar 19;14:99.
Journal cover image

Published In

BMC Bioinformatics

DOI

EISSN

1471-2105

Publication Date

March 19, 2013

Volume

14

Start / End Page

99

Location

England

Related Subject Headings

  • Microarray Analysis
  • Male
  • Influenza, Human
  • Influenza A Virus, H3N2 Subtype
  • Humans
  • Gene Expression Profiling
  • Bioinformatics
  • Bayes Theorem
  • Algorithms
  • 49 Mathematical sciences