Hierarchical factor modeling of proteomics data


Journal Article

This paper presents a hierarchical bayesian factor model specifically designed to model the known correlation structure of both peptides and proteins in unbiased, label free proteomics. The model utilizes partial identification information from peptide sequencing and database lookup as well as observed correlation in the data set in order to appropriately compress features into metaproteins and to estimate correlation structure. Although peptide to phenotype associations may be computed from hypothesis testing or multiple regression summaries, to date, there have been no published approaches that directly model what we know to be multiple different levels of correlation structure. We test the the proposed model using publicly available benchmark data and a recent study based on a collection of volunteers who were infected with two different strands of viral influenza. © 2012 IEEE.

Full Text

Duke Authors

Cited Authors

  • Henao, R; Thompson, JW; Moseley, MA; Ginsburg, GS; Carin, L; Lucas, JE

Published Date

  • May 8, 2012

Published In

  • 2012 Ieee 2nd International Conference on Computational Advances in Bio and Medical Sciences, Iccabs 2012

Digital Object Identifier (DOI)

  • 10.1109/ICCABS.2012.6182638

Citation Source

  • Scopus