Skip to main content
Journal cover image

Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics

Publication ,  Journal Article
Li, F; Zhang, NR
Published in: Journal of the American Statistical Association
September 1, 2010

We consider the problem of variable selection in regression modeling in high-dimensional spaces where there is known structure among the covariates. This is an unconventional variable selection problem for two reasons: (1) The dimension of the covariate space is comparable, and often much larger, than the number of subjects in the study, and (2) the covariate space is highly structured, and in some cases it is desirable to incorporate this structural information in to the model building process. We approach this problem through the Bayesian variable selection framework, where we assume that the covariates lie on an undirected graph and formulate an Ising prior on the model space for incorporating structural information. Certain computational and statistical problems arise that are unique to such high-dimensional, structured settings, the most interesting being the phenomenon of phase transitions. We propose theoretical and computational schemes to mitigate these problems. We illustrate our methods on two different graph structures: the linear chain and the regular graph of degree k. Finally, we use our methods to study a specific application in genomics: the modeling of transcription factor binding sites in DNA sequences. © 2010 American Statistical Association.

Duke Scholars

Published In

Journal of the American Statistical Association

DOI

ISSN

0162-1459

Publication Date

September 1, 2010

Volume

105

Issue

491

Start / End Page

1202 / 1214

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Li, F., & Zhang, N. R. (2010). Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics. Journal of the American Statistical Association, 105(491), 1202–1214. https://doi.org/10.1198/jasa.2010.tm08177
Li, F., and N. R. Zhang. “Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics.” Journal of the American Statistical Association 105, no. 491 (September 1, 2010): 1202–14. https://doi.org/10.1198/jasa.2010.tm08177.
Li F, Zhang NR. Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics. Journal of the American Statistical Association. 2010 Sep 1;105(491):1202–14.
Li, F., and N. R. Zhang. “Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics.” Journal of the American Statistical Association, vol. 105, no. 491, Sept. 2010, pp. 1202–14. Scopus, doi:10.1198/jasa.2010.tm08177.
Li F, Zhang NR. Bayesian variable selection in structured high-dimensional covariate spaces with applications in genomics. Journal of the American Statistical Association. 2010 Sep 1;105(491):1202–1214.
Journal cover image

Published In

Journal of the American Statistical Association

DOI

ISSN

0162-1459

Publication Date

September 1, 2010

Volume

105

Issue

491

Start / End Page

1202 / 1214

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics