Skip to main content
construction release_alert
Scholars@Duke will be undergoing maintenance April 11-15. Some features may be unavailable during this time.
cancel
Journal cover image

Bayesian Approximate Kernel Regression with Variable Selection.

Publication ,  Journal Article
Crawford, L; Wood, KC; Zhou, X; Mukherjee, S
Published in: J Am Stat Assoc
2018

Nonlinear kernel regression models are often used in statistics and machine learning because they are more accurate than linear models. Variable selection for kernel regression models is a challenge partly because, unlike the linear regression setting, there is no clear concept of an effect size for regression coefficients. In this paper, we propose a novel framework that provides an effect size analog for each explanatory variable in Bayesian kernel regression models when the kernel is shift-invariant - for example, the Gaussian kernel. We use function analytic properties of shift-invariant reproducing kernel Hilbert spaces (RKHS) to define a linear vector space that: (i) captures nonlinear structure, and (ii) can be projected onto the original explanatory variables. This projection onto the original explanatory variables serves as an analog of effect sizes. The specific function analytic property we use is that shift-invariant kernel functions can be approximated via random Fourier bases. Based on the random Fourier expansion, we propose a computationally efficient class of Bayesian approximate kernel regression (BAKR) models for both nonlinear regression and binary classification for which one can compute an analog of effect sizes. We illustrate the utility of BAKR by examining two important problems in statistical genetics: genomic selection (i.e. phenotypic prediction) and association mapping (i.e. inference of significant variants or loci). State-of-the-art methods for genomic selection and association mapping are based on kernel regression and linear models, respectively. BAKR is the first method that is competitive in both settings.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

J Am Stat Assoc

DOI

ISSN

0162-1459

Publication Date

2018

Volume

113

Issue

524

Start / End Page

1710 / 1721

Location

United States

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Crawford, L., Wood, K. C., Zhou, X., & Mukherjee, S. (2018). Bayesian Approximate Kernel Regression with Variable Selection. J Am Stat Assoc, 113(524), 1710–1721. https://doi.org/10.1080/01621459.2017.1361830
Crawford, Lorin, Kris C. Wood, Xiang Zhou, and Sayan Mukherjee. “Bayesian Approximate Kernel Regression with Variable Selection.J Am Stat Assoc 113, no. 524 (2018): 1710–21. https://doi.org/10.1080/01621459.2017.1361830.
Crawford L, Wood KC, Zhou X, Mukherjee S. Bayesian Approximate Kernel Regression with Variable Selection. J Am Stat Assoc. 2018;113(524):1710–21.
Crawford, Lorin, et al. “Bayesian Approximate Kernel Regression with Variable Selection.J Am Stat Assoc, vol. 113, no. 524, 2018, pp. 1710–21. Pubmed, doi:10.1080/01621459.2017.1361830.
Crawford L, Wood KC, Zhou X, Mukherjee S. Bayesian Approximate Kernel Regression with Variable Selection. J Am Stat Assoc. 2018;113(524):1710–1721.
Journal cover image

Published In

J Am Stat Assoc

DOI

ISSN

0162-1459

Publication Date

2018

Volume

113

Issue

524

Start / End Page

1710 / 1721

Location

United States

Related Subject Headings

  • Statistics & Probability
  • 4905 Statistics
  • 3802 Econometrics
  • 1603 Demography
  • 1403 Econometrics
  • 0104 Statistics