Scholars@Duke publication: Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.

Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.

Publication , Journal Article

Yi, H; Breheny, P; Imam, N; Liu, Y; Hoeschele, I

Published in: Genetics

January 2015

The data from genome-wide association studies (GWAS) in humans are still predominantly analyzed using single-marker association methods. As an alternative to single-marker analysis (SMA), all or subsets of markers can be tested simultaneously. This approach requires a form of penalized regression (PR) as the number of SNPs is much larger than the sample size. Here we review PR methods in the context of GWAS, extend them to perform penalty parameter and SNP selection by false discovery rate (FDR) control, and assess their performance in comparison with SMA. PR methods were compared with SMA, using realistically simulated GWAS data with a continuous phenotype and real data. Based on these comparisons our analytic FDR criterion may currently be the best approach to SNP selection using PR for GWAS. We found that PR with FDR control provides substantially more power than SMA with genome-wide type-I error control but somewhat less power than SMA with Benjamini-Hochberg FDR control (SMA-BH). PR with FDR-based penalty parameter selection controlled the FDR somewhat conservatively while SMA-BH may not achieve FDR control in all situations. Differences among PR methods seem quite small when the focus is on SNP selection with FDR control. Incorporating linkage disequilibrium into the penalization by adapting penalties developed for covariates measured on graphs can improve power but also generate more false positives or wider regions for follow-up. We recommend the elastic net with a mixing weight for the Lasso penalty near 0.5 as the best method.

Duke Scholars

Author Yongmei Liu Medicine, Cardiology

Published In

Genetics

DOI

10.1534/genetics.114.167817

EISSN

1943-2631

Publication Date

January 2015

Volume

199

Issue

Start / End Page

205 / 222

Location

United States

Related Subject Headings

Quantitative Trait, Heritable
Polymorphism, Single Nucleotide
Humans
Genome-Wide Association Study
Genome, Human
Developmental Biology
Algorithms
3105 Genetics
3101 Biochemistry and cell biology
0604 Genetics

Citation

APA

Chicago

ICMJE

MLA

NLM

Yi, H., Breheny, P., Imam, N., Liu, Y., & Hoeschele, I. (2015). Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits. Genetics, 199(1), 205–222. https://doi.org/10.1534/genetics.114.167817

Yi, Hui, Patrick Breheny, Netsanet Imam, Yongmei Liu, and Ina Hoeschele. “Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.” Genetics 199, no. 1 (January 2015): 205–22. https://doi.org/10.1534/genetics.114.167817.

Yi H, Breheny P, Imam N, Liu Y, Hoeschele I. Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits. Genetics. 2015 Jan;199(1):205–22.

Yi, Hui, et al. “Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.” Genetics, vol. 199, no. 1, Jan. 2015, pp. 205–22. Pubmed, doi:10.1534/genetics.114.167817.

Yi H, Breheny P, Imam N, Liu Y, Hoeschele I. Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits. Genetics. 2015 Jan;199(1):205–222.

Published In

Genetics

DOI

10.1534/genetics.114.167817

EISSN

1943-2631

Publication Date

January 2015

Volume

199

Issue

Start / End Page

205 / 222

Location

United States

Related Subject Headings

Quantitative Trait, Heritable
Polymorphism, Single Nucleotide
Humans
Genome-Wide Association Study
Genome, Human
Developmental Biology
Algorithms
3105 Genetics
3101 Biochemistry and cell biology
0604 Genetics