Skip to main content

A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study.

Publication ,  Journal Article
Dong, X; Su, Y-R; Barfield, R; Bien, SA; He, Q; Harrison, TA; Huyghe, JR; Keku, TO; Lindor, NM; Schafmayer, C; Chan, AT; Gruber, SB; Hsu, L ...
Published in: PLoS genetics
August 2020

Genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants associated with various phenotypes, but together they explain only a fraction of heritability, suggesting many variants have yet to be discovered. Recently it has been recognized that incorporating functional information of genetic variants can improve power for identifying novel loci. For example, S-PrediXcan and TWAS tested the association of predicted gene expression with phenotypes based on GWAS summary statistics by leveraging the information on genetic regulation of gene expression and found many novel loci. However, as genetic variants may have effects on more than one gene and through different mechanisms, these methods likely only capture part of the total effects of these variants. In this paper, we propose a summary statistics-based mixed effects score test (sMiST) that tests for the total effect of both the effect of the mediator by imputing genetically predicted gene expression, like S-PrediXcan and TWAS, and the direct effects of individual variants. It allows for multiple functional annotations and multiple genetically predicted mediators. It can also perform conditional association analysis while adjusting for other genetic variants (e.g., known loci for the phenotype). Extensive simulation and real data analyses demonstrate that sMiST yields p-values that agree well with those obtained from individual level data but with substantively improved computational speed. Importantly, a broad application of sMiST to GWAS is possible, as only summary statistics of genetic variant associations are required. We apply sMiST to a large-scale GWAS of colorectal cancer using summary statistics from ∼120, 000 study participants and gene expression data from the Genotype-Tissue Expression (GTEx) project. We identify several novel and secondary independent genetic loci.

Duke Scholars

Published In

PLoS genetics

DOI

EISSN

1553-7404

ISSN

1553-7390

Publication Date

August 2020

Volume

16

Issue

8

Start / End Page

e1008947

Related Subject Headings

  • Quantitative Trait Loci
  • Polymorphism, Single Nucleotide
  • Phenotype
  • Models, Statistical
  • Humans
  • Genotype
  • Genome-Wide Association Study
  • Genetic Variation
  • Genetic Predisposition to Disease
  • Gene Expression Regulation, Neoplastic
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Dong, X., Su, Y.-R., Barfield, R., Bien, S. A., He, Q., Harrison, T. A., … Hsu, L. (2020). A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study. PLoS Genetics, 16(8), e1008947. https://doi.org/10.1371/journal.pgen.1008947
Dong, Xinyuan, Yu-Ru Su, Richard Barfield, Stephanie A. Bien, Qianchuan He, Tabitha A. Harrison, Jeroen R. Huyghe, et al. “A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study.PLoS Genetics 16, no. 8 (August 2020): e1008947. https://doi.org/10.1371/journal.pgen.1008947.
Dong X, Su Y-R, Barfield R, Bien SA, He Q, Harrison TA, et al. A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study. PLoS genetics. 2020 Aug;16(8):e1008947.
Dong, Xinyuan, et al. “A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study.PLoS Genetics, vol. 16, no. 8, Aug. 2020, p. e1008947. Epmc, doi:10.1371/journal.pgen.1008947.
Dong X, Su Y-R, Barfield R, Bien SA, He Q, Harrison TA, Huyghe JR, Keku TO, Lindor NM, Schafmayer C, Chan AT, Gruber SB, Jenkins MA, Kooperberg C, Peters U, Hsu L. A general framework for functionally informed set-based analysis: Application to a large-scale colorectal cancer study. PLoS genetics. 2020 Aug;16(8):e1008947.

Published In

PLoS genetics

DOI

EISSN

1553-7404

ISSN

1553-7390

Publication Date

August 2020

Volume

16

Issue

8

Start / End Page

e1008947

Related Subject Headings

  • Quantitative Trait Loci
  • Polymorphism, Single Nucleotide
  • Phenotype
  • Models, Statistical
  • Humans
  • Genotype
  • Genome-Wide Association Study
  • Genetic Variation
  • Genetic Predisposition to Disease
  • Gene Expression Regulation, Neoplastic