Skip to main content
Journal cover image

gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.

Publication ,  Journal Article
Larson, NB; McDonnell, S; Cannon Albright, L; Teerlink, C; Stanford, J; Ostrander, EA; Isaacs, WB; Xu, J; Cooney, KA; Lange, E; Schleutker, J ...
Published in: Genet Epidemiol
May 2017

Next-generation sequencing technologies have afforded unprecedented characterization of low-frequency and rare genetic variation. Due to low power for single-variant testing, aggregative methods are commonly used to combine observed rare variation within a single gene. Causal variation may also aggregate across multiple genes within relevant biomolecular pathways. Kernel-machine regression and adaptive testing methods for aggregative rare-variant association testing have been demonstrated to be powerful approaches for pathway-level analysis, although these methods tend to be computationally intensive at high-variant dimensionality and require access to complete data. An additional analytical issue in scans of large pathway definition sets is multiple testing correction. Gene set definitions may exhibit substantial genic overlap, and the impact of the resultant correlation in test statistics on Type I error rate control for large agnostic gene set scans has not been fully explored. Herein, we first outline a statistical strategy for aggregative rare-variant analysis using component gene-level linear kernel score test summary statistics as well as derive simple estimators of the effective number of tests for family-wise error rate control. We then conduct extensive simulation studies to characterize the behavior of our approach relative to direct application of kernel and adaptive methods under a variety of conditions. We also apply our method to two case-control studies, respectively, evaluating rare variation in hereditary prostate cancer and schizophrenia. Finally, we provide open-source R code for public use to facilitate easy application of our methods to existing rare-variant analysis results.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Genet Epidemiol

DOI

EISSN

1098-2272

Publication Date

May 2017

Volume

41

Issue

4

Start / End Page

297 / 308

Location

United States

Related Subject Headings

  • Statistics, Nonparametric
  • Sample Size
  • Models, Genetic
  • Humans
  • Genetic Variation
  • Genetic Association Studies
  • Epidemiology
  • Computer Simulation
  • Algorithms
  • 4202 Epidemiology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Larson, N. B., McDonnell, S., Cannon Albright, L., Teerlink, C., Stanford, J., Ostrander, E. A., … Schaid, D. J. (2017). gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels. Genet Epidemiol, 41(4), 297–308. https://doi.org/10.1002/gepi.22036
Larson, Nicholas B., Shannon McDonnell, Lisa Cannon Albright, Craig Teerlink, Janet Stanford, Elaine A. Ostrander, William B. Isaacs, et al. “gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.Genet Epidemiol 41, no. 4 (May 2017): 297–308. https://doi.org/10.1002/gepi.22036.
Larson NB, McDonnell S, Cannon Albright L, Teerlink C, Stanford J, Ostrander EA, et al. gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels. Genet Epidemiol. 2017 May;41(4):297–308.
Larson, Nicholas B., et al. “gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels.Genet Epidemiol, vol. 41, no. 4, May 2017, pp. 297–308. Pubmed, doi:10.1002/gepi.22036.
Larson NB, McDonnell S, Cannon Albright L, Teerlink C, Stanford J, Ostrander EA, Isaacs WB, Xu J, Cooney KA, Lange E, Schleutker J, Carpten JD, Powell I, Bailey-Wilson JE, Cussenot O, Cancel-Tassin G, Giles GG, MacInnis RJ, Maier C, Whittemore AS, Hsieh C-L, Wiklund F, Catalona WJ, Foulkes W, Mandal D, Eeles R, Kote-Jarai Z, Ackerman MJ, Olson TM, Klein CJ, Thibodeau SN, Schaid DJ. gsSKAT: Rapid gene set analysis and multiple testing correction for rare-variant association studies using weighted linear kernels. Genet Epidemiol. 2017 May;41(4):297–308.
Journal cover image

Published In

Genet Epidemiol

DOI

EISSN

1098-2272

Publication Date

May 2017

Volume

41

Issue

4

Start / End Page

297 / 308

Location

United States

Related Subject Headings

  • Statistics, Nonparametric
  • Sample Size
  • Models, Genetic
  • Humans
  • Genetic Variation
  • Genetic Association Studies
  • Epidemiology
  • Computer Simulation
  • Algorithms
  • 4202 Epidemiology