Robust analysis of secondary phenotypes in case-control genetic association studies.


Journal Article

The case-control study is a common design for assessing the association between genetic exposures and a disease phenotype. Though association with a given (case-control) phenotype is always of primary interest, there is often considerable interest in assessing relationships between genetic exposures and other (secondary) phenotypes. However, the case-control sample represents a biased sample from the general population. As a result, if this sampling framework is not correctly taken into account, analyses estimating the effect of exposures on secondary phenotypes can be biased leading to incorrect inference. In this paper, we address this problem and propose a general approach for estimating and testing the population effect of a genetic variant on a secondary phenotype. Our approach is based on inverse probability weighted estimating equations, where the weights depend on genotype and the secondary phenotype. We show that, though slightly less efficient than a full likelihood-based analysis when the likelihood is correctly specified, it is substantially more robust to model misspecification, and can out-perform likelihood-based analysis, both in terms of validity and power, when the model is misspecified. We illustrate our approach with an application to a case-control study extracted from the Framingham Heart Study. Copyright © 2016 John Wiley & Sons, Ltd.

Full Text

Duke Authors

Cited Authors

  • Xing, C; M McCarthy, J; Dupuis, J; Adrienne Cupples, L; B Meigs, J; Lin, X; S Allen, A

Published Date

  • October 15, 2016

Published In

Volume / Issue

  • 35 / 23

Start / End Page

  • 4226 - 4237

PubMed ID

  • 27241694

Pubmed Central ID

  • 27241694

Electronic International Standard Serial Number (EISSN)

  • 1097-0258

Digital Object Identifier (DOI)

  • 10.1002/sim.6976


  • eng

Conference Location

  • England