
Methods for interaction analyses using family-based case-control data: conditional logistic regression versus generalized estimating equations.

Publication, Journal Article
Hancock, DB; Martin, ER; Li, Y-J; Scott, WK
Published in: Genet Epidemiol
December 2007

A complex web of gene-gene and gene-environment interactions likely underlies late-onset disease development. We compared conditional logistic regression (CLR) and generalized estimating equations (GEE) in modeling such interactions in pedigrees with missing parents. Using the simulation of linkage and association (SIMLA) program, we generated family-based data sets containing disease genes, an environmental risk factor, gene-gene interaction, and gene-environment interaction. Four scenarios for the relationship between the marker and disease loci were examined: linkage and association, linkage without association, association without linkage, and absence of both linkage and association. Models for CLR and GEE (with exchangeable and independence correlation matrices) were built, and type I error, power, average odds ratio (OR), standard deviation, and 95% confidence intervals were estimated. CLR and GEE were valid tests of association in the presence of linkage, but type I error was inflated for association without linkage, particularly with GEE. CLR generated estimates of the OR with lower bias but often more variability than the OR estimates observed for GEE. Further, GEE was more powerful than CLR in detecting main and interactive effects. Although GEE had similar power with either matrix, the independence matrix yielded lower type I error and less biased OR estimates than the exchangeable matrix. Our findings support the use of GEE to maximize power to detect gene-gene and gene-environment interactions, but they caution against its use when association without linkage is possible (e.g., under population stratification) and call for care in interpreting its OR estimates.
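
The summary measures named in the abstract (type I error, power, average OR, standard deviation, and 95% confidence intervals) can be illustrated with a minimal sketch. This is not the authors' code. The function name `summarize_replicates`, its arguments, and the normal-approximation interval for the mean OR are assumptions made for illustration, and the paper may define its interval differently. The rejection rate is read as type I error when replicates are simulated under the null and as power when they are simulated under the alternative.

```python
import math
import statistics


def summarize_replicates(p_values, odds_ratios, alpha=0.05):
    """Summarize one simulated scenario across replicates (hypothetical helper).

    p_values    -- one p-value per replicate for the model term being tested
    odds_ratios -- one OR estimate per replicate for the same term
    alpha       -- significance threshold used to count rejections
    """
    if len(p_values) != len(odds_ratios):
        raise ValueError("p_values and odds_ratios must have the same length")
    n = len(p_values)
    if n < 2:
        raise ValueError("at least two replicates are needed")

    # Fraction of replicates rejecting the null at level alpha:
    # type I error under a null scenario, power under an alternative scenario.
    rejection_rate = sum(p < alpha for p in p_values) / n

    # Average OR and its sample standard deviation across replicates.
    mean_or = statistics.mean(odds_ratios)
    sd_or = statistics.stdev(odds_ratios)

    # Assumption: normal-approximation 95% interval for the mean OR.
    half_width = 1.96 * sd_or / math.sqrt(n)

    return {
        "rejection_rate": rejection_rate,
        "mean_or": mean_or,
        "sd_or": sd_or,
        "ci95": (mean_or - half_width, mean_or + half_width),
    }


# Illustrative inputs only; these numbers are not results from the paper.
demo = summarize_replicates([0.01, 0.20, 0.03, 0.60], [1.0, 2.0, 3.0, 2.0])
print(demo)
```

Calling the helper once per method (CLR, GEE with an exchangeable matrix, GEE with an independence matrix) and per scenario would give a table of the kind the abstract describes, with bias read off as the difference between the mean OR and the simulated OR.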

Duke Scholars

Published In

Genet Epidemiol

DOI

10.1002/gepi.20249

ISSN

0741-0395

Publication Date

December 2007

Volume

31

Issue

8

Start / End Page

883 / 893

Location

United States

Related Subject Headings

  • Statistics as Topic
  • Regression Analysis
  • Pedigree
  • Models, Statistical
  • Models, Genetic
  • Humans
  • Genetics, Medical
  • Genes
  • Family Health
  • Epidemiology
 

Citation

APA
Hancock, D. B., Martin, E. R., Li, Y.-J., & Scott, W. K. (2007). Methods for interaction analyses using family-based case-control data: conditional logistic regression versus generalized estimating equations. Genet Epidemiol, 31(8), 883–893. https://doi.org/10.1002/gepi.20249
Chicago
Hancock, Dana B., Eden R. Martin, Yi-Ju Li, and William K. Scott. “Methods for interaction analyses using family-based case-control data: conditional logistic regression versus generalized estimating equations.” Genet Epidemiol 31, no. 8 (December 2007): 883–93. https://doi.org/10.1002/gepi.20249.
MLA
Hancock, Dana B., et al. “Methods for interaction analyses using family-based case-control data: conditional logistic regression versus generalized estimating equations.” Genet Epidemiol, vol. 31, no. 8, Dec. 2007, pp. 883–93. Pubmed, doi:10.1002/gepi.20249.