Kinship estimation bias carries over to heritability estimation bias using variance components.
Heritability is a fundamental parameter of diseases and other traits, quantifying the contribution of genetics to that trait. Kinship matrices are required for heritability estimation with variance components models. However, the most common "standard" kinship estimator employed by GCTA (Genome-wide Complex Trait Analysis) and other approaches, can be severely biased in structured populations. In this study, we characterize two heritability estimation biases in GCTA due to kinship estimation biases under population structure. For the standard ratio-of-means (ROM) kinship estimator, we derive a closed-form expression for first heritability bias given by the mean kinship and the true heritability. The standard mean-of-ratios (MOR) estimator, which is the most widely used in practice, exhibits both the bias shared with ROM and an additional, more severe bias caused by the upweighting of low-frequency variants. Using simulations with admixture and family structures, as well as simulated traits from 1000 Genomes genotypes, we find that only Popkin-the only unbiased population kinship estimator-produces unbiased heritability estimates in structured settings. Pedigree-only estimates have upward heritability biases when there is population structure. Finally, we analyze three structured datasets with real phenotypes-the San Antonio Family Study, the Hispanic Community Health Study/Study of Latinos, and a multiethnic Nephrotic Syndrome cohort. The standard MOR estimator can produce both downward and upward heritability biases depending on population structure and variant frequency spectrum, compared to the other two estimators. Overall, common kinship estimators result in heritability estimation biases when applied to structured populations, a challenge that Popkin successfully overcomes.
Duke Scholars
Published In
DOI
EISSN
Publication Date
Location
Related Subject Headings
- Developmental Biology
- 3105 Genetics
- 3101 Biochemistry and cell biology
- 0604 Genetics
Citation
Published In
DOI
EISSN
Publication Date
Location
Related Subject Headings
- Developmental Biology
- 3105 Genetics
- 3101 Biochemistry and cell biology
- 0604 Genetics