Applying family analyses to electronic health records to facilitate genetic research.

Published

Journal Article

Motivation: Pedigree analysis is a longstanding and powerful approach to gain insight into the underlying genetic factors in human health, but identifying, recruiting and genotyping families can be difficult, time consuming and costly. Development of high throughput methods to identify families and foster downstream analyses are necessary. Results: This paper describes simple methods that allowed us to identify 173 368 family pedigrees with high probability using basic demographic data available in most electronic health records (EHRs). We further developed and validate a novel statistical method that uses EHR data to identify families more likely to have a major genetic component to their diseases risk. Lastly, we showed that incorporating EHR-linked family data into genetic association testing may provide added power for genetic mapping without additional recruitment or genotyping. The totality of these results suggests that EHR-linked families can enable classical genetic analyses in a high-throughput manner. Availability and implementation: Pseudocode is provided as supplementary information. Contact: HEBBRING.SCOTT@marshfieldresearch.org. Supplementary information: Supplementary data are available at Bioinformatics online.

Full Text

Duke Authors

Cited Authors

  • Huang, X; Elston, RC; Rosa, GJ; Mayer, J; Ye, Z; Kitchner, T; Brilliant, MH; Page, D; Hebbring, SJ; Stegle, O

Published Date

  • February 15, 2018

Published In

Volume / Issue

  • 34 / 4

Start / End Page

  • 635 - 642

PubMed ID

  • 28968884

Pubmed Central ID

  • 28968884

Electronic International Standard Serial Number (EISSN)

  • 1367-4811

Digital Object Identifier (DOI)

  • 10.1093/bioinformatics/btx569

Language

  • eng

Conference Location

  • England