Skip to main content
Journal cover image

Biases introduced by filtering electronic health records for patients with "complete data".

Publication ,  Journal Article
Weber, GM; Adams, WG; Bernstam, EV; Bickel, JP; Fox, KP; Marsolo, K; Raghavan, VA; Turchin, A; Zhou, X; Murphy, SN; Mandl, KD
Published in: J Am Med Inform Assoc
November 1, 2017

OBJECTIVE: One promise of nationwide adoption of electronic health records (EHRs) is the availability of data for large-scale clinical research studies. However, because the same patient could be treated at multiple health care institutions, data from only a single site might not contain the complete medical history for that patient, meaning that critical events could be missing. In this study, we evaluate how simple heuristic checks for data "completeness" affect the number of patients in the resulting cohort and introduce potential biases. MATERIALS AND METHODS: We began with a set of 16 filters that check for the presence of demographics, laboratory tests, and other types of data, and then systematically applied all 216 possible combinations of these filters to the EHR data for 12 million patients at 7 health care systems and a separate payor claims database of 7 million members. RESULTS: EHR data showed considerable variability in data completeness across sites and high correlation between data types. For example, the fraction of patients with diagnoses increased from 35.0% in all patients to 90.9% in those with at least 1 medication. An unrelated claims dataset independently showed that most filters select members who are older and more likely female and can eliminate large portions of the population whose data are actually complete. DISCUSSION AND CONCLUSION: As investigators design studies, they need to balance their confidence in the completeness of the data with the effects of placing requirements on the data on the resulting patient cohort.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

J Am Med Inform Assoc

DOI

EISSN

1527-974X

Publication Date

November 1, 2017

Volume

24

Issue

6

Start / End Page

1134 / 1141

Location

England

Related Subject Headings

  • Medical Informatics
  • Insurance Claim Reporting
  • Information Storage and Retrieval
  • Humans
  • Electronic Health Records
  • Data Accuracy
  • Bias
  • 46 Information and computing sciences
  • 42 Health sciences
  • 32 Biomedical and clinical sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Weber, G. M., Adams, W. G., Bernstam, E. V., Bickel, J. P., Fox, K. P., Marsolo, K., … Mandl, K. D. (2017). Biases introduced by filtering electronic health records for patients with "complete data". J Am Med Inform Assoc, 24(6), 1134–1141. https://doi.org/10.1093/jamia/ocx071
Weber, Griffin M., William G. Adams, Elmer V. Bernstam, Jonathan P. Bickel, Kathe P. Fox, Keith Marsolo, Vijay A. Raghavan, et al. “Biases introduced by filtering electronic health records for patients with "complete data".J Am Med Inform Assoc 24, no. 6 (November 1, 2017): 1134–41. https://doi.org/10.1093/jamia/ocx071.
Weber GM, Adams WG, Bernstam EV, Bickel JP, Fox KP, Marsolo K, et al. Biases introduced by filtering electronic health records for patients with "complete data". J Am Med Inform Assoc. 2017 Nov 1;24(6):1134–41.
Weber, Griffin M., et al. “Biases introduced by filtering electronic health records for patients with "complete data".J Am Med Inform Assoc, vol. 24, no. 6, Nov. 2017, pp. 1134–41. Pubmed, doi:10.1093/jamia/ocx071.
Weber GM, Adams WG, Bernstam EV, Bickel JP, Fox KP, Marsolo K, Raghavan VA, Turchin A, Zhou X, Murphy SN, Mandl KD. Biases introduced by filtering electronic health records for patients with "complete data". J Am Med Inform Assoc. 2017 Nov 1;24(6):1134–1141.
Journal cover image

Published In

J Am Med Inform Assoc

DOI

EISSN

1527-974X

Publication Date

November 1, 2017

Volume

24

Issue

6

Start / End Page

1134 / 1141

Location

England

Related Subject Headings

  • Medical Informatics
  • Insurance Claim Reporting
  • Information Storage and Retrieval
  • Humans
  • Electronic Health Records
  • Data Accuracy
  • Bias
  • 46 Information and computing sciences
  • 42 Health sciences
  • 32 Biomedical and clinical sciences