Skip to main content
Journal cover image

Estimating the observability of an outcome from an electronic health record data set using external data.

Publication ,  Journal Article
Yan, M; Hong, H; Wilson, J; Goldstein, BA
Published in: Am J Epidemiol
December 2, 2025

One of the key limitations of electronic health record (EHR) data is that not all health care encounters are observed. The degree to which patient information is captured is referred to as observability. Poor observability, particularly differential observability, can lead to biased estimates and inference. As such, understanding the degree of observability is important in EHR-based studies. In this study, we propose using external data with known observability to assess the degree of overall observability in EHRs. We also construct a test for differential observability in the target EHR data set. Using principles from the transportability literature, we show that we can use a balancing score-based weight to estimate the observability of our target outcome. We conduct a series of simulation experiments to understand the conditions under which data set features must be required to obtain proper inference. To illustrate this, we consider hospital readmissions among patients with end-stage renal disease as our outcome of interest. We use administrative claims data, where the outcome is fully observed, as our external data.

Duke Scholars

Published In

Am J Epidemiol

DOI

EISSN

1476-6256

Publication Date

December 2, 2025

Volume

194

Issue

12

Start / End Page

3224 / 3432

Location

United States

Related Subject Headings

  • Patient Readmission
  • Kidney Failure, Chronic
  • Humans
  • Epidemiology
  • Electronic Health Records
  • 4202 Epidemiology
  • 11 Medical and Health Sciences
  • 01 Mathematical Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yan, M., Hong, H., Wilson, J., & Goldstein, B. A. (2025). Estimating the observability of an outcome from an electronic health record data set using external data. Am J Epidemiol, 194(12), 3224–3432. https://doi.org/10.1093/aje/kwaf013
Yan, Mengying, Hwanhee Hong, Jonathan Wilson, and Benjamin A. Goldstein. “Estimating the observability of an outcome from an electronic health record data set using external data.Am J Epidemiol 194, no. 12 (December 2, 2025): 3224–3432. https://doi.org/10.1093/aje/kwaf013.
Yan M, Hong H, Wilson J, Goldstein BA. Estimating the observability of an outcome from an electronic health record data set using external data. Am J Epidemiol. 2025 Dec 2;194(12):3224–432.
Yan, Mengying, et al. “Estimating the observability of an outcome from an electronic health record data set using external data.Am J Epidemiol, vol. 194, no. 12, Dec. 2025, pp. 3224–432. Pubmed, doi:10.1093/aje/kwaf013.
Yan M, Hong H, Wilson J, Goldstein BA. Estimating the observability of an outcome from an electronic health record data set using external data. Am J Epidemiol. 2025 Dec 2;194(12):3224–3432.
Journal cover image

Published In

Am J Epidemiol

DOI

EISSN

1476-6256

Publication Date

December 2, 2025

Volume

194

Issue

12

Start / End Page

3224 / 3432

Location

United States

Related Subject Headings

  • Patient Readmission
  • Kidney Failure, Chronic
  • Humans
  • Epidemiology
  • Electronic Health Records
  • 4202 Epidemiology
  • 11 Medical and Health Sciences
  • 01 Mathematical Sciences