Skip to main content

A data pipeline for secure extraction and sharing of social determinants of health.

Publication ,  Journal Article
Schappe, T; McElroy, LM; Ogundolie, M; Matsouaka, R; Rogers, U; Bhavsar, NA
Published in: PLoS One
2025

OBJECTIVES: Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deidentify, and share neighborhood-level SDOH data across multiple health systems. METHODS: We created a pipeline centered around Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) that utilizes containerization to geocode patient addresses and obtain neighborhood-level SDOH variables. We compared DeGAUSS to a third-party vendor geocoding tool available at Duke Health using a cohort of adult patients referred for abdominal transplant from January 1, 2016, to December 31, 2022. We calculated Cohen's Kappa and percent disagreement at census block group and tract levels, and by Area Deprivation Index, urbanicity, and year. RESULTS: The pipeline successfully generated SDOH data for 97.8% of addresses. There was high concordance between DeGAUSS and the vendor tool at the census block group (0.93) and tract levels (0.95). At the block group level, disagreement proportion differed by year and urbanicity, with larger disagreement in the rural category than in micropolitan and metropolitan categories (13%, 7%, 6.2%, respectively). DISCUSSION AND CONCLUSION: We describe a novel pipeline that can facilitate the secure acquisition and sharing of neighborhood-level SDOH without sharing PHI. The pipeline can be scaled to include additional social, climate, and environmental variables, and can be extended to an unlimited number of health systems.

Duke Scholars

Published In

PLoS One

DOI

EISSN

1932-6203

Publication Date

2025

Volume

20

Issue

1

Start / End Page

e0317215

Location

United States

Related Subject Headings

  • Social Determinants of Health
  • Residence Characteristics
  • Neighborhood Characteristics
  • Middle Aged
  • Male
  • Information Dissemination
  • Humans
  • General Science & Technology
  • Female
  • Adult
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Schappe, T., McElroy, L. M., Ogundolie, M., Matsouaka, R., Rogers, U., & Bhavsar, N. A. (2025). A data pipeline for secure extraction and sharing of social determinants of health. PLoS One, 20(1), e0317215. https://doi.org/10.1371/journal.pone.0317215
Schappe, Tyler, Lisa M. McElroy, Moronke Ogundolie, Roland Matsouaka, Ursula Rogers, and Nrupen A. Bhavsar. “A data pipeline for secure extraction and sharing of social determinants of health.PLoS One 20, no. 1 (2025): e0317215. https://doi.org/10.1371/journal.pone.0317215.
Schappe T, McElroy LM, Ogundolie M, Matsouaka R, Rogers U, Bhavsar NA. A data pipeline for secure extraction and sharing of social determinants of health. PLoS One. 2025;20(1):e0317215.
Schappe, Tyler, et al. “A data pipeline for secure extraction and sharing of social determinants of health.PLoS One, vol. 20, no. 1, 2025, p. e0317215. Pubmed, doi:10.1371/journal.pone.0317215.
Schappe T, McElroy LM, Ogundolie M, Matsouaka R, Rogers U, Bhavsar NA. A data pipeline for secure extraction and sharing of social determinants of health. PLoS One. 2025;20(1):e0317215.

Published In

PLoS One

DOI

EISSN

1932-6203

Publication Date

2025

Volume

20

Issue

1

Start / End Page

e0317215

Location

United States

Related Subject Headings

  • Social Determinants of Health
  • Residence Characteristics
  • Neighborhood Characteristics
  • Middle Aged
  • Male
  • Information Dissemination
  • Humans
  • General Science & Technology
  • Female
  • Adult