Coverage of rare disease names in standard terminologies and implications for patients, providers, and research.


Journal Article

Small numbers of patients are a special challenge for rare diseases research. Electronic health record (EHR) data can facilitate research if patients with rare diseases can be reliably identified. We estimate the coverage of the names of a set of 6,519 rare diseases. Using the UMLS, 697 (11%) diseases were matched to ICD-9-CM, 1,386 (21%) to ICD-10-CM and 2,848 (44%) to SNOMED CT. Using published mappings from SNOMED CT to ICD, we further estimate additional broader matches of 2,569 (39%) rare diseases to ICD-9-CM and 1,635 (25%) to ICD-10-CM. The number of codes that match one and only one disease are 1,081 (62%) for ICD-9-CM, 1,403 (73%) for ICD-10-CM, and 3,311 (85%) for SNOMED CT. Our findings confirm that SNOMED CT has the greatest coverage and specificity needed to identify patients with a rare disease from EHR-data, and can facilitate research and evidence-based care.

Full Text

Duke Authors

Cited Authors

  • Fung, KW; Richesson, R; Bodenreider, O

Published Date

  • January 2014

Published In

Volume / Issue

  • 2014 /

Start / End Page

  • 564 - 572

PubMed ID

  • 25954361

Pubmed Central ID

  • 25954361

Electronic International Standard Serial Number (EISSN)

  • 1942-597X


  • eng