Skip to main content
Journal cover image

Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms.

Publication ,  Journal Article
Yuan, L; Chen, Q; Al-Hallaq, H; Yang, J; Yang, X; Geng, H; Latifi, K; Cai, B; Wu, QJ; Xiao, Y; Benedict, SH; Rong, Y; Buchsbaum, J; Qi, XS
Published in: Pract Radiat Oncol
2026

PURPOSE: This study aims to evaluate organs-at-risk (OARs) segmentation variability across 8 commercial artificial intelligence (AI)-based segmentation software using independent multi-institutional data sets, and to provide recommendations for clinical practices using AI-segmentation. METHODS AND MATERIALS: A total of 160 planning computed tomography image sets from 4 anatomic sites: head and neck, thorax, abdomen, and pelvis were retrospectively pooled from 3 institutions. Contours for 31 OARs generated by the software were compared to clinical contours using multiple accuracy metrics, including: dice similarity coefficient (DSC), 95 percentile of Hausdorff distance, surface DSC, as well as relative added path length as an efficiency metric. A 2-factor analysis of variance was used to quantify variability in contouring accuracy across software platforms (intersoftware) and patients (interpatient). Pairwise comparisons were performed to categorize the software into different performance groups, and intersoftware variations were calculated as the average performance differences between the groups. RESULTS: Significant intersoftware and interpatient contouring accuracy variations (P < .05) were observed for most OARs. The largest intersoftware variations in DSC in each anatomic region were cervical esophagus (0.41), trachea (0.10), spinal cord (0.13), and prostate (0.17). Among the organs evaluated, 7 had mean DSC >0.9 (ie, heart, liver), 15 had DSC ranging from 0.7 to 0.89 (ie, parotid, esophagus). The remaining organs (ie, optic nerves, seminal vesicle) had DSC<0.7. Of the 31 organs, 16 (52%) had relative added path length less than 0.1. CONCLUSIONS: Our results reveal significant intersoftware and interpatient variability in the performance of AI-segmentation software. These findings highlight the need of thorough software commissioning, testing, and quality assurance across disease sites, patient-specific anatomies, and image acquisition protocols.

Duke Scholars

Published In

Pract Radiat Oncol

DOI

EISSN

1879-8519

Publication Date

2026

Volume

16

Issue

1

Start / End Page

e47 / e59

Location

United States

Related Subject Headings

  • Tomography, X-Ray Computed
  • Software
  • Retrospective Studies
  • Radiotherapy Planning, Computer-Assisted
  • Organs at Risk
  • Male
  • Image Processing, Computer-Assisted
  • Humans
  • Head and Neck Neoplasms
  • Artificial Intelligence
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yuan, L., Chen, Q., Al-Hallaq, H., Yang, J., Yang, X., Geng, H., … Qi, X. S. (2026). Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms. Pract Radiat Oncol, 16(1), e47–e59. https://doi.org/10.1016/j.prro.2025.06.012
Yuan, Lulin, Quan Chen, Hania Al-Hallaq, Jinzhong Yang, Xiaofeng Yang, Huaizhi Geng, Kujtim Latifi, et al. “Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms.Pract Radiat Oncol 16, no. 1 (2026): e47–59. https://doi.org/10.1016/j.prro.2025.06.012.
Yuan L, Chen Q, Al-Hallaq H, Yang J, Yang X, Geng H, et al. Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms. Pract Radiat Oncol. 2026;16(1):e47–59.
Yuan, Lulin, et al. “Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms.Pract Radiat Oncol, vol. 16, no. 1, 2026, pp. e47–59. Pubmed, doi:10.1016/j.prro.2025.06.012.
Yuan L, Chen Q, Al-Hallaq H, Yang J, Yang X, Geng H, Latifi K, Cai B, Wu QJ, Xiao Y, Benedict SH, Rong Y, Buchsbaum J, Qi XS. Quantitative Evaluation of Artificial Intelligence-Based Organ Segmentation Across Multiple Anatomic Sites Using 8 Commercial Software Platforms. Pract Radiat Oncol. 2026;16(1):e47–e59.
Journal cover image

Published In

Pract Radiat Oncol

DOI

EISSN

1879-8519

Publication Date

2026

Volume

16

Issue

1

Start / End Page

e47 / e59

Location

United States

Related Subject Headings

  • Tomography, X-Ray Computed
  • Software
  • Retrospective Studies
  • Radiotherapy Planning, Computer-Assisted
  • Organs at Risk
  • Male
  • Image Processing, Computer-Assisted
  • Humans
  • Head and Neck Neoplasms
  • Artificial Intelligence