Utility of the virtual imaging trials methodology for objective characterization of AI systems and training data.
PURPOSE: The credibility of artificial intelligence (AI) models for medical imaging continues to be a challenge, affected by the diversity of models, the data used to train the models, and the applicability of their combination to produce reproducible results for new data. We aimed to explore whether emerging virtual imaging trial (VIT) methodologies can provide an objective resource to approach this challenge. APPROACH: We conducted this study for the case example of COVID-19 diagnosis using clinical and virtual computed tomography (CT) and chest radiography (CXR) processed with convolutional neural networks. Multiple AI models were developed and tested using 3D ResNet-like and 2D EfficientNetv2 architectures across diverse datasets. RESULTS: Model performance was evaluated using the area under the curve (AUC) and the DeLong method for AUC confidence intervals. The models trained on the most diverse datasets showed the highest external testing performance, with AUC values ranging from 0.73 to 0.76 for CT and 0.70 to 0.73 for CXR. Internal testing yielded higher AUC values (0.77 to 0.85 for CT and 0.77 to 1.0 for CXR), highlighting a substantial drop in performance during external validation, which underscores the importance of diverse and comprehensive training and testing data. Most notably, the VIT approach provided an objective assessment of the utility of diverse models and datasets while offering insight into the influence of dataset characteristics, patient factors, and imaging physics on AI efficacy. CONCLUSIONS: The VIT approach enhances model transparency and reliability, offering nuanced insights into the factors driving AI performance and bridging the gap between experimental and clinical settings.
Duke Scholars
Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- 4003 Biomedical engineering
- 3202 Clinical sciences
Citation
Published In
DOI
ISSN
Publication Date
Volume
Issue
Start / End Page
Location
Related Subject Headings
- 4003 Biomedical engineering
- 3202 Clinical sciences