Correlation of Automated in Vivo Image Quality With Radiologist's Performance in Abdomen Computed Tomography Across Conventional and Deep Learning Reconstructions.
OBJECTIVE: Image quality evaluation in radiology is most relevant when reflects radiologists' performance. This study assessed how image quality measurement in terms of in vivo-characterized detectability index () for low-contrast liver lesion assessment in CT is correlated with radiologists' performance across 2 different CT reconstructions. METHODS: Fifty-one contrast-enhanced abdominal studies for investigating colorectal liver metastases were prospectively performed using 2 radiation dose exposures and reconstructed with Filtered back projection (FBP) and deep learning image reconstruction (DL) algorithms for a total of 161 noncalcified hypoattenuating lesions for 3 lesion size (D) subsets (<6 mm, 6 to 10 mm, and >10 mm). Images were assessed by expert radiologists for hepatic lesion detection task and likelihood of malignancy across the 2 imaging conditions. All cases were also evaluated automatically in terms of in vivo as a metric of task-based performance, both using a conventional technique and a new formalism of an added frequency term in the internal noise component of to accommodate the nonlinearity of the DL reconstruction (adj). RESULTS: The study found conventionally defined d' well-reflective of radiologists' evaluation of FBP images but not well-aligned with that of DL images. The new formalism provided more consistent reflection of performance across reconstruction techniques. In particular, in the lesion group D <=6 mm, the difference between radiologists' accuracy in images acquired with DL and images acquired with FBP was -26%, and the related adj difference was -9%, whereas the was 34%. Analogously, for the lesion group 6 mm < D <=10 mm, the differences were -15%, -13%, and 29%, respectively. Lastly, for the lesion group D>10 mm, radiologists showed the same accuracy in both FPB and DL images, difference in adj was -11%, and difference in was 31%. CONCLUSION: The new formalism can robustly reflect CT systems clinical performance irrespective of reconstruction algorithm. The methodology can be more readily applied to assess the real-world performance of CT systems.
Duke Scholars
Published In
DOI
EISSN
Publication Date
Location
Related Subject Headings
- Nuclear Medicine & Medical Imaging
- 4603 Computer vision and multimedia computation
- 3202 Clinical sciences
- 1103 Clinical Sciences
Citation
Published In
DOI
EISSN
Publication Date
Location
Related Subject Headings
- Nuclear Medicine & Medical Imaging
- 4603 Computer vision and multimedia computation
- 3202 Clinical sciences
- 1103 Clinical Sciences