Scholars@Duke publication: LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.

LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.

Publication , Journal Article

Ramesh, DB; Sridhar, RI; Upadhyaya, P; Kamaleswaran, R

Published in: Multimed Tools Appl

2025

Segmenting lung regions in ICU Chest X-rays (CXR's) is vital for diagnosing lung-related disorders, but existing methods require extensive annotations or training on large datasets. We present LuGSAM, a novel framework that integrates text prompts with the Segment Anything Model (SAM) for segmentation tasks, enhancing precision and adaptability in clinical settings. Our approach combines Grounding DINO, a zero-shot object detector using textual prompts (e.g., "right lobe"), and Meta AI's SAM. Grounding DINO generates bounding boxes based on word-level prompts. These bounding boxes serve as an input to SAM, to generate precise segmentation masks. To further improve accuracy, we propose an iterative bounding box adjustment algorithm that refines object detections through multiple iterations. The Vision Transformer huge (Vit-h) variant of SAM achieved the highest overlap score (IoU = 0.95) for right lung segmentation. Grounding DINO demonstrated high detection accuracy for prompts like "right lung" with a confidence score of 0.58. The Binarized Predicted IoU (BPIoU) metric showed significant improvements in segmentation quality, making this framework a promising tool for clinical applications.

Duke Scholars

Author Rishi Kamaleswaran Trauma, Acute, and Critical Care Surgery

Published In

Multimed Tools Appl

DOI

10.1007/s11042-025-21094-5

ISSN

1380-7501

Publication Date

2025

Volume

Issue

Start / End Page

50119 / 50149

Location

United States

Related Subject Headings

Software Engineering
Artificial Intelligence & Image Processing
4606 Distributed computing and systems software
4605 Data management and data science
4603 Computer vision and multimedia computation
4009 Electronics, sensors and digital hardware
0806 Information Systems
0805 Distributed Computing
0803 Computer Software
0801 Artificial Intelligence and Image Processing

Citation

APA

Chicago

ICMJE

MLA

NLM

Ramesh, D. B., Sridhar, R. I., Upadhyaya, P., & Kamaleswaran, R. (2025). LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays. Multimed Tools Appl, 84(42), 50119–50149. https://doi.org/10.1007/s11042-025-21094-5

Ramesh, Dhanush Babu, Rishika Iytha Sridhar, Pulakesh Upadhyaya, and Rishikesan Kamaleswaran. “LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.” Multimed Tools Appl 84, no. 42 (2025): 50119–49. https://doi.org/10.1007/s11042-025-21094-5.

Ramesh DB, Sridhar RI, Upadhyaya P, Kamaleswaran R. LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays. Multimed Tools Appl. 2025;84(42):50119–49.

Ramesh, Dhanush Babu, et al. “LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.” Multimed Tools Appl, vol. 84, no. 42, 2025, pp. 50119–49. Pubmed, doi:10.1007/s11042-025-21094-5.

Published In

Multimed Tools Appl

DOI

10.1007/s11042-025-21094-5

ISSN

1380-7501

Publication Date

2025

Volume

Issue

Start / End Page

50119 / 50149

Location

United States

Related Subject Headings

Software Engineering
Artificial Intelligence & Image Processing
4606 Distributed computing and systems software
4605 Data management and data science
4603 Computer vision and multimedia computation
4009 Electronics, sensors and digital hardware
0806 Information Systems
0805 Distributed Computing
0803 Computer Software
0801 Artificial Intelligence and Image Processing