Skip to main content
Journal cover image

LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays

Publication ,  Journal Article
Ramesh, DB; Sridhar, RI; Upadhyaya, P; Kamaleswaran, R
Published in: Multimedia Tools and Applications
January 1, 2025

Segmenting lung regions in ICU Chest X-rays (CXR’s) is vital for diagnosing lung-related disorders, but existing methods require extensive annotations or training on large datasets. We present LuGSAM, a novel framework that integrates text prompts with the Segment Anything Model (SAM) for segmentation tasks, enhancing precision and adaptability in clinical settings. Our approach combines Grounding DINO, a zero-shot object detector using textual prompts (e.g., "right lobe"), and Meta AI’s SAM. Grounding DINO generates bounding boxes based on word-level prompts. These bounding boxes serve as an input to SAM, to generate precise segmentation masks. To further improve accuracy, we propose an iterative bounding box adjustment algorithm that refines object detections through multiple iterations. The Vision Transformer huge (Vit-h) variant of SAM achieved the highest overlap score (IoU = 0.95) for right lung segmentation. Grounding DINO demonstrated high detection accuracy for prompts like “right lung” with a confidence score of 0.58. The Binarized Predicted IoU (BPIoU) metric showed significant improvements in segmentation quality, making this framework a promising tool for clinical applications.

Duke Scholars

Published In

Multimedia Tools and Applications

DOI

EISSN

1573-7721

ISSN

1380-7501

Publication Date

January 1, 2025

Related Subject Headings

  • Software Engineering
  • Artificial Intelligence & Image Processing
  • 4606 Distributed computing and systems software
  • 4605 Data management and data science
  • 4603 Computer vision and multimedia computation
  • 4009 Electronics, sensors and digital hardware
  • 0806 Information Systems
  • 0805 Distributed Computing
  • 0803 Computer Software
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Ramesh, D. B., Sridhar, R. I., Upadhyaya, P., & Kamaleswaran, R. (2025). LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays. Multimedia Tools and Applications. https://doi.org/10.1007/s11042-025-21094-5
Ramesh, D. B., R. I. Sridhar, P. Upadhyaya, and R. Kamaleswaran. “LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.” Multimedia Tools and Applications, January 1, 2025. https://doi.org/10.1007/s11042-025-21094-5.
Ramesh DB, Sridhar RI, Upadhyaya P, Kamaleswaran R. LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays. Multimedia Tools and Applications. 2025 Jan 1;
Ramesh, D. B., et al. “LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays.” Multimedia Tools and Applications, Jan. 2025. Scopus, doi:10.1007/s11042-025-21094-5.
Ramesh DB, Sridhar RI, Upadhyaya P, Kamaleswaran R. LuGSAM: a novel framework for integrating text prompts to Segment Anything Model (SAM) for segmentation tasks of ICU chest x-rays. Multimedia Tools and Applications. 2025 Jan 1;
Journal cover image

Published In

Multimedia Tools and Applications

DOI

EISSN

1573-7721

ISSN

1380-7501

Publication Date

January 1, 2025

Related Subject Headings

  • Software Engineering
  • Artificial Intelligence & Image Processing
  • 4606 Distributed computing and systems software
  • 4605 Data management and data science
  • 4603 Computer vision and multimedia computation
  • 4009 Electronics, sensors and digital hardware
  • 0806 Information Systems
  • 0805 Distributed Computing
  • 0803 Computer Software
  • 0801 Artificial Intelligence and Image Processing