Skip to main content

Improving annotation efficiency for fully labeling a breast mass segmentation dataset.

Publication ,  Journal Article
Sharma, V; Barnett, AJ; Yang, J; Cheon, S; Kim, G; Regina Schwartz, F; Wang, A; Hall, N; Grimm, L; Chen, C; Lo, JY; Rudin, C
Published in: J Med Imaging (Bellingham)
May 2025

PURPOSE: Breast cancer remains a leading cause of death for women. Screening programs are deployed to detect cancer at early stages. One current barrier identified by breast imaging researchers is a shortage of labeled image datasets. Addressing this problem is crucial to improve early detection models. We present an active learning (AL) framework for segmenting breast masses from 2D digital mammography, and we publish labeled data. Our method aims to reduce the input needed from expert annotators to reach a fully labeled dataset. APPROACH: We create a dataset of 1136 mammographic masses with pixel-wise binary segmentation labels, with the test subset labeled independently by two different teams. With this dataset, we simulate a human annotator within an AL framework to develop and compare AI-assisted labeling methods, using a discriminator model and a simulated oracle to collect acceptable segmentation labels. A UNet model is retrained on these labels, generating new segmentations. We evaluate various oracle heuristics using the percentage of segmentations that the oracle relabels and measure the quality of the proposed labels by evaluating the intersection over union over a validation dataset. RESULTS: Our method reduces expert annotator input by 44%. We present a dataset of 1136 binary segmentation labels approved by board-certified radiologists and make the 143-image validation set public for comparison with other researchers' methods. CONCLUSIONS: We demonstrate that AL can significantly improve the efficiency and time-effectiveness of creating labeled mammogram datasets. Our framework facilitates the development of high-quality datasets while minimizing manual effort in the domain of digital mammography.

Duke Scholars

Published In

J Med Imaging (Bellingham)

DOI

ISSN

2329-4302

Publication Date

May 2025

Volume

12

Issue

3

Start / End Page

035501

Location

United States

Related Subject Headings

  • 4003 Biomedical engineering
  • 3202 Clinical sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Sharma, V., Barnett, A. J., Yang, J., Cheon, S., Kim, G., Regina Schwartz, F., … Rudin, C. (2025). Improving annotation efficiency for fully labeling a breast mass segmentation dataset. J Med Imaging (Bellingham), 12(3), 035501. https://doi.org/10.1117/1.JMI.12.3.035501
Sharma, Vaibhav, Alina Jade Barnett, Julia Yang, Sangwook Cheon, Giyoung Kim, Fides Regina Schwartz, Avivah Wang, et al. “Improving annotation efficiency for fully labeling a breast mass segmentation dataset.J Med Imaging (Bellingham) 12, no. 3 (May 2025): 035501. https://doi.org/10.1117/1.JMI.12.3.035501.
Sharma V, Barnett AJ, Yang J, Cheon S, Kim G, Regina Schwartz F, et al. Improving annotation efficiency for fully labeling a breast mass segmentation dataset. J Med Imaging (Bellingham). 2025 May;12(3):035501.
Sharma, Vaibhav, et al. “Improving annotation efficiency for fully labeling a breast mass segmentation dataset.J Med Imaging (Bellingham), vol. 12, no. 3, May 2025, p. 035501. Pubmed, doi:10.1117/1.JMI.12.3.035501.
Sharma V, Barnett AJ, Yang J, Cheon S, Kim G, Regina Schwartz F, Wang A, Hall N, Grimm L, Chen C, Lo JY, Rudin C. Improving annotation efficiency for fully labeling a breast mass segmentation dataset. J Med Imaging (Bellingham). 2025 May;12(3):035501.

Published In

J Med Imaging (Bellingham)

DOI

ISSN

2329-4302

Publication Date

May 2025

Volume

12

Issue

3

Start / End Page

035501

Location

United States

Related Subject Headings

  • 4003 Biomedical engineering
  • 3202 Clinical sciences