Self-organizing map for cluster analysis of a breast cancer database.

Publication, Journal Article
Markey, MK; Lo, JY; Tourassi, GD; Floyd, CE
Published in: Artif Intell Med
February 2003

The purpose of this study was to identify and characterize clusters in a heterogeneous breast cancer computer-aided diagnosis database. Identification of subgroups within the database could help elucidate clinical trends and facilitate future model building. A self-organizing map (SOM) was used to identify clusters in a large (2258 cases), heterogeneous computer-aided diagnosis database based on mammographic findings (BI-RADS) and patient age. The resulting clusters were then characterized by their prototypes, determined using a constraint satisfaction neural network (CSNN). The clusters showed logical separation of clinical subtypes such as architectural distortions, masses, and calcifications. Moreover, the broad categories of masses and calcifications were stratified into several clusters (seven for masses and three for calcifications). The percentage of malignant cases differed notably among the clusters (ranging from 6 to 83%). A feed-forward back-propagation artificial neural network (BP-ANN) was used to identify likely benign lesions that may be candidates for follow-up rather than biopsy. The performance of the BP-ANN varied considerably across the clusters identified by the SOM. In particular, a cluster (#6) of mass cases (6% malignant) was identified that accounted for 79% of the recommendations for follow-up that would have been made by the BP-ANN. A classification rule based on the profile of cluster #6 performed comparably to the BP-ANN, providing approximately 25% specificity at 98% sensitivity. This performance was demonstrated to generalize to a large set of cases (2177) held out for model validation.
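
The abstract reports performance as specificity at a fixed sensitivity. The sketch below illustrates how such an operating point can be read off a set of classifier scores. It is not the authors' code: the function name `specificity_at_sensitivity`, the convention that a higher score means more suspicious, and the toy scores are all assumptions made for illustration.

```python
def specificity_at_sensitivity(scores, labels, target_sensitivity):
    """Find the highest threshold whose sensitivity meets the target.

    Assumed convention (not from the paper): a case with score >= threshold
    is sent to biopsy, otherwise it is recommended for follow-up.
    labels: 1 = malignant, 0 = benign.
    Returns (threshold, sensitivity, specificity).
    """
    malignant = [s for s, y in zip(scores, labels) if y == 1]
    benign = [s for s, y in zip(scores, labels) if y == 0]
    if not malignant or not benign:
        raise ValueError("need at least one malignant and one benign case")

    # Scan thresholds from strictest to most lenient; sensitivity can only
    # increase as the threshold drops, so the first hit is the highest one.
    for threshold in sorted(set(scores), reverse=True):
        true_pos = sum(1 for s in malignant if s >= threshold)
        sensitivity = true_pos / len(malignant)
        if sensitivity >= target_sensitivity:
            true_neg = sum(1 for s in benign if s < threshold)
            return threshold, sensitivity, true_neg / len(benign)

    raise ValueError("target sensitivity must be at most 1.0")


# Toy data, invented for illustration only.
scores = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
result = specificity_at_sensitivity(scores, labels, 1.0)
```

In this reading, specificity is the fraction of benign cases that would be spared biopsy while the required fraction of malignant cases is still caught.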

Published In

Artif Intell Med

DOI

10.1016/s0933-3657(03)00003-4

ISSN

0933-3657

Publication Date

February 2003

Volume

27

Issue

2

Start / End Page

113 / 127

Location

Netherlands

Related Subject Headings

  • Sensitivity and Specificity
  • Reproducibility of Results
  • ROC Curve
  • Neural Networks, Computer
  • Middle Aged
  • Medical Informatics
  • Humans
  • Diagnosis-Related Groups
  • Diagnosis, Computer-Assisted
  • Databases as Topic
 

Citation

APA

Markey, M. K., Lo, J. Y., Tourassi, G. D., & Floyd, C. E. (2003). Self-organizing map for cluster analysis of a breast cancer database. Artif Intell Med, 27(2), 113–127. https://doi.org/10.1016/s0933-3657(03)00003-4

Chicago

Markey, Mia K., Joseph Y. Lo, Georgia D. Tourassi, and Carey E. Floyd. “Self-organizing map for cluster analysis of a breast cancer database.” Artif Intell Med 27, no. 2 (February 2003): 113–27. https://doi.org/10.1016/s0933-3657(03)00003-4.

ICMJE

Markey MK, Lo JY, Tourassi GD, Floyd CE. Self-organizing map for cluster analysis of a breast cancer database. Artif Intell Med. 2003 Feb;27(2):113–27.

MLA

Markey, Mia K., et al. “Self-organizing map for cluster analysis of a breast cancer database.” Artif Intell Med, vol. 27, no. 2, Feb. 2003, pp. 113–27. Pubmed, doi:10.1016/s0933-3657(03)00003-4.

NLM

Markey MK, Lo JY, Tourassi GD, Floyd CE. Self-organizing map for cluster analysis of a breast cancer database. Artif Intell Med. 2003 Feb;27(2):113–127.