Image analysis with deep learning to predict breast cancer grade, ER status, histologic subtype, and intrinsic subtype.

Published online

Journal Article

RNA-based, multi-gene molecular assays are available and widely used for patients with ER-positive/HER2-negative breast cancers. However, RNA-based genomic tests can be costly and are not available in many countries. Methods for inferring molecular subtype from histologic images may identify patients most likely to benefit from further genomic testing. To identify patients who could benefit from molecular testing based on H&E stained histologic images, we developed an image analysis approach using deep learning. A training set of 571 breast tumors was used to create image-based classifiers for tumor grade, ER status, PAM50 intrinsic subtype, histologic subtype, and risk of recurrence score (ROR-PT). The resulting classifiers were applied to an independent test set (n = 288), and accuracy, sensitivity, and specificity of each was assessed on the test set. Histologic image analysis with deep learning distinguished low-intermediate vs. high tumor grade (82% accuracy), ER status (84% accuracy), Basal-like vs. non-Basal-like (77% accuracy), Ductal vs. Lobular (94% accuracy), and high vs. low-medium ROR-PT score (75% accuracy). Sampling considerations in the training set minimized bias in the test set. Incorrect classification of ER status was significantly more common for Luminal B tumors. These data provide proof of principle that molecular marker status, including a critical clinical biomarker (i.e., ER status), can be predicted with accuracy >75% based on H&E features. Image-based methods could be promising for identifying patients with a greater need for further genomic testing, or in place of classically scored variables typically accomplished using human-based scoring.

Full Text

Duke Authors

Cited Authors

  • Couture, HD; Williams, LA; Geradts, J; Nyante, SJ; Butler, EN; Marron, JS; Perou, CM; Troester, MA; Niethammer, M

Published Date

  • 2018

Published In

Volume / Issue

  • 4 /

Start / End Page

  • 30 -

PubMed ID

  • 30182055

Pubmed Central ID

  • 30182055

International Standard Serial Number (ISSN)

  • 2374-4677

Digital Object Identifier (DOI)

  • 10.1038/s41523-018-0079-1


  • eng

Conference Location

  • United States