NYU Breast Cancer Screening Classifier (NYU1/NYU2) retrained on BreastScreen SA data
model2025-11-26https://doi.org/10.1148/atlas.1764159248497
51

Overview

Schema Version

https://atlas.rsna.org/schemas/2025-11/model.json

Name

NYU Breast Cancer Screening Classifier (NYU1/NYU2) retrained on BreastScreen SA data

Link

https://github.com/jamesjjcondon

Indexing

Keywords: Screening mammography, Breast cancer, Deep learning, Convolutional neural network, Transfer learning, Generalizability, External validation
Content: BR
RadLex: RID45682, RID4265, RID10357
SNOMED: 1162814007

Author(s)

James J. J. Condon
Vincent Trinh
Kelly A. Hall
Michelle Reintals
Andrew S. Holmes
Lauren Oakden-Rayner
Lyle J. Palmer

Organization(s)

Australian Institute for Machine Learning, University of Adelaide
School of Public Health, University of Adelaide
BreastScreen SA

Version

1.0

Contact

ua.ude.edialeda@remlap.elyl

Funding

Authors declared no funding for this work.

Ethical review

Approved by local institutional review board (HREC/16/229, R20160601), with a waiver of consent for the retrospective use of de-identified clinical data.

Date

Updated: 2024-07-01
Published: 2024-05-08
Created: 2023-09-16

References

[1] Condon JJJ, Trinh V, Hall KA, Reintals M, Holmes AS, Oakden-Rayner L, Palmer LJ. "Impact of Transfer Learning Using Local Data on Performance of a Deep Learning Model for Screening Mammography". Radiology: Artificial Intelligence. 2024 Jul;6(4):e230383. 2024-05-08. doi:10.1148/ryai.230383. PMID: 38717291. PMCID: PMC11294949.

Model

Architecture

Convolutional Neural Network. Two variants evaluated: NYU1 (image-only, four-view input) and NYU2 (image + patch-level benign/malignant heatmaps generated by a separate CNN). Network architecture unchanged from Wu et al.

Availability

Study code: https://github.com/jamesjjcondon (repository for transparency; relies on private data and not currently operational). Original NYU model/code: https://github.com/nyukat/breast_cancer_classifier

Clinical benefit

Intended to assist in classifying screening mammograms for malignancy to potentially support radiologist performance; in this study, evaluated for generalizability and impact of transfer learning.

Clinical workflow phase

Clinical decision support systems (research evaluation; not deployed clinically).

Decision threshold

For the best-performing retrained NYU2 model, operating point selected to achieve validation subset sensitivity above 90%.

Degree of automation

Decision support; not suitable as a stand-alone reader in the studied local population due to low specificity.

Indications for use

Binary classification of malignancy (invasive cancer or DCIS) versus no malignancy/benign on screening full-field digital mammography in adult women within a population screening program setting.

Input

Four-view screening mammograms (bilateral CC and MLO). For NYU2, additional inputs are benign and malignant patch-level heatmaps generated by a CNN trained on hand-drawn segmentations from the NYU dataset.

Instructions

Preprocess to match NYU pipeline; fixed input size required; right-sided images horizontally flipped for training/validation/testing; per-individual malignancy score taken as the maximum image-wise score across views; for transfer learning, initialize with NYU weights and retrain on local training subset.

Limitations

Out-of-the-box performance decreased on local Australian data; improved with transfer learning contingent on access to original model weights (which are not generally available). Heatmap-generating CNN was not retrained due to lack of local segmentations. Differences in vendor devices, image size, and image characteristics may affect generalizability. Interval cancer data not available; breast density not routinely collected. In the studied cohort, specificity remained below typical clinical standards.

Output

CDEs: RDE65, RDE1586
Description: Per-individual malignancy probability/score for presence of invasive breast cancer or ductal carcinoma in situ on screening mammography.

Recommendation

Models should be tested on and adapted with local data prior to clinical use; provision of complete model details and weights facilitates adaptation.

Reproducibility

All code used for this study is available at https://github.com/jamesjjcondon; repository depends on private BSSA data and is not currently operational for replication.

Use

Intended: Detection and diagnosis
Out-of-scope: Detection and diagnosis
Excluded: Other

User

Intended: Physician, Subspecialist diagnostic radiologist