NYU Breast Cancer Screening Classifier (NYU1/NYU2) retrained on BreastScreen SA data
2025-11-26https://doi.org/10.1148/atlas.1764159248497
51
Overview
Schema Version
https://atlas.rsna.org/schemas/2025-11/model.json
Name
NYU Breast Cancer Screening Classifier (NYU1/NYU2) retrained on BreastScreen SA data
Link
https://github.com/jamesjjcondon
Indexing
Keywords: Screening mammography, Breast cancer, Deep learning, Convolutional neural network, Transfer learning, Generalizability, External validation
Content: BR
RadLex: RID45682, RID4265, RID10357
SNOMED: 1162814007
Author(s)
James J. J. Condon
Vincent Trinh
Kelly A. Hall
Michelle Reintals
Andrew S. Holmes
Lauren Oakden-Rayner
Lyle J. Palmer
Organization(s)
Australian Institute for Machine Learning, University of Adelaide
School of Public Health, University of Adelaide
BreastScreen SA
Version
1.0
Contact
ua.ude.edialeda@remlap.elyl
Funding
Authors declared no funding for this work.
Ethical review
Approved by local institutional review board (HREC/16/229, R20160601), with a waiver of consent for the retrospective use of de-identified clinical data.
Date
Updated: 2024-07-01
Published: 2024-05-08
Created: 2023-09-16
References
[1] Condon JJJ, Trinh V, Hall KA, Reintals M, Holmes AS, Oakden-Rayner L, Palmer LJ. "Impact of Transfer Learning Using Local Data on Performance of a Deep Learning Model for Screening Mammography". Radiology: Artificial Intelligence. 2024 Jul;6(4):e230383. 2024-05-08. doi:10.1148/ryai.230383. PMID: 38717291. PMCID: PMC11294949.
Model
Architecture
Convolutional Neural Network. Two variants evaluated: NYU1 (image-only, four-view input) and NYU2 (image + patch-level benign/malignant heatmaps generated by a separate CNN). Network architecture unchanged from Wu et al.
Availability
Study code: https://github.com/jamesjjcondon (repository for transparency; relies on private data and not currently operational). Original NYU model/code: https://github.com/nyukat/breast_cancer_classifier
Clinical benefit
Intended to assist in classifying screening mammograms for malignancy to potentially support radiologist performance; in this study, evaluated for generalizability and impact of transfer learning.
Clinical workflow phase
Clinical decision support systems (research evaluation; not deployed clinically).
Decision threshold
For the best-performing retrained NYU2 model, operating point selected to achieve validation subset sensitivity above 90%.
Degree of automation
Decision support; not suitable as a stand-alone reader in the studied local population due to low specificity.
Indications for use
Binary classification of malignancy (invasive cancer or DCIS) versus no malignancy/benign on screening full-field digital mammography in adult women within a population screening program setting.
Input
Four-view screening mammograms (bilateral CC and MLO). For NYU2, additional inputs are benign and malignant patch-level heatmaps generated by a CNN trained on hand-drawn segmentations from the NYU dataset.
Instructions
Preprocess to match NYU pipeline; fixed input size required; right-sided images horizontally flipped for training/validation/testing; per-individual malignancy score taken as the maximum image-wise score across views; for transfer learning, initialize with NYU weights and retrain on local training subset.
Limitations
Out-of-the-box performance decreased on local Australian data; improved with transfer learning contingent on access to original model weights (which are not generally available). Heatmap-generating CNN was not retrained due to lack of local segmentations. Differences in vendor devices, image size, and image characteristics may affect generalizability. Interval cancer data not available; breast density not routinely collected. In the studied cohort, specificity remained below typical clinical standards.
Output
CDEs: RDE65, RDE1586
Description: Per-individual malignancy probability/score for presence of invasive breast cancer or ductal carcinoma in situ on screening mammography.
Recommendation
Models should be tested on and adapted with local data prior to clinical use; provision of complete model details and weights facilitates adaptation.
Reproducibility
All code used for this study is available at https://github.com/jamesjjcondon; repository depends on private BSSA data and is not currently operational for replication.
Use
Intended: Detection and diagnosis
Out-of-scope: Detection and diagnosis
Excluded: Other
User
Intended: Physician, Subspecialist diagnostic radiologist