Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning
2025-11-22https://doi.org/10.1148/atlas.1763833868486
31
Overview
Schema Version
https://atlas.rsna.org/schemas/2025-11/model.json
Name
Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning
Link
https://doi.org/10.1148/ryai.230342
Indexing
Keywords: Thorax, Diagnosis, Supervised contrastive learning, Convolutional Neural Network, Computer-aided Diagnosis, Fairness, Bias mitigation, Chest radiograph, COVID-19, ChestX-ray14, MIDRC
Content: CH, RS
Author(s)
Mingquan Lin
Tianhao Li
Zhaoyi Sun
Gregory Holste
Ying Ding
Fei Wang
George Shih
Yifan Peng
Organization(s)
Weill Cornell Medicine, Department of Population Health Sciences
Weill Cornell Medicine, Department of Radiology
University of Minnesota, Department of Surgery
The University of Texas at Austin, School of Information
The University of Texas at Austin, Department of Electrical and Computer Engineering
Version
1.0
License
Text: © 2024 by the Radiological Society of North America, Inc.
URL: https://pubs.rsna.org/doi/10.1148/ryai.230342
Contact
ude.llenroc.dem@2004piy
Funding
Supported by the National Library of Medicine (4R00LM013001), National Science Foundation CAREER (2145640), Intramural Research Program of the NIH, Amazon Research Award. MIDRC funded by NIBIB of the NIH under contract 75N920202D00021 and through ARPA-H.
Ethical review
Retrospective study with IRB approval at each clinical center and Weill Cornell Medicine; informed consent waived due to publicly available datasets (MIDRC and NIH ChestX-ray14).
Date
Published: 2024-08-21
Created: 2023-08-23
References
[1] Lin M, Li T, Sun Z, Holste G, Ding Y, Wang F, Shih G, Peng Y. "Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning". Radiology: Artificial Intelligence. 2024;6(5):e230342. 2024-08-21. doi:10.1148/ryai.230342. PMID: 39166973. PMCID: PMC11449211.
Model
Architecture
DenseNet-121 backbone (pretrained on CheXpert) with supervised contrastive learning (SCL) pretraining using a single-layer perceptron contrastive head (128-dim), followed by fine-tuning with a prediction head for binary classification.
Availability
Code available at https://github.com/bionlplab/CXRFairness
Clinical benefit
Mitigates algorithmic bias across sex, race, and age subgroups in automated chest radiograph diagnosis (COVID-19 and thorax abnormalities) while maintaining overall performance.
Clinical workflow phase
Clinical decision support research; method intended to improve fairness of image-based diagnosis and may be suitable for clinical practice per authors.
Degree of automation
Automated image classification model development and inference; provides decision support outputs for diagnosis tasks.
Indications for use
Research method for automated diagnosis of COVID-19 and thorax abnormalities from frontal chest radiographs, with explicit focus on reducing subgroup bias (sex, race, age). Intended for use on datasets similar to MIDRC and ChestX-ray14.
Input
Frontal chest radiographs (PA/AP) preprocessed from DICOM to JPEG; demographic subgroup labels (sex, race, age) are used during SCL pretraining to form positive/negative pairs; downstream prediction uses images.
Instructions
Pretrain with SCL: replace classifier with 128-dim contrastive head; define positives as same disease label from different subgroups and negatives as different label from same subgroup; temperature=0.05; Adam lr=1e-4; 10 epochs. Then replace with prediction head and fine-tune entire network for 1 epoch using binary cross-entropy. Image preprocessing: DICOM normalization [0,255], invert if needed (air white), histogram equalization, resize to 256×256, center crop 224×224, random rotation (0–10°) and random flipping. Evaluate fairness using ΔmAUC and other subgroup metrics.
Limitations
Calibration was not assessed; potential over/underconfidence not evaluated. Method addresses single sensitive attributes individually rather than multiple simultaneously; limited by small sample sizes in some intersectional groups. Datasets may not fully represent all patient populations; comorbidity history not available. Only DenseNet-121 backbone was used; other architectures not evaluated here. External validation limited; generalization to other modalities/diseases not established.
Output
CDEs: RDE397, RDE2638
Description: Binary classification probabilities/labels for disease presence (COVID-19 on MIDRC; thorax abnormality on ChestX-ray14). The proposed training reduces subgroup bias as measured by ΔmAUC.
Regulatory information
Comment: Research study; no regulatory clearance claimed.
Reproducibility
PyTorch implementation with public code; pretrained on CheXpert; explicit preprocessing and training hyperparameters provided; experiments run on Intel Core i9-9960X CPU and NVIDIA Quadro RTX 6000 GPU; 200 bootstrap samples used for CIs; official splits (ChestX-ray14) and patient-level split (MIDRC).
Use
Intended: Diagnosis
Out-of-scope: Other
Excluded: Other
User
Intended: Physician, Researcher
Out-of-scope: Layperson
Excluded: Other