Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning
model2025-11-22https://doi.org/10.1148/atlas.1763833868486
31

Overview

Schema Version

https://atlas.rsna.org/schemas/2025-11/model.json

Name

Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning

Link

https://doi.org/10.1148/ryai.230342

Indexing

Keywords: Thorax, Diagnosis, Supervised contrastive learning, Convolutional Neural Network, Computer-aided Diagnosis, Fairness, Bias mitigation, Chest radiograph, COVID-19, ChestX-ray14, MIDRC
Content: CH, RS

Author(s)

Mingquan Lin
Tianhao Li
Zhaoyi Sun
Gregory Holste
Ying Ding
Fei Wang
George Shih
Yifan Peng

Organization(s)

Weill Cornell Medicine, Department of Population Health Sciences
Weill Cornell Medicine, Department of Radiology
University of Minnesota, Department of Surgery
The University of Texas at Austin, School of Information
The University of Texas at Austin, Department of Electrical and Computer Engineering

Version

1.0

License

Text: © 2024 by the Radiological Society of North America, Inc.
URL: https://pubs.rsna.org/doi/10.1148/ryai.230342

Contact

ude.llenroc.dem@2004piy

Funding

Supported by the National Library of Medicine (4R00LM013001), National Science Foundation CAREER (2145640), Intramural Research Program of the NIH, Amazon Research Award. MIDRC funded by NIBIB of the NIH under contract 75N920202D00021 and through ARPA-H.

Ethical review

Retrospective study with IRB approval at each clinical center and Weill Cornell Medicine; informed consent waived due to publicly available datasets (MIDRC and NIH ChestX-ray14).

Date

Published: 2024-08-21
Created: 2023-08-23

References

[1] Lin M, Li T, Sun Z, Holste G, Ding Y, Wang F, Shih G, Peng Y. "Improving Fairness of Automated Chest Radiograph Diagnosis by Contrastive Learning". Radiology: Artificial Intelligence. 2024;6(5):e230342. 2024-08-21. doi:10.1148/ryai.230342. PMID: 39166973. PMCID: PMC11449211.

Model

Architecture

DenseNet-121 backbone (pretrained on CheXpert) with supervised contrastive learning (SCL) pretraining using a single-layer perceptron contrastive head (128-dim), followed by fine-tuning with a prediction head for binary classification.

Availability

Code available at https://github.com/bionlplab/CXRFairness

Clinical benefit

Mitigates algorithmic bias across sex, race, and age subgroups in automated chest radiograph diagnosis (COVID-19 and thorax abnormalities) while maintaining overall performance.

Clinical workflow phase

Clinical decision support research; method intended to improve fairness of image-based diagnosis and may be suitable for clinical practice per authors.

Degree of automation

Automated image classification model development and inference; provides decision support outputs for diagnosis tasks.

Indications for use

Research method for automated diagnosis of COVID-19 and thorax abnormalities from frontal chest radiographs, with explicit focus on reducing subgroup bias (sex, race, age). Intended for use on datasets similar to MIDRC and ChestX-ray14.

Input

Frontal chest radiographs (PA/AP) preprocessed from DICOM to JPEG; demographic subgroup labels (sex, race, age) are used during SCL pretraining to form positive/negative pairs; downstream prediction uses images.

Instructions

Pretrain with SCL: replace classifier with 128-dim contrastive head; define positives as same disease label from different subgroups and negatives as different label from same subgroup; temperature=0.05; Adam lr=1e-4; 10 epochs. Then replace with prediction head and fine-tune entire network for 1 epoch using binary cross-entropy. Image preprocessing: DICOM normalization [0,255], invert if needed (air white), histogram equalization, resize to 256×256, center crop 224×224, random rotation (0–10°) and random flipping. Evaluate fairness using ΔmAUC and other subgroup metrics.

Limitations

Calibration was not assessed; potential over/underconfidence not evaluated. Method addresses single sensitive attributes individually rather than multiple simultaneously; limited by small sample sizes in some intersectional groups. Datasets may not fully represent all patient populations; comorbidity history not available. Only DenseNet-121 backbone was used; other architectures not evaluated here. External validation limited; generalization to other modalities/diseases not established.

Output

CDEs: RDE397, RDE2638
Description: Binary classification probabilities/labels for disease presence (COVID-19 on MIDRC; thorax abnormality on ChestX-ray14). The proposed training reduces subgroup bias as measured by ΔmAUC.

Regulatory information

Comment: Research study; no regulatory clearance claimed.

Reproducibility

PyTorch implementation with public code; pretrained on CheXpert; explicit preprocessing and training hyperparameters provided; experiments run on Intel Core i9-9960X CPU and NVIDIA Quadro RTX 6000 GPU; 200 bootstrap samples used for CIs; official splits (ChestX-ray14) and patient-level split (MIDRC).

Use

Intended: Diagnosis
Out-of-scope: Other
Excluded: Other

User

Intended: Physician, Researcher
Out-of-scope: Layperson
Excluded: Other