Examination-Level Supervision for Deep Learning–based Intracranial Hemorrhage Detection on Head CT Scans
2025-11-30https://doi.org/10.1148/atlas.1764531489869
271
Overview
Schema Version
https://atlas.rsna.org/schemas/2025-11/model.json
Name
Examination-Level Supervision for Deep Learning–based Intracranial Hemorrhage Detection on Head CT Scans
Link
https://pubmed.ncbi.nlm.nih.gov/38294324/
Indexing
Keywords: intracranial hemorrhage, head CT, weak supervision, multiple instance learning, attention-based MIL, examination-level labels, strong supervision, Grad-CAM, h-Shap, triage
Content: CT, NR
RadLex: RID12362, RID4706
SNOMED: 1386000
Author(s)
Jacopo Teneggi
Paul H. Yi
Jeremias Sulam
Organization(s)
Johns Hopkins University
University of Maryland Medical Intelligent Imaging Center (UM2ii), University of Maryland School of Medicine
Version
1.0
Contact
Corresponding author: Jacopo Teneggi (email: ude.uhj@1ggenetj)
Funding
National Science Foundation CAREER Award Computing and Communication Foundations 2239787.
Ethical review
Retrospective study using public data; acknowledged as nonhuman subjects research by the University of Maryland Baltimore institutional review board.
Date
Published: 2023-12-20
References
[1] Teneggi J; Yi PH; Sulam J. "Examination-Level Supervision for Deep Learning–based Intracranial Hemorrhage Detection on Head CT Scans". Radiology: Artificial Intelligence. 2024 Jan;6(1):e230159. 2023-12-20. doi:10.1148/ryai.230159. PMID: 38294324. PMCID: PMC10831525.
Model
Architecture
Attention-based Deep Multiple Instance Learning model with a shared convolutional neural network feature extractor (ResNet-18) for 2D slices and a fully connected sigmoid classifier; strong learner operates at image level; weak learner aggregates bag-level (exam-level) predictions via attention.
Availability
Authors state: All code will be released. (No URL provided in article)
Clinical benefit
Automated detection and triage of intracranial hemorrhage on head CT; maintains or improves performance using examination-level labels, potentially reducing radiologist annotation burden by ~35x.
Clinical workflow phase
Clinical decision support and triage (flagging exams/images with suspected ICH); workflow optimization by prioritizing studies and localizing suspicious sections.
Decision threshold
Not explicitly specified; AUC used for evaluation. Section-level selection for WL based on attention weights or Shapley values thresholding.
Degree of automation
Decision support; provides automated exam-level classification and suggested section/pixel-level localizations via saliency maps to support radiologist review.
Indications for use
Detection of intracranial hemorrhage in patients undergoing unenhanced head CT examinations in acute care settings; intended for analysis of CT volumes of arbitrary number of sections.
Input
Unenhanced head CT examinations treated as bags of 2D images (axial sections); images windowed with brain settings (WW=80, WL=40) and min–max normalized.
Instructions
Train with focal loss to address class imbalance; use exam-level labels for weak learner or image-level labels for strong learner; for WL section-level localization, select images by attention weights or Shapley values (h-Shap). Data split 80/20 at exam level on RSNA 2019 dataset; externally test on CQ500 and CT-ICH.
Limitations
Study limited to ICH detection on head CT; some image-level annotations still required for section/pixel-level evaluation; no pixel-level training (no segmentation labels in RSNA); generalization assessed on two external datasets only; other architectures (e.g., vision transformers) not evaluated.
Output
CDEs: RDE1444, RDE1758, RDE1775
Description: Exam-level binary prediction of ICH presence; optional section-level selection of positive image sequences via attention/Shapley importance; pixel-level saliency maps (Grad-CAM or h-Shap) highlighting suspected hemorrhage regions within positive images.
Recommendation
For large datasets (m ≥ ~5,000–10,000 labeled exams), use examination-level weak supervision (MIL) to achieve strong exam-level and section-level performance with substantially fewer labels; accompany outputs with saliency maps to aid review.
Reproducibility
Training/validation on public RSNA 2019 dataset with random 80/20 split at exam level; external validation on public CQ500 and CT-ICH datasets; preprocessing (brain windowing and normalization) described; hyperparameters and augmentation in Appendix S1; code to be released.
Use
Intended: Report generation, Detection, Triage
Out-of-scope: Image segmentation, Treatment
Excluded: Diagnosis
User
Intended: Radiologist, Subspecialist diagnostic radiologist, Researcher
Out-of-scope: Layperson
Excluded: Layperson