3D MRI brain tumor segmentation using autoencoder regularization
3D MRI brain tumor segmentation using autoencoder regularization
2025-11-22https://doi.org/10.1148/atlas.1763846507137
61
Overview
Schema Version
https://atlas.rsna.org/schemas/2025-11/model.json
Name
3D MRI brain tumor segmentation using autoencoder regularization
Link
https://arxiv.org/abs/1810.11654
Indexing
Keywords: 3D Magnetic Resonance Imaging, Brain Tumors, Semantic Segmentation Network, Encoder-Decoder Architecture, Variational Auto-Encoder, BraTS 2018 Challenge
Content: NR, OI, MR
Author(s)
Andriy Myronenko
Organization(s)
NVIDIA
Version
BraTS 2018 winner
Contact
amyronenko@nvidia.com
Comments
This approach won 1st place in the BraTS 2018 challenge for brain tumor segmentation.
Date
Published: 2018-11-19
Model
Architecture
Semantic segmentation network based on encoder-decoder architecture with an asymmetrically large encoder and a variational auto-encoder (VAE) branch for regularization. Uses ResNet blocks with Group Normalization.
Availability
The model won the BraTS 2018 challenge. Implementation in Tensorflow. Trained on NVIDIA Tesla V100 32GB GPU.
Clinical benefit
Automated segmentation of 3D brain tumors can save physicians time and provide an accurate reproducible solution for further tumor analysis and monitoring.
Clinical workflow phase
Diagnosis, monitoring, and treatment planning
Degree of automation
Fully automated
Indications for use
Segmentation of brain tumor subregions (whole tumor, tumor core, enhancing tumor) from multimodal 3D MRI scans for patients with primary or secondary brain tumors, including gliomas (low-grade and high-grade).
Input
Four channel 3D MRI modalities (T1, T1c, T2, FLAIR) rigidly aligned, resampled to 1x1x1 mm isotropic resolution, skull-stripped, and normalized to zero mean and unit std. Input image size 240x240x155, processed with random crop of 160x192x128.
Instructions
Input images must be 3D multimodal MRI (T1, T1c, T2, FLAIR), rigidly aligned, resampled to 1x1x1 mm isotropic resolution, skull-stripped, and normalized. Test time augmentation by mirror flipping input 3D image axes and averaging 8 flipped segmentation probability maps can be applied. Ensemble of 10 models can be used for improved performance.
Limitations
Performance may be affected by data outside the distribution of the BraTS 2018 dataset. Batch size of 1 was used due to GPU memory limits, which required a smaller crop size compared to some related works. More sophisticated data augmentation techniques (histogram matching, affine transforms, filtering) and post-processing (CRF) did not consistently improve performance.
Output
Description: Dense segmentation masks for three nested tumor subregions: whole tumor (WT), tumor core (TC), and enhancing tumor (ET).
Reproducibility
The VAE branch helped to consistently achieve good training accuracy for any random initialization.
Sustainability
Training a single model for 300 epochs takes 2 days on a single NVIDIA Tesla V100 32GB GPU. Inference time is 0.4 sec for a single model on a single V100 GPU.
Use
Intended: Brain tumor segmentation, Tumor subregion segmentation, Diagnosis, Monitoring, Treatment planning
User
Intended: Physician, Radiologist, Oncologist, Neurosurgeon, Researcher