Self-supervised Learning of Dense Hierarchical Representations for Medical Image Segmentation
Abstract: This paper demonstrates a self-supervised framework for learning voxel-wise coarse-to-fine representations tailored for dense downstream tasks. Our approach stems from the observation that existing methods for hierarchical representation learning tend to prioritize global features over local features due to inherent architectural bias. To address this challenge, we devise a training strategy that balances the contributions of features from multiple scales, ensuring that the learned representations capture both coarse and fine-grained details. Our strategy incorporates 3-fold improvements: (1) local data augmentations, (2) a hierarchically balanced architecture, and (3) a hybrid contrastive-restorative loss function. We evaluate our method on CT and MRI data and demonstrate that our new approach particularly beneficial for fine-tuning with limited annotated data and consistently outperforms the baseline counterpart in linear evaluation settings.
- “Masked autoencoders are scalable vision learners,” 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 15979–15988, 2021.
- “Masked image modeling advances 3d medical image analysis,” 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 1969–1979, 2022.
- “Models genesis: Generic autodidactic models for 3d medical image analysis,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2019.
- “A simple framework for contrastive learning of visual representations,” in International conference on machine learning, 2020.
- “Lesion-based contrastive learning for diabetic retinopathy grading from fundus images,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2021.
- “Dense contrastive learning for self-supervised visual pre-training,” 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3023–3032, 2020.
- “Anatomical invariance modeling and semantic alignment for self-supervised learning in 3d medical image analysis,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023, pp. 15859–15869.
- “Self-supervised pre-training of swin transformers for 3d medical image analysis,” 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 20698–20708, 2021.
- “Dira: Discriminative, restorative, and adversarial learning for self-supervised medical image analysis,” 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 20792–20802, 2022.
- “Sam: Self-supervised learning of pixel-wise anatomical embeddings in radiological images,” IEEE Transactions on Medical Imaging, vol. 41, pp. 2658–2669, 2020.
- “vox2vec: A framework for self-supervised contrastive learning of voxel-level representations in medical images,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2023.
- J. Zhang and K. Ma, “Rethinking the augmentation module in contrastive learning: Learning hierarchical augmentation invariance with expanded views,” 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 16629–16638, 2022.
- “Whole-body magnetic resonance imaging in the german national cohort (nako): Design & current status,” The European Journal of Public Health, vol. 32, 2022.
- “Amos: A large-scale abdominal multi-organ benchmark for versatile medical image segmentation,” Advances in Neural Information Processing Systems, vol. 35, pp. 36722–36732, 2022.
- “Fast and low-gpu-memory abdomen ct organ segmentation: the flare challenge,” Medical Image Analysis, vol. 82, pp. 102616, 2022.
- “Miccai multi-atlas labeling beyond the cranial vault–workshop and challenge,” in MICCAI Multi-Atlas Labeling Beyond Cranial Vault—Workshop Challenge, 2015.
- D.P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” CoRR, 2014.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.