ViM-UNet: Vision Mamba for Biomedical Segmentation

Published 11 Apr 2024 in cs.CV | (2404.07705v2)

Abstract: CNNs, most notably the UNet, are the default architecture for biomedical segmentation. Transformer-based approaches, such as UNETR, have been proposed to replace them; they benefit from a global field of view but suffer from longer runtimes and higher parameter counts. The recent Vision Mamba architecture offers a compelling alternative to transformers, also providing a global field of view, but with higher efficiency. Here, we introduce ViM-UNet, a novel segmentation architecture based on Vision Mamba, and compare it to UNet and UNETR on two challenging microscopy instance segmentation tasks. We find that it performs similarly to or better than UNet, depending on the task, and outperforms UNETR while being more efficient. Our code is open source and documented at https://github.com/constantinpape/torch-em/blob/main/vimunet.md.
