Mask-RCNN and U-net Ensembled for Nuclei Segmentation

Published 29 Jan 2019 in cs.CV | (1901.10170v1)

Abstract: Nuclei segmentation is both an important and in some ways ideal task for modern computer vision methods, e.g. convolutional neural networks. While recent developments in theory and open-source software have made these tools easier to implement, expert knowledge is still required to choose the right model architecture and training setup. We compare two popular segmentation frameworks, U-Net and Mask-RCNN in the nuclei segmentation task and find that they have different strengths and failures. To get the best of both worlds, we develop an ensemble model to combine their predictions that can outperform both models by a significant margin and should be considered when aiming for best nuclei segmentation performance.

Abstract PDF Upgrade to Chat

Citations (178)

View on Semantic Scholar

Summary

The paper proposes an ensemble model combining Mask-RCNN and U-Net predictions using gradient boosting to leverage their complementary strengths for nuclei segmentation.
The ensemble model strategically combines U-Net's mask accuracy with Mask-RCNN's detection capabilities, outperforming individual models across diverse image modalities like histology.
This study demonstrates that ensemble models like the one tested can significantly enhance instance segmentation accuracy in biomedical imaging tasks.

Mask-RCNN and U-Net Ensembled for Nuclei Segmentation

The paper "Mask-RCNN and U-Net Ensembled for Nuclei Segmentation," authored by Aarno Oskar Vuola, Saad Ullah Akram, and Juho Kannala, investigates the application of convolutional neural networks (CNNs) for the automatic segmentation of nuclei in microscopy images. This task has profound implications in biomedical imaging, offering a potential solution to the labor-intensive, subjective, manual segmentation typically requiring expert-level knowledge. By considering two prominent frameworks for nuclei segmentation—U-Net and Mask-RCNN—the paper evaluates their relative strengths and weaknesses and proposes an ensemble approach to leverage their combined predictive capabilities.

Methodology

The U-Net architecture, renowned for its utility in medical image segmentation, employs a U-shaped structure with skip-connections to preserve multi-resolution features, and its ability to use a deep network backbone like ResNet facilitates the classification of complex features, particularly when initialized with pretrained models on large datasets such as ImageNet. A challenge with U-Net is precise nuclei instance delineation, tackled herein by incorporating a border prediction channel to refine segmentation masks.

Conversely, Mask-RCNN is specifically designed for instance segmentation tasks. Its method incorporates bounding box prediction and individual mask segmentation within these regions. While the network demonstrates a robust capability for detecting nuclei, segmentation accuracy tends to be less precise than U-Net.

The paper introduces an ensemble model combining U-Net and Mask-RCNN predictions using a gradient boosting method. Features such as eccentricity and solidity, encapsulated in structural properties, serve as inputs to predict intersection over union (IoU) values for the ensemble predictions, effectively allowing refinement at test time through non-max suppression.

Experimental Evaluation

The ensemble model was trained and evaluated across diverse datasets to ensure generalizability, comprising fluorescence and histological images, among other modalities. Models underwent cross-validation training and were assessed based on metrics like mAP, Dice coefficient, precision, recall, and segmentation errors, focusing on IoU thresholds to penalize both slight and severe segmentation inaccuracies.

Results Discussion

The ensemble approach outperformed individual models by strategically utilizing their distinct advantages. While U-Net demonstrated better mask accuracy, Mask-RCNN excelled in object detection and accuracy, particularly for solo nuclei. Ensemble predictions showed heightened efficacy, incorporating the most accurate predictions from both models and attaining superior results for challenging modalities such as histology images.

The study underscores the ensemble model's effectiveness across varying structural properties of nuclei, affirming its suitability for scenarios with widely differing size, eccentricity, and clustering. This approach proved pivotal where either U-Net or Mask-RCNN faltered, with the ensemble system compensating effectively.

Implications and Future Work

This study implies that ensemble models can deliver enhanced results for instance segmentation challenges in biomedical contexts, advocating exploration into their application in other similar domains. Mask-RCNN and U-Net's complementary strengths suggest the ensemble methodology could be broadly beneficial for improving segmentation accuracy in medical imaging tasks.

Future research should explore the ensemble model's adaptability across other complex segmentation applications, potentially integrating additional deep learning architectures to augment predictive performance further. The insights garnered could lead to profound advancements in automated imaging systems, optimizing their integration into analytical pipelines in laboratory environments.

Markdown Report Issue