ProstAttention-Net: A deep attention model for prostate cancer segmentation by aggressiveness in MRI scans

Published 23 Nov 2022 in eess.IV, cs.LG, and eess.SP | (2211.13238v1)

Abstract: Multiparametric magnetic resonance imaging (mp-MRI) has shown excellent results in the detection of prostate cancer (PCa). However, characterizing prostate lesions aggressiveness in mp-MRI sequences is impossible in clinical practice, and biopsy remains the reference to determine the Gleason score (GS). In this work, we propose a novel end-to-end multi-class network that jointly segments the prostate gland and cancer lesions with GS group grading. After encoding the information on a latent space, the network is separated in two branches: 1) the first branch performs prostate segmentation 2) the second branch uses this zonal prior as an attention gate for the detection and grading of prostate lesions. The model was trained and validated with a 5-fold cross-validation on an heterogeneous series of 219 MRI exams acquired on three different scanners prior prostatectomy. In the free-response receiver operating characteristics (FROC) analysis for clinically significant lesions (defined as GS > 6) detection, our model achieves 69.0% $\pm$14.5% sensitivity at 2.9 false positive per patient on the whole prostate and 70.8% $\pm$14.4% sensitivity at 1.5 false positive when considering the peripheral zone (PZ) only. Regarding the automatic GS group

Abstract PDF Upgrade to Chat

Citations (65)

View on Semantic Scholar

Summary

The paper proposes ProstAttention-Net, a deep learning model that integrates a dual-branch CNN with attention mechanisms to segment prostate glands and classify lesions by Gleason score.
It employs combined cross-entropy and Dice losses with multisource MRI data, achieving sensitivities around 69% and improved performance in the peripheral zone.
The enhanced segmentation and lesion grading outperform traditional models, indicating strong potential for clinical applications and active surveillance improvements.

Summary of "ProstAttention-Net: A deep attention model for prostate cancer segmentation by aggressiveness in MRI scans"

Introduction

Prostate cancer (PCa) remains one of the most prevalent cancers among men worldwide. Although multiparametric MRI (mp-MRI) has proven effective in detecting PCa, it is challenging to characterize the aggressiveness of prostate lesions using mp-MRI, necessitating biopsies for Gleason scoring. This paper introduces ProstAttention-Net, a deep learning model that segments prostate glands and lesions, grading them by Gleason score. Utilizing an end-to-end multi-class approach, the model incorporates an attention mechanism to integrate anatomical priors, enhancing lesion detection precision.

Methodology

ProstAttention-Net employs a convolutional neural network architecture inspired by U-Net, augmented with attention mechanisms. The network is designed with dual branches: one for prostate segmentation and the other for lesion detection. The segmented prostate gland serves as an attention guide for lesion classification. The model's training involves a heterogeneous series of 219 MRI exams, including data from multiple scanners, enhancing its generalization potential.

Figure 1: The ProstAttention-Net architecture, illustrating the dual-branch configuration for prostate and lesion segmentation.

The global loss function used combines cross-entropy and Dice losses, weighted to balance class-specific imbalances. This strategic loss formulation aims to optimize both prostate and lesion segmentation accuracy. Hyperparameters are fine-tuned using randomized grid search, and the final model undergoes a validation process with 5-fold cross-validation.

Results

ProstAttention-Net achieved a sensitivity of 69.0% ± 14.5% at 2.9 false positives per patient for clinically significant lesions (GS > 6) detection. When examining the peripheral zone exclusively, the sensitivity improved to 70.8% ± 14.4% at 1.5 false positives per patient. The model demonstrated superior lesion-wise Cohen's kappa coefficients ( $\kappa = 0.418 \pm 0.138$ ), outperforming existing models for automatic Gleason score grading.

Figure 2: FROC curve demonstrating sensitivity improvements with the inclusion of attention mechanisms.

A comparative analysis against state-of-the-art models, such as U-Net, DeepLabv3+, and Attention U-Net, revealed that ProstAttention-Net offers improved segmentation performance. Notably, the implementation of attention mechanisms is shown to outperform traditional methods, enhancing lesion detection and Gleason score characterization.

Figure 3: FROC analysis indicating detection sensitivity variations across different prostate zones.

Discussion

The integration of attention mechanisms into the segmentation process offers substantial benefits, particularly in improving lesion grading accuracy. Multisource training data enabled the model to generalize across varied MRI data inputs, although domain adaptation remained a challenge, as evidenced by performance variations on the PROSTATEx-2 dataset.

Figure 4: Impact of multisource learning on the FROC sensitivity curves, highlighting generalization capabilities.

These findings validate the model's robustness, although further training data and domain-specific adaptation could enhance performance further. Future research directions may involve integrating additional MRI modalities and exploring weakly-supervised learning approaches to leverage less richly annotated datasets.

Conclusion

ProstAttention-Net establishes itself as a promising tool for PCa segmentation and grading, outperforming traditional architectures in detecting and characterizing PCa. It leverages attention mechanisms to improve anatomical segmentation accuracy, offering novel insights into CAD systems' potential in clinical radiology. The results suggest substantial future utility in clinical settings, particularly in refining active surveillance strategies for lower-grade lesions.

This study marks a crucial step forward in computer-aided diagnostic technology, reflecting the evolving capabilities of deep learning applications in medical imaging.