- The paper presents a novel synthesis method that uses GANs and gated convolutions to generate realistic lung nodule shapes, sizes, and textures.
- It incorporates precise size modulation and a hard example mining strategy to augment training data for improved CAD detection accuracy.
- Experimental results using metrics like PSNR, SSIM, and FID demonstrate significant performance improvements over existing state-of-the-art methods.
Image Synthesis with Disentangled Attributes for Chest X-Ray Nodule Augmentation and Detection
Introduction
The paper introduces a novel lung nodule synthesis framework to aid Computer-Assisted Diagnosis (CAD) systems in effectively detecting lung nodules from chest X-ray (CXR) images. Given the constrained availability of high-quality annotated datasets, which are critical for robust CAD system training, the synthesis framework aims to boost data augmentation by creating realistic lung nodules with controllable attributes such as shape, size, and texture. The proposed method employs Generative Adversarial Networks (GANs) for shape generation and leverages gated convolutions for texture synthesis.
Lung Nodule Synthesis Framework
The framework consists of three primary components: Shape Generation, Size Modulation, and Texture Synthesis.
Shape Generation
The Shape Generator uses a DCGAN architecture to produce diverse shape masks, crucial for modeling realistic lung nodule contours. It starts with sampling a latent vector from a Gaussian distribution and employs transposed convolutions to map this vector into a 128×128 binary mask.
Figure 1: The detailed structure of the proposed Shape Generator in the Shape Generation step.
Size Modulation
Size modulation is introduced to alter the nodule size precisely at pixel-level granularity. This adjustment uses morphological operations based on image scaling to modulate the diameter of a nodule shape, ensuring quantitative control over nodule dimensions.
Texture Synthesis
The Texture Generator synthesizes visually plausible nodule textures conditioned by the modulated shape masks. Leveraging gated convolutional layers, the generator utilizes the contextual information of surrounding anatomical structures to render realistic nodules. The generator's architecture follows a coarse-to-fine method where a coarse generator first synthesizes generalized textures, refined subsequently by a refinement generator to enhance image fidelity.
Figure 2: The architecture of the proposed Texture Generator in the Texture Synthesis step.
Hard Example Mining (HEM) Strategy
To further enhance nodule detection performance, the paper proposes an HEM-based augmentation strategy. This method synthesizes lung nodules by focusing on attributes that are often missed by existing detectors, thereby training CAD systems with targeted data that improve sensitivity and accuracy.
Figure 3: The proposed HEM-based data augmentation strategy consists of four steps to enhance detection performance.
Experimental Results
The evaluation showcases the framework's superiority in generating high-quality and diverse synthetic lung nodules. Quantitative metrics such as PSNR, SSIM, and FID demonstrate substantial improvements over existing methods like EdgeConnect and StructureFlow. Qualitative comparisons further reinforce the visually realistic outputs achieved by the proposed synthesis method.
Figure 4: Qualitative comparison between state-of-the-art methods and the proposed method for different nodule patch cases.
Conclusion
The proposed lung nodule synthesis framework offers significant advancements in image quality and controlling nodule attributes for effective data augmentation. It also introduces a unique HEM strategy that efficiently improves lung nodule detection capabilities in CAD systems. Future work may explore extending the application of the synthesis framework to other fields within medical imaging and further refine the control over nodule attributes such as texture and appearance. Potential applications in virtual reality training and counterfactual explanation for CAD systems present promising directions for ongoing research.