PGAD: Prototype-Guided Adaptive Distillation for Multi-Modal Learning in AD Diagnosis

Published 5 Mar 2025 in eess.IV, cs.AI, and cs.CV | (2503.04836v2)

Abstract: Missing modalities pose a major issue in Alzheimer's Disease (AD) diagnosis, as many subjects lack full imaging data due to cost and clinical constraints. While multi-modal learning leverages complementary information, most existing methods train only on complete data, ignoring the large proportion of incomplete samples in real-world datasets like ADNI. This reduces the effective training set and limits the full use of valuable medical data. While some methods incorporate incomplete samples, they fail to effectively address inter-modal feature alignment and knowledge transfer challenges under high missing rates. To address this, we propose a Prototype-Guided Adaptive Distillation (PGAD) framework that directly incorporates incomplete multi-modal data into training. PGAD enhances missing modality representations through prototype matching and balances learning with a dynamic sampling strategy. We validate PGAD on the ADNI dataset with varying missing rates (20%, 50%, and 70%) and demonstrate that it significantly outperforms state-of-the-art approaches. Ablation studies confirm the effectiveness of prototype matching and adaptive sampling, highlighting the potential of our framework for robust and scalable AD diagnosis in real-world clinical settings.

Abstract PDF Upgrade to Chat

Summary

The paper introduces PGAD, a novel framework using prototype matching and adaptive sampling to boost AD diagnostic accuracy under missing modality conditions.
It integrates Prototype Consistency Matching (PCM) and Adaptive Multi-Modal Sampling (AMS) to align features from MRI and PET data, ensuring robust multi-modal learning.
Experimental results on ADNI demonstrate superior performance with a mean MCC of 85.1 and AUC of 96.3%, highlighting its clinical applicability.

Introduction

The challenges of diagnosing Alzheimer's Disease (AD) are compounded by missing data modalities, particularly in multi-modal neuroimaging. PET imaging, despite its advantages, is often missing due to high costs, yielding incomplete datasets. The paper introduces the Prototype-Guided Adaptive Distillation (PGAD) framework, which innovatively tackles missing modality issues in AD diagnosis by leveraging prototype matching and adaptive sampling strategies. PGAD is validated on the ADNI dataset under varying missing rates, revealing its superior performance over existing techniques.

Methodology

The PGAD framework is designed to integrate incomplete multi-modal data efficiently, improving AD diagnosis accuracy even under significant data incompleteness. It comprises two core components:

Prototype Consistency Matching (PCM): This mechanism aligns feature representations by learning class-level prototypes from complete multi-modal datasets and enforcing feature consistency across modalities. The PCM ensures that unpaired MRI samples maintain modality-specific characteristics by aligning them with their respective category prototypes.
Adaptive Multi-Modal Sampling (AMS): AMS dynamically manages data balance by regulating the mix of paired and unpaired samples during training, enhancing cross-modal learning. The method adeptly adapts sampling strategies to optimize feature convergence and ensure robust feature alignment.
Figure 1: Overview of the PGAD framework. The teacher network utilizes both MRI and PET data to extract joint feature representations and construct category prototypes.

Experimental Results

Performance Evaluation

The PGAD framework's efficacy was evaluated on the ADNI dataset across different modality missing rates (20%, 50%, 70%). Its performance was benchmarked against state-of-the-art methods, exhibiting superior outcomes in AD classification and MCI conversion prediction. Notably, PGAD achieved a mean MCC of 85.1 and an AUC of 96.3% for AD vs. CN classification, outperforming alternatives such as DFTD and MDT-Student in sensitivity and specificity, crucial for clinical diagnosis.

Ablation Studies

A comprehensive ablation study delineated the contributions of PGAD's individual components. Removing PCM or AMS resulted in significant performance degradation, underscoring their role in maintaining feature alignment and model robustness. The studies confirmed that PCM effectively preserves class-discriminative features, while AMS ensures stable learning through optimal data balance in training.

Visualization Analysis

t-SNE and Grad-CAM visualizations were employed to qualitatively validate PGAD's feature learning and interpretability. t-SNE plots highlighted substantial improvements in feature clustering and separation with PGAD compared to baseline configurations (Figure 2). Grad-CAM visualizations revealed the model's focus on clinically relevant regions like the hippocampus, validating its biological plausibility and potential for trustworthy diagnostic support.

Figure 2: t-SNE visualization of MRI feature embeddings under different training configurations (50% missing modality).

Figure 3: Grad-CAM visualization of the model's attention regions on MRI slices. The red-highlighted areas indicate regions most relevant to AD classification in axial, coronal, and sagittal views.

Conclusion

The PGAD framework represents a significant advancement in multi-modal learning for AD diagnosis under missing data conditions. By integrating novel prototype-guided learning and adaptive sampling strategies, PGAD effectively mitigates the challenges of incomplete data, enhancing diagnostic accuracy and model robustness. This study paves the way for future research in applying prototype-guided adaptive distillation to more complex, multi-center clinical datasets. The framework's capability to operate under high missing modality rates highlights its practical applicability and potential for widespread adoption in real-world clinical settings.