Applicability of segmentation-based stain accuracy metrics to unpaired datasets

Ascertain whether segmentation-based stain accuracy metrics that rely on pixel-level paired annotations between hematoxylin and eosin and immunohistochemistry images are applicable to unpaired datasets used for evaluating virtual immunohistochemistry.

Background

The proposed accuracy framework computes overlap between brown DAB masks from real and virtual IHC and thus requires pixel-aligned H&E–IHC pairs, which are resource-intensive to obtain.

Because many virtual staining models are trained and evaluated on unpaired data, the authors highlight uncertainty about using these segmentation-based metrics in that setting and note that direct stain-accuracy evaluation is infeasible without ground truth.

References

First, segmentation-based metrics depend on pixel-level paired annotations, and their applicability to unpaired datasets remains uncertain.

Building Trust in Virtual Immunohistochemistry: Automated Assessment of Image Quality  (2511.04615 - Kataria et al., 6 Nov 2025) in Limitations subsection of Discussion