Modeling arbitrary conditional distributions in contrastive SSL with simple similarity functions
Determine how to flexibly model an arbitrary conditional distribution p(x^+ | x) for positive pairs in contrastive self-supervised learning while maintaining a simple similarity function (for example, a dot product on normalized embeddings) that supports efficient feature extraction.
References
Nevertheless, it remains unclear how to flexibly model an arbitrary conditional distribution p(+ \mid ) while keeping the similarity function simple enough to allow efficient feature extraction.
— Self-Supervised Learning from Structural Invariance
(2602.02381 - Zhang et al., 2 Feb 2026) in Section 2.2 (Preliminaries: contrastive SSL)