Papers
Topics
Authors
Recent
Search
2000 character limit reached

Semiparametric semi-supervised learning for general targets under distribution shift and decaying overlap

Published 9 May 2025 in math.ST and stat.TH | (2505.06452v1)

Abstract: In modern scientific applications, large volumes of covariate data are readily available, while outcome labels are costly, sparse, and often subject to distribution shift. This asymmetry has spurred interest in semi-supervised (SS) learning, but most existing approaches rely on strong assumptions -- such as missing completely at random (MCAR) labeling or strict positivity -- that put substantial limitations on their practical usefulness. In this work, we introduce a general semiparametric framework for estimation and inference in SS settings where labels are missing at random (MAR) and the overlap may vanish as sample size increases. Our framework accommodates a wide range of smooth statistical targets -- including means, linear coefficients, quantiles, and causal effects -- and remains valid under high-dimensional nuisance estimation and distributional shift between labeled and unlabeled samples. We construct estimators that are doubly robust and asymptotically normal by deriving influence functions under this decaying MAR-SS regime. A key insight is that classical root-$n$ convergence fails under vanishing overlap; we instead provide corrected asymptotic rates that capture the impact of the decay in overlap. We validate our theory through simulations and demonstrate practical utility in real-world applications on the internet of things and breast cancer where labeled data are scarce.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.