
Regression Model for Censored Data

Updated 17 October 2025
  • Regression models for censored data are specialized frameworks that address partially observed responses through methods like right-censoring in survival analysis.
  • Dimension reduction via conditional independence and single-index models simplifies high-dimensional kernel smoothing, mitigating the curse of dimensionality.
  • Weighted empirical risk minimization and two-stage trimmed least squares yield asymptotically normal inference even with complex, high-dimensional covariates.

A regression model for censored data refers to any statistical or machine learning framework in which the response variable is only partially observed due to censoring, most commonly through right-censoring in survival analysis, but also through left, interval, or random censoring mechanisms. The canonical paradigm involves a multivariate covariate $X \in \mathbb{R}^d$ and a univariate response $Y$ which is observable only up to an associated censoring variable $C$, so that one actually observes $T = Y \wedge C$ and an indicator $\delta = \mathbb{1}\{Y \leq C\}$. The principal challenge is to devise estimators for regression or conditional distribution functionals that correctly account for the censoring while controlling the curse of dimensionality stemming from high-dimensional covariates.

1. Problem Setup and the Curse of Dimensionality

In the censored-data regression context, for a sample of i.i.d. triplets $(X_i, T_i, \delta_i)$, the fundamental object is the estimation of a regression or distributional functional of the unobserved $(X_i, Y_i)$. Naïve nonparametric estimators of, for instance,

$$F(x, y) = \mathbb{P}(X \leq x,\, Y \leq y)$$

nominally require kernel or histogram smoothing in $d$ dimensions, leading to sample size requirements that grow exponentially in $d$. Direct approaches are thus practically infeasible for moderate-to-large $d$, motivating dimension-reduction assumptions that exploit structure in the dependence of $Y$ (and/or $C$) on $X$.
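To see the sparsity concretely, a small simulation (illustrative, not from the source) shows how quickly a fixed-radius neighborhood empties out as $d$ grows:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Fraction of n uniform points in [0,1]^d that land within distance 0.25
# of the centre: this is the local sample a fixed-bandwidth kernel
# smoother would see, and it collapses as d grows.
local_fraction = {}
for d in (1, 2, 5, 10):
    X = rng.uniform(size=(n, d))
    dist = np.linalg.norm(X - 0.5, axis=1)
    local_fraction[d] = float(np.mean(dist <= 0.25))

print(local_fraction)  # d=1 gives roughly 0.5; d=10 is essentially 0
```

With a fixed bandwidth, the effective local sample size thus vanishes exponentially in $d$, which is exactly why the smoothing steps below are carried out on a one-dimensional index instead.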

2. Dimension Reduction via Conditional Independence and Single-Index Models

The central structural assumption (A0) posits that for some known $g: \mathbb{R}^d \to \mathbb{R}$ (often taken as $g(X) = \lambda(\theta, X)$ for a low-dimensional parameter $\theta$), the censoring variable $C$ and the response $Y$ are conditionally independent given $g(X)$. That is,

$$Y \perp C \mid g(X).$$

Typically, $g(X)$ is parameterized so that $k = \dim(\theta) \ll d$, and the estimation of conditional functionals can be performed by nonparametric smoothing on $g(X)$, thereby bypassing the high dimensionality of $X$ in the smoothing step.

In the regression problem, a further dimension reduction is introduced via a mean regression single-index model for a possibly truncated response:

$$\mathbb{E}[Y \,\mathbb{1}\{Y \leq \tau\} \mid X] = m(\beta_0^\top X), \qquad \tau < \infty,$$

where $\beta_0 \in \mathbb{R}^d$ is a finite-dimensional parameter and $m$ is an unknown smooth function. This structure means that all information about $X$ influencing the mean is projected onto the one-dimensional index $\beta_0^\top X$.
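A minimal simulation of this design (the link function, censoring distribution, and parameter values are hypothetical choices for illustration, not from the source) makes the data-generating structure explicit:

```python
import numpy as np

rng = np.random.default_rng(1)
n, d = 2000, 6

# True index direction: only the first three covariates matter.
beta0 = np.array([1.0, -0.5, 0.25, 0.0, 0.0, 0.0])
beta0 /= np.linalg.norm(beta0)

X = rng.normal(size=(n, d))
index = X @ beta0                        # one-dimensional summary of X

def m(t):                                # unknown smooth link (assumed here)
    return np.sin(t) + 0.5 * t

Y = m(index) + 0.3 * rng.normal(size=n)  # latent response
C = rng.exponential(scale=3.0, size=n)   # censoring, independent of X
T = np.minimum(Y, C)                     # observed time T = Y ∧ C
delta = (Y <= C).astype(int)             # 1 = uncensored
```

Because $C$ is drawn independently of $X$ here, assumption (A0) holds trivially; the interesting regimes are those where $C$ depends on $X$ only through $g(X)$.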

3. Construction of Joint Distribution and Regression Estimators

3.1. Joint Distribution Estimation

Given the conditional independence structure via $g(X)$, the conditional distribution of the censoring variable at time $t$ is estimated via a generalized Beran estimator:

$$\hat{G}_{\theta}(t \mid z) = 1 - \prod_{T_i \leq t}\left[1 - \frac{w_i^{(\theta)}(z)}{\sum_{j=1}^n w_j^{(\theta)}(z)\, \mathbb{1}\{T_j \geq T_i\}}\right]^{1-\delta_i},$$

where $w_i^{(\theta)}(z) = K\left(\frac{\lambda(\theta, X_i) - z}{a_n}\right) \Big/ \sum_{j=1}^n K\left(\frac{\lambda(\theta, X_j) - z}{a_n}\right)$ for a univariate kernel $K$ and bandwidth $a_n \to 0$.
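A direct implementation of this product-limit formula can be sketched as follows (the Gaussian kernel is an illustrative choice; the source does not fix a kernel):

```python
import numpy as np

def beran_censoring_cdf(t, z, T, delta, index, bandwidth):
    """Generalized Beran estimate of G(t | z) = P(C <= t | g(X) = z).

    Kernel weights live on the one-dimensional index g(X), and the
    product-limit runs over *censored* observations (exponent
    1 - delta_i), i.e. Kaplan-Meier with the roles of Y and C swapped.
    """
    u = (index - z) / bandwidth
    k = np.exp(-0.5 * u**2)            # Gaussian kernel (assumption)
    w = k / k.sum()                    # Nadaraya-Watson weights

    order = np.argsort(T)
    T_s, d_s, w_s = T[order], delta[order], w[order]
    at_risk = np.cumsum(w_s[::-1])[::-1]   # sum of weights with T_j >= T_i

    surv = 1.0
    for Ti, di, wi, ri in zip(T_s, d_s, w_s, at_risk):
        if Ti > t:
            break
        if di == 0 and ri > 0:         # only censored points contribute
            surv *= 1.0 - wi / ri
    return 1.0 - surv
```

With all observations censored and no covariate effect, the estimator reduces to the ordinary empirical distribution function of the observed times, which gives a quick sanity check.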

The joint estimator of F(x,y)F(x, y) then takes the form

$$\widehat{F}_{\hat{g}}(x, y) = \frac{1}{n} \sum_{i=1}^n \frac{\delta_i\, \mathbb{1}\{T_i \leq y,\, X_i \leq x\}}{1 - \hat{G}_{\hat{\theta}}(T_i^- \mid \hat{g}(X_i))},$$

where $\hat{\theta}$ is a root-$n$-consistent estimator of the index parameter. This construction corrects for the effect of censoring via a weighting scheme that adapts to the conditional survival of the censoring variable.
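Computationally this is an inverse-probability-of-censoring weighted (IPCW) empirical distribution; a sketch, assuming the Beran-type estimates $\hat G(T_i^- \mid \hat g(X_i))$ have already been computed into an array `G_at_T`:

```python
import numpy as np

def joint_cdf_estimate(x, y, X, T, delta, G_at_T):
    """Plug-in estimate of F(x, y) = P(X <= x, Y <= y) under censoring.

    G_at_T[i] should approximate G(T_i^- | g(X_i)), e.g. from a Beran
    estimator evaluated just below T_i (assumption). Only uncensored
    points enter, each inflated by 1 / (1 - G) to recover the mass
    lost to censoring.
    """
    inside = (delta == 1) & (T <= y) & np.all(X <= x, axis=1)
    return float(np.sum(inside / (1.0 - G_at_T)) / len(T))
```

With no censoring ($\delta_i \equiv 1$, $\hat G \equiv 0$) this collapses to the ordinary empirical joint distribution function.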

3.2. Mean Regression Single-Index Estimation

The regression parameter is estimated via a two-stage trimmed least squares approach:

  1. Initial estimator: Minimize

$$M_n(\beta, \hat{f}, \tilde{J}) = \int \left(y - \hat{f}(\beta^\top x; \beta)\right)^2 \mathbb{1}\{y \leq \tau\}\, \tilde{J}(x)\, d\widehat{F}_{\hat{g}}(x, y)$$

over $\beta$ in a compact set, with an initial trimming function $\tilde{J}$ and a nonparametric kernel estimator $\hat{f}$.

  2. Final estimator: With the preliminary estimate $\beta_n$, update the trimming region $J(x) = \mathbb{1}\{f_\beta^\tau(\beta_n^\top x) > c\}$ and minimize the same criterion over $\beta$ in shrinking neighborhoods to obtain $\hat{\beta}$. Here, $f_\beta^\tau$ is the density of $\beta^\top X$ under truncation at $Y \leq \tau$.
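The criterion driving both stages can be sketched as follows (a leave-one-out fit and a Gaussian kernel are illustrative choices, not prescribed by the source; `w` stands for the censoring-correction weights $\delta_i / (1 - \hat G_i)$, and `trim` for the trimming indicator, $\tilde J$ in stage 1 or the updated $J$ in stage 2):

```python
import numpy as np

def trimmed_criterion(beta, X, T, w, tau, h, trim):
    """M_n(beta): censoring-weighted, truncated, trimmed squared error."""
    idx = X @ beta
    total = 0.0
    for i in np.where((T <= tau) & trim)[0]:
        # leave-one-out Nadaraya-Watson fit of T on the index (sketch)
        k = np.exp(-0.5 * ((idx - idx[i]) / h) ** 2) * w * (T <= tau)
        k[i] = 0.0
        if k.sum() == 0:
            continue
        fhat = (k * T).sum() / k.sum()
        total += w[i] * (T[i] - fhat) ** 2
    return total / len(T)
```

Stage 1 minimizes this over a compact set with the crude trimming indicator; stage 2 re-minimizes over a shrinking neighborhood of the stage-1 estimate after updating `trim` from an estimated index density.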

The nonparametric regression function is estimated as

$$\hat{f}(t; \beta) = \frac{\int K\left(\frac{\beta^\top x - t}{h}\right) y\, \mathbb{1}\{y \leq \tau\}\, d\widehat{F}_{\hat{g}}(x, y)}{\int K\left(\frac{\beta^\top x - t}{h}\right) \mathbb{1}\{y \leq \tau\}\, d\widehat{F}_{\hat{g}}(x, y)},$$

where $K$ is a univariate kernel and $h$ is a bandwidth.
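In terms of the observed sample, both integrals reduce to weighted sums, so $\hat f(t;\beta)$ is cheap to evaluate; a sketch with a Gaussian kernel (an assumption, as is the name `G_at_T` for precomputed Beran-type estimates of $G(T_i^- \mid \hat g(X_i))$):

```python
import numpy as np

def f_hat(t, beta, X, T, delta, G_at_T, tau, h):
    """Kernel regression estimate f(t; beta) under the IPCW measure.

    Integrals against dF_hat become sums over uncensored points with
    weight delta_i / (1 - G_i); the 1{y <= tau} factor truncates large
    responses as in the single-index model above.
    """
    w = delta / (1.0 - G_at_T) * (T <= tau)
    k = np.exp(-0.5 * ((X @ beta - t) / h) ** 2) * w
    denom = k.sum()
    return float((k * T).sum() / denom) if denom > 0 else float("nan")
```

When the response is constant and uncensored, the ratio of weighted sums returns that constant regardless of the evaluation point, which gives a quick sanity check.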

4. Asymptotic Theory and Efficiency

Under appropriate regularity conditions (smoothness of $G_\theta$, positivity of densities, conditions on kernels and bandwidths), the estimators admit uniform consistency and i.i.d.-style asymptotic (influence-function) representations for general functionals:

$$\sup_{\phi \in \mathcal{F}} \left| \int \phi(x, y)\, d[\widehat{F}_{\hat{g}} - F](x, y) \right| \to 0 \quad \text{a.s.}$$

For the regression parameter,

$$\sqrt{n}(\hat{\beta} - \beta_0) = \Omega^{-1} \left( \frac{1}{\sqrt{n}} \sum_{i=1}^n \eta(T_i, \delta_i, X_i) \right) + o_p(1),$$

with $\Omega$ defined in terms of derivatives of the regression function, the trimming regions, and the conditional distribution of $Y$. The estimator is root-$n$ consistent and asymptotically normal:

$$\sqrt{n}(\hat{\beta} - \beta_0) \xrightarrow{d} \mathcal{N}(0, \Omega^{-1} \Sigma \Omega^{-1}),$$

with $\Sigma$ computable from the influence-function representation.
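In practice, $\Omega$ and $\Sigma$ are replaced by plug-in estimates; a standard sandwich construction (a generic recipe, not spelled out in the source) would be

$$\hat{\Sigma} = \frac{1}{n} \sum_{i=1}^n \hat{\eta}(T_i, \delta_i, X_i)\, \hat{\eta}(T_i, \delta_i, X_i)^\top, \qquad \widehat{\operatorname{Var}}(\hat{\beta}) \approx \frac{1}{n}\, \hat{\Omega}^{-1} \hat{\Sigma}\, \hat{\Omega}^{-1},$$

from which Wald-type confidence intervals and tests for components of $\beta_0$ follow directly from the asymptotic normality above.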

5. Methodological Innovations and Practical Implications

The methodology fundamentally leverages:

  • Dimension reduction in the censoring model: By parameterizing $C \perp Y \mid g(X)$, kernel estimation for the censoring correction operates in one dimension, reducing variance and avoiding the curse of dimensionality.
  • Single-index regression: The mean structure is reduced to $\beta^\top X$, allowing a fully nonparametric regression function $m$ of a single argument, further mitigating dimensionality issues.
  • Weighted empirical risk minimization: Both the joint distribution and regression parameter estimators use weights that correct for censoring, via conditioning on low-dimensional summaries of the covariate information.

These allow consistent, asymptotically normal inference about joint and regression functionals in high-dimensional censored data, provided the conditional independence and single-index assumptions hold.

6. Theoretical and Computational Considerations

The kernel smoothing steps require bandwidth selection, and the estimator of $\theta$ in $g(X)$ must be root-$n$ consistent. The two-stage regression procedure requires careful selection of the trimming region and control of approximation error at the boundaries of the covariate space. Martingale asymptotics and counting process theory underlie the uniform convergence and central limit behavior.

The approach is implementable with moderate computational resources when $d$ is large but $k$ is small and kernel density estimation is practical. The main computational burden is in the iterative nonparametric estimation of the conditional censoring distribution and in the optimization over the regression parameter.

7. Impact on High-Dimensional Survival and Censored Regression

This framework addresses two major limitations in previous censored regression methodology:

  • It circumvents the high-variance, low-precision regime induced by kernel smoothing in high dimensions by explicit and testable dimension reduction in both censoring and regression index structure.
  • It provides asymptotics and practical implementation steps for kernel-based censored regression that are robust to high-dimensional, potentially complex, covariate distributions and censoring that depends on observable covariates.

The approach has direct implications for large-scale biomedical survival analysis, for reliability studies with high-dimensional predictors, and for semiparametric regression settings where the classic Cox proportional hazards model's assumptions are not tenable.


Key Formula Recap:

  • Joint distribution estimator: $\widehat{F}_{\hat{g}}(x, y) = \frac{1}{n} \sum_{i=1}^n \frac{\delta_i\, \mathbb{1}\{T_i \leq y,\, X_i \leq x\}}{1 - \hat{G}_{\hat{\theta}}(T_i^- \mid \hat{g}(X_i))}$
  • Conditional censoring cdf: $G_{\theta}(t \mid z) = \mathbb{P}(C \leq t \mid \lambda(\theta, X) = z)$
  • Nonparametric regression estimator: $\hat{f}(t; \beta) = \frac{\int K\left(\frac{\beta^\top x - t}{h}\right) y\, \mathbb{1}\{y \leq \tau\}\, d\widehat{F}_{\hat{g}}(x, y)}{\int K\left(\frac{\beta^\top x - t}{h}\right) \mathbb{1}\{y \leq \tau\}\, d\widehat{F}_{\hat{g}}(x, y)}$
  • Regression parameter estimator: $\hat{\beta} = \arg\min_{\beta \in \mathcal{B}_n} \int [y - \hat{f}(\beta^\top x; \beta)]^2\, \mathbb{1}\{y \leq \tau\}\, J(x)\, d\widehat{F}_{\hat{g}}(x, y)$
  • Asymptotic linearization: $\sqrt{n}(\hat{\beta} - \beta_0) = \Omega^{-1} \frac{1}{\sqrt{n}} \sum_{i=1}^n \eta(T_i, \delta_i, X_i) + o_p(1)$

This approach constitutes a significant advance in the methodology for regression with censored data under high-dimensional covariates, enabling practical and theoretically valid inference when classical nonparametric and semiparametric approaches are no longer feasible due to dimension and censoring dependencies (Lopez et al., 2011).
