Conformal Interquantile Regression (CIR)
- Conformal Interquantile Regression (CIR) is a method that constructs near-minimal, distribution-aware prediction intervals with finite-sample marginal coverage.
- It leverages interquantile regression and conformal calibration to efficiently adapt to skewed and heteroscedastic conditional distributions.
- The approach, along with its CIR+ variant, offers strong theoretical guarantees and computational advantages over methods like CQR and CHR.
Conformal Interquantile Regression (CIR) is a method for constructing near-minimal prediction intervals in regression that guarantee finite-sample marginal coverage, achieve approximate conditional coverage under mild assumptions, and adapt to skewed, heteroscedastic conditional distributions. CIR applies conformal calibration to outcome distributions estimated via interquantile regression, yielding efficient, distribution-aware intervals. The framework is realized with fast algorithms and has been extended by the CIR+ variant and connected to other conformal prediction approaches such as Conformalized Quantile Regression (CQR), Conformal Histogram Regression (CHR), and Conformal Thresholded Intervals (CTI) (Guo et al., 6 Jan 2026, Romano et al., 2019, Luo et al., 2024, Sesia et al., 2019, Gupta et al., 2019).
1. Problem Setup and Fundamental Definitions
CIR operates in the standard regression setting: given exchangeable data with , one splits the sample into a training set , a calibration set , and a test set . The goal is to construct, for each test , a prediction interval such that the marginal coverage
holds for a user-specified miscoverage . Ideally, the interval also achieves approximate conditional coverage and is as short as possible.
Interquantile regression fits a black-box quantile regressor to estimate the –th conditional quantile for . The -th interquantile interval is . The classical two-quantile case uses and to define the interquantile range (Guo et al., 6 Jan 2026, Gupta et al., 2019).
2. CIR Algorithm: Calibration and Construction
CIR’s validity arises from its calibration strategy via conformity scores. For each calibration pair , CIR computes , the smallest number of consecutive interquantile bins whose union covers : where is the shortest union of adjacent intervals. With , the threshold quantifies the maximum allowed miscoverage. The calibrated interval is defined by the -th smallest score, denoted , and prediction proceeds by outputting (Guo et al., 6 Jan 2026).
The CIR+ enhancement applies a width-based tie-break for samples with identical . The refined fractional score is
where rescales the length of the -th interval. Calibration now uses the -th smallest , decomposed as ; for a test point , CIR+ outputs either or based on the scaled interval length, yielding narrower intervals in expectation (Guo et al., 6 Jan 2026).
3. Theoretical Guarantees
CIR guarantees finite-sample marginal coverage under the sole assumption of sample exchangeability: This is achieved by selecting the -th quantile of the conformity scores—ranks are uniform, ensuring the desired coverage level (Guo et al., 6 Jan 2026, Romano et al., 2019, Gupta et al., 2019, Sesia et al., 2019).
Asymptotic conditional coverage is attained when: data are i.i.d.; quantile estimation is consistent; and is unimodal. As ,
and the length of approaches the oracle shortest interval. Unimodality ensures smallest unions are nested, supporting optimality in interval construction (Guo et al., 6 Jan 2026, Gupta et al., 2019).
4. Computational Efficiency and Comparisons
CIR’s efficiency arises from bypassing histogram construction (as in CHR) and limiting calibration to operations (check at most bins per calibration sample), with test-time complexity per point. For comparison:
| Method | Calibration Complexity | Test-Time Complexity | Adaptivity to Skewness |
|---|---|---|---|
| CIR | Strong | ||
| CQR | Weak | ||
| CHR | Strong |
CIR’s algorithmic cost matches CQR (for multi-quantile regression) and greatly surpasses CHR in speed, especially for large (Guo et al., 6 Jan 2026, Luo et al., 2024).
5. Empirical Evaluation
Extensive experiments on synthetic and real-world datasets benchmark CIR and CIR+ against CQR, CHR, DistSplit, DCP, and DCP-CQR. On synthetic data with heteroscedastic, asymmetric, and jump noise models, CIR/CIR+ replicates CHR’s minimal intervals and coverage, while consuming only 1–5% of CHR’s total computation time. CQR yields noticeably wider intervals under skewed distributions.
Real dataset evaluations span seven UCI/MEPS regression tasks. CIR+ typically yields the shortest or near-shortest intervals for both neural-net and random-forest quantile regressors, maintaining marginal coverage near 90% and providing strong conditional coverage. Calibration and prediction are up to 100× faster than histogram-based CHR, and CIR+ intervals are narrowest in most splits (Guo et al., 6 Jan 2026, Luo et al., 2024, Sesia et al., 2019).
6. Connections to Other Methods
CIR generalizes and extends prior conformal regression approaches. Conformalized Quantile Regression (CQR) (Romano et al., 2019, Sesia et al., 2019) is recovered in the two-quantile case, constructing intervals with the empirical conformal quantile of the nonconformity scores. CQR is theoretically valid but adapts poorly to skewed distributions.
Conformal Histogram Regression (CHR) achieves strong distribution adaptivity via histogram binning but at the cost of computational complexity (Luo et al., 2024). CIR is “CHR without histogram”: it finds minimal unions of quantile intervals for coverage, directly leveraging multi-quantile regression without explicit density estimation.
Conformal Thresholded Intervals (CTI) calibrate by thresholding interquantile interval lengths; the threshold is set to the -quantile of interval-length scores, approximating Neyman–Pearson optimality (Luo et al., 2024). CIR’s fractionally-scored enhancement in CIR+ is similar in spirit.
Nested conformal frameworks and their cross-conformal, OOB, and jackknife+ extensions further unify these approaches, subsuming all such nonconformity scores under the calibration of rank-based level sets for prediction (Gupta et al., 2019).
7. Limitations, Extensions, and Best-Use Scenarios
CIR requires high-quality multi-quantile regressors; poor quantile estimation or limited data degrade interval quality and conditional validity. CIR’s asymptotic optimality for conditional coverage relies on the unimodality of , though marginal coverage is achieved regardless.
CIR and especially CIR+ are suited for large-scale regression tasks with skewness and heteroscedasticity, settings that demand per-point interval adaptivity without the prohibitive cost of histogram-based methods. Fast recalibration under concept shift is naturally supported, especially with nonparametric quantile regression backends.
Extensions such as the CTI approach and ensemble methods (e.g., QOOB) continue to generalize CIR by deploying aggregation, cross-conformalization, and uniform marginal coverage under exchangeability (Gupta et al., 2019, Luo et al., 2024).
References:
- "Fast Conformal Prediction using Conditional Interquantile Intervals" (Guo et al., 6 Jan 2026)
- "Conformalized Quantile Regression" (Romano et al., 2019)
- "Conformal Thresholded Intervals for Efficient Regression" (Luo et al., 2024)
- "A comparison of some conformal quantile regression methods" (Sesia et al., 2019)
- "Nested conformal prediction and quantile out-of-bag ensemble methods" (Gupta et al., 2019)