Sensitivity Analysis for Instrumental Variables Under Joint Relaxations of Monotonicity and Independence

Published 26 Mar 2026 in econ.EM and stat.ME | (2603.25529v2)

Abstract: In this paper I develop a breakdown frontier approach to assess the sensitivity of Local Average Treatment Effects (LATE) estimates to violations of monotonicity and independence of the instrument. I parametrize violations of independence using the concept of $c$-dependence from Masten & Poirier (2018) and allow for the share of defiers to be greater than zero but smaller than the share of compliers. I derive identified sets for the LATE and the Average Treatment Effect (ATE) in which the bounds are functions of these two sensitivity parameters. Using these bounds, I derive the breakdown frontier for the LATE, which is the weakest set of assumptions such that a conclusion regarding the LATE holds. I derive consistent sample analogue estimators for the breakdown frontiers and provide a valid bootstrap procedure for inference. Monte Carlo simulations show the desirable finite-sample properties of the estimators and an empirical application shows that the conclusions regarding the effect of family size on unemployment from Angrist & Evans (1998) are highly sensitive to violations of independence and monotonicity.

Abstract PDF Upgrade to Chat

Authors (1)

Pedro Picchetti

Summary

The paper introduces a breakdown frontier approach to jointly relax monotonicity and independence assumptions in binary-outcome IV frameworks for causal inference.
Utilizing nonparametric estimators and bootstrapping, the method calculates sharp identification bounds for LATE and ATE under controlled deviations.
Empirical application to family size and female unemployment shows that even minor violations can undermine standard IV conclusions.

Sensitivity Analysis in Instrumental Variables: Joint Relaxations of Monotonicity and Independence

Introduction

The paper "Sensitivity Analysis for Instrumental Variables Under Joint Relaxations of Monotonicity and Independence" (2603.25529) investigates the partial identification of causal effects in binary-outcome instrumental variables (IV) frameworks, focusing on Local Average Treatment Effects (LATE) and the Average Treatment Effect (ATE) under simultaneous, explicit relaxations of the canonical monotonicity and independence assumptions. The core technical contribution is the construction of "breakdown frontiers," which delineate the largest combinations of violation magnitudes of both assumptions that still allow researchers to draw pre-specified substantive conclusions about the LATE.

Model and Relaxations

The approach denotes an assignment variable $Z$ (the instrument), which may imperfectly affect treatment uptake due to compliance issues. The model permits the actual treatment $D$ to diverge from assignment, with outcomes $Y$ governed by potential outcomes $Y(D(z), z)$ , all binary-valued.

Violations of instrument independence are parameterized using the $c$ -dependence framework of Masten & Poirier (2018), where $c$ upper bounds deviations between propensity scores computed on observables versus those that incorporate unobserved potential treatments and outcomes. Simultaneously, monotonicity is relaxed by allowing a strictly positive share of defiers $T^{def}$ , constrained to be less than the share of compliers. Both parameters $c$ and $T^{def}$ are on probability scales and thus directly trade-off in robustness analysis.

Identification Bounds

The paper delineates sharp identified sets for potential joint and marginal probabilities, and, crucially, for parameters of substantive interest such as the ITT, LATE, and ATE, as functions of $c$ and $D$ 0.

Figure 1: Identified sets for the LATE as a function of $D$ 1, contrasting the cases $D$ 2 (left) and $D$ 3 (right).

For $D$ 4, sharp identification is possible if monotonicity is assumed, with the identified set collapsing to the standard Wald estimand for LATE. As $D$ 5 increases or $D$ 6 grows, the sets widen and may quickly become too broad to yield sign-identified or policy-relevant inference—a critical, nontrivial finding substantiated numerically.

Breakdown Frontiers: Robustness to Assumption Violations

The central methodological innovation is the breakdown frontier (BF): level sets in the $D$ 7 plane corresponding to the weakest combinations of assumption violations under which a given conclusion, e.g., $D$ 8, can still be maintained. Analytically, the frontiers can be expressed in closed form and are shown, under regularity, to sharply separate robust from non-robust regions for qualitative inference.

Figure 2: Breakdown frontiers for the LATE exceeding $D$ 9 (left) and $Y$ 0 (right). Robust regions indicated in blue.

The BF approach generalizes and systematizes sensitivity analysis beyond one-dimensional relaxations, precisely quantifying the joint margin-of-error for both monotonicity and independence. Notably, trade-offs are transparent: allowing larger shares of defiers strictly contracts the set of permissible $Y$ 1 values (and vice versa) under which sign-inference or policy conclusions can be drawn.

Estimation and Inference

Consistent, nonparametric sample-analogue estimators of both the identified set bounds and the breakdown frontiers are proposed and analyzed using the mathematical machinery of Hadamard directional differentiability and the functional delta method. Standard nonparametric bootstrapping is extended using the results of Fang & Santos (2018) and Hong & Li (2018), yielding asymptotically valid uniform confidence bands for the frontier curves.

Figure 3: Sampling distribution of the breakdown frontier estimator for N=1000 (left) and N=2000 (right), indicating consistency and the absence of finite-sample bias.

Figure 4: Finite-sample bias of the breakdown frontier estimator is negligible, with coverage of lower confidence bands tracking near true values.

Monte Carlo experiments substantiate the inferential validity, displaying minimal bias and proper coverage even with moderate sample sizes.

Empirical Application: Family Size and Female Unemployment

The methodology is illustrated through a re-analysis of the canonical Angrist & Evans (1998) application, using "same-sex" as an instrument for family size when estimating effects on female non-employment. Baseline calibrations obtained from selection-on-observables techniques and external survey data provide context for plausible magnitudes of $Y$ 2 and $Y$ 3. The breakdown frontier analysis reveals that even minute violations— $Y$ 4, $Y$ 5—invalidate the qualitative conclusion that the effect of additional children on non-employment is negative. The permissible robustness region lies strictly inside plausible values for both parameters, sharply limiting the empirical credibility of substantive findings predicated on conventional IV assumptions.

Figure 5: Breakdown frontier for the estimated effect of family size on unemployment; empirical robustness is not supported for realistic violation levels.

Theoretical and Practical Implications

This work makes precise the extent to which standard IV-based inference is reliant on strict monotonicity and independence. The explicit, joint parametrization and graphical presentation via breakdown frontiers provide a tool for immediate assessment of sensitivity; robustness claims must be quantified with respect to both dimensions, not abstractly presumed. Results underscore that in many empirical contexts (especially when both assumptions may be questionable), even weak or moderate violations can suffice to render IV point estimates uninformative.

Theoretically, these findings highlight the crucial role of partial identification and the necessity to report robust regions rather than relying solely on point identification. The functional delta method extensions and uniform bootstrap confidence bands serve as a blueprint for robust inference in other nonstandard settings involving directionally differentiable functionals.

Conclusion

By reframing IV sensitivity analysis as a breakdown frontier problem and rigorously quantifying the limits of assumption-robustness, this research exposes the fragility of qualitative IV conclusions under joint relaxations of monotonicity and independence. The proposed approach is not only methodologically general, but also practically implementable, supporting recalibration of empirical standards in applied econometrics. Future work may extend to multivalued treatments, other forms of non-compliance, or different violation parameterizations, and address inference refinements in weak instrument regimes.

Markdown Report Issue