Papers
Topics
Authors
Recent
Search
2000 character limit reached

Chatterjee’s Rank Correlation

Updated 22 February 2026
  • Chatterjee’s rank correlation is a nonparametric, rank-based statistic that quantifies the extent to which one variable is a function of another, ranging from 0 (independence) to 1 (perfect dependence).
  • It is derived from copula theory, features a closed-form for continuous distributions, and supports efficient bootstrap and kernel-based inference with asymptotic normality properties.
  • Key limitations include issues with weak continuity and local power against near-independence, which can be mitigated by combining it with classical measures like Spearman’s rho.

Chatterjee’s Rank Correlation Coefficient is a nonparametric, rank-based measure of dependence between random variables, designed to quantify the extent to which one variable is a functional of another, regardless of monotonicity. The statistic is grounded in copula theory, admits a closed-form for continuous distributions, is distribution-free under independence, and is asymptotically normal in a wide range of settings. While it provides powerful tools for functional dependence and independence testing, subtle issues arise regarding weak continuity and local power that distinguish it from classical concordance measures.

1. Definition and Mathematical Formulation

Let (X,Y)(X, Y) be continuous random variables with joint distribution FX,YF_{X,Y} and copula CC. Chatterjee’s rank correlation coefficient, generally denoted ξ\xi or ξ(X,Y)\xi(X,Y), is defined in multiple, equivalent forms:

  • Copula Form:

ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 2

where 1C(u,v)\partial_1 C(u,v) is the partial derivative of the copula with respect to its first argument (Sato, 13 Dec 2025, Ansari et al., 18 Jun 2025, Rockel, 8 Sep 2025).

  • Population (Integral) Form:

ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}

This representation makes it clear that ξ\xi measures the proportion of total variation in YY explained via conditioning on FX,YF_{X,Y}0 (Sato, 13 Dec 2025, Dalitz et al., 2023).

  • Sample Estimator: For an i.i.d. sample FX,YF_{X,Y}1 of size FX,YF_{X,Y}2, and no ties in FX,YF_{X,Y}3,

FX,YF_{X,Y}4

Here, FX,YF_{X,Y}5 is the rank of FX,YF_{X,Y}6 where data are sorted so FX,YF_{X,Y}7 and FX,YF_{X,Y}8 is the concomitant value (Sato, 13 Dec 2025, Zhang, 2022, Zhang, 2023).

  • Graph-Based Extension: In higher dimensions or for multivariate FX,YF_{X,Y}9, Azadkia and Chatterjee’s graph-based version is defined using nearest-neighbor pairs in CC0-space (Ansari et al., 2022, Han et al., 2022).

2. Fundamental Properties

3. Asymptotic and Finite-Sample Theory

  • Asymptotic Normality:
  • Symmetrized Statistic: The maximum of ξ\xi9 and ξ(X,Y)\xi(X,Y)0 converges in distribution to a skew-normal limit (Zhang, 2022).
  • Local Power and Detection Boundary: For independence testing, the detection boundary for alternatives with ξ(X,Y)\xi(X,Y)1 cannot be reached; for many classical alternatives (e.g. Gaussian correlation ξ(X,Y)\xi(X,Y)2), only ξ(X,Y)\xi(X,Y)3 is detectable (Auddy et al., 2021, Shi et al., 2020). This rate is suboptimal relative to classical measures like Hoeffding's ξ(X,Y)\xi(X,Y)4, Blum-Kiefer-Rosenblatt's ξ(X,Y)\xi(X,Y)5, and Yanagimoto’s ξ(X,Y)\xi(X,Y)6, which detect alternatives at the ξ(X,Y)\xi(X,Y)7 regime.
  • Minimax-optimality for Strong Dependence: For testing a fixed nonzero level of dependence (ξ(X,Y)\xi(X,Y)8), tests based on ξ(X,Y)\xi(X,Y)9 achieve the optimal ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 20 rate (Auddy et al., 2021).
  • Variance Estimation and Bootstrap: Analytical and ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 21-out-of-ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 22 bootstrap-based variance estimations are consistent for constructing valid confidence intervals (Dette et al., 2023, Dalitz et al., 2023).

4. Relationship to Other Rank Correlations

  • Spearman's ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 23: While both statistics are rank-based, ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 24 focuses on functional dependence, whereas ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 25 captures monotonic association. The possible ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 26 pairs fill a convex region; for stochastically increasing (SI) or decreasing copulas, ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 27, with a maximal difference of ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 28 attainable by an explicit copula (Ansari et al., 18 Jun 2025).
  • Spearman’s Footrule ξ(C)=6[0,1]2[1C(u,v)]2uvdudv2\xi(C) = 6 \iint_{[0,1]^2} [\partial_1 C(u,v)]^2\,u v\, du\,dv - 29: When 1C(u,v)\partial_1 C(u,v)0 and 1C(u,v)\partial_1 C(u,v)1 are continuous, 1C(u,v)\partial_1 C(u,v)2 equals the footrule of the Markov product of the copula and its transpose. For SI copulas, the region 1C(u,v)\partial_1 C(u,v)3 (1C(u,v)\partial_1 C(u,v)4) is sharp (Rockel, 8 Sep 2025).
  • Extremal Cases: There are explicit rank patterns where 1C(u,v)\partial_1 C(u,v)5 is close to zero but 1C(u,v)\partial_1 C(u,v)6 is near 1 and vice versa, highlighting their complementary sensitivities (Zhang, 2023).
  • Combined Tests and Power: Max-type tests combining 1C(u,v)\partial_1 C(u,v)7 with Spearman’s 1C(u,v)\partial_1 C(u,v)8 or Kendall’s 1C(u,v)\partial_1 C(u,v)9 (e.g., ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}0) have favorable properties: they are asymptotically jointly normal under independence, with nontrivial power for both monotone and non-monotone scenarios (Zhang, 2023, Zhang, 2024).

5. Graph-Based and Multivariate Extensions

  • Azadkia–Chatterjee Correlation: In the presence of multivariate predictors, the statistic generalizes to the nearest neighbor graph-based estimator,

ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}1

for ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}2 the nearest neighbor of ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}3 (Ansari et al., 2022, Han et al., 2022, Tran et al., 2024).

  • Manifold Adaptivity: When ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}4 lies on an ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}5-dimensional submanifold, the limiting null variance depends only on ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}6, not on the ambient dimension (Han et al., 2022).
  • Rank-Based NNG: The rank-vector-based nearest-neighbor graph (Rosenbaum NNG), which uses marginal ranks, achieves full scale invariance and improved finite-sample behavior especially in high-dimensional or rescaled covariate regimes (Tran et al., 2024).
  • Multiresponse Generalization: The ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}7 statistic, defined via sequential conditioning and normalization, extends the measure to ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}8, preserving interpretation and properties such as equitability and scale invariance. The estimator ξ(X,Y)=Var(E[1{Yt}X])dFY(t)Var(1{Yt})dFY(t)\xi(X, Y) = \frac{\displaystyle \int \operatorname{Var} \left( \mathbb{E}[ \mathbf{1}\{ Y \geq t \} \mid X] \right) dF_Y(t)}{\displaystyle \int \operatorname{Var}(\mathbf{1}\{ Y \geq t \}) dF_Y(t)}9 is strongly consistent and asymptotically normal under mild conditions (Ansari et al., 2022).

6. Practical Computation and Inference

  • Computational Complexity: For univariate ξ\xi0, computing ξ\xi1 requires ξ\xi2 time. The nearest-neighbor generalization for multivariate ξ\xi3 can also be computed in ξ\xi4 with data structures such as KD-trees (Dalitz et al., 2023, Tran et al., 2024).
  • Bias and Normalization: The maximum attainable value of ξ\xi5 is strictly less than 1 for small ξ\xi6, introducing finite-sample bias. Simple normalization (e.g., scaling by its maximal value on the sample) reduces this bias (Dalitz et al., 2023).
  • Bootstrap Inference: The ξ\xi7-out-of-ξ\xi8 bootstrap is consistent for distributional inference on ξ\xi9 for both continuous and discrete data, and usually outperforms the classical YY0-out-of-YY1 bootstrap, especially in terms of coverage accuracy (Dette et al., 2023, Dalitz et al., 2023).
  • Kernel Estimation: A kernel estimator for YY2 achieves asymptotic normality at a faster rate than the original rank-based estimator, providing improved detection for local alternatives near independence (Azadkia et al., 15 Feb 2026).

7. Limitations, Controversies, and Open Questions

  • Lack of Weak Continuity: Chatterjee’s coefficient is not continuous with respect to weak convergence. Sequences of distributions may converge weakly to independence, yet YY3 remains nonzero (even 1) along the sequence. This discontinuity is structurally required by the property that YY4 iff YY5 (Bücher et al., 2024).
  • Pathologies in Inference: Due to the above, tests for independence or confidence intervals based on YY6 can have trivial power against alternatives arbitrarily close to independence, and uniform confidence intervals may fail to shrink, rendering them uninformative in large samples (Bücher et al., 2024).
  • Local Power Deficiency: For certain classical local alternatives (e.g., Gaussian correlation, rotation/mixture alternatives), YY7 is rate-suboptimal for independence testing compared to classical U-statistics, missing alternatives at the parametric YY8 detection threshold (Shi et al., 2020, Auddy et al., 2021).
  • Combined Tests: Combining YY9 with monotonicity-sensitive measures like Spearman’s FX,YF_{X,Y}00 or Kendall’s FX,YF_{X,Y}01 can mitigate some power deficiencies, creating robust tests across a wide array of dependence structures (Zhang, 2023, Zhang, 2024).

References

Chatterjee’s rank correlation is thus a foundational statistic for nonparametric dependence assessment, functionally complete for general forms of regression-like dependence, computationally tractable, and extensively analyzed both asymptotically and in the finite-sample regime. Its use in practice should be informed by awareness of its unique strengths and fundamental inferential limitations.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Chatterjee’s Rank Correlation Coefficient.