Papers
Topics
Authors
Recent
Search
2000 character limit reached

Chatterjee's Rank Correlation Tests

Updated 9 October 2025
  • The paper introduces Chatterjee’s rank correlation, a nonparametric measure that is 0 for independent variables and 1 when one variable is an almost sure function of the other, offering an unbiased estimator and asymptotic normality under the null.
  • It employs efficient O(n log n) computation with rank-order differences and bias corrections, using the m-out-of-n bootstrap to ensure valid inference especially in non-i.i.d. or noncontinuous contexts.
  • The test framework is extended to multivariate and high-dimensional settings through nearest-neighbor graphs and spectral analysis, with boosted and combined methods to improve power in detecting complex dependencies.

Chatterjee's rank correlation-based tests constitute a family of nonparametric and distribution-free procedures for quantifying and detecting dependencies between random variables, with particular emphasis on directed (functional) rather than symmetric association. Unlike classical rank-based measures such as Spearman's rho or Kendall's tau, Chatterjee's correlation ξ\xi is tailored to measure the strength of functional dependence: ξ=0\xi=0 if and only if variables are independent, and ξ=1\xi=1 if and only if one is an (almost sure) measurable function of the other. Since its introduction, the theory, properties, extensions, comparative behavior, implementation protocols, and inferential implications of ξ\xi have been the subject of intensive research across probability, statistics, and applied computational mathematics.

1. Definition, Population Properties, and Computable Form

Chatterjee’s rank correlation is formally defined for a pair of continuous random variables (or via their copula CC) as

ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 2

where 1C(u,v)\partial_1 C(u,v) is the density of CC with respect to the first argument. It can equivalently be interpreted in terms of conditional variances, as

ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }

For independent (X,Y)(X, Y), ξ=0\xi=00; for perfect functional dependence ξ=0\xi=01, ξ=0\xi=02. Intermediate values correspond to intermediate degrees of predictability, but ξ=0\xi=03 does not linearize the notion of strength of association. In sample computations, an unbiased estimator is

ξ=0\xi=04

where ξ=0\xi=05 indexes ξ=0\xi=06-ordered samples and ξ=0\xi=07 are the concomitant ranks of ξ=0\xi=08. This computation can be performed in ξ=0\xi=09 time for continuous data with no ties.

2. Inference: Asymptotic Laws, Bootstrap, and Bias Correction

For i.i.d. data and continuous margins, the normalized statistic is asymptotically normal under the null,

ξ=1\xi=10

as established via Stein’s method and projection representations (Auddy et al., 2021, Lin et al., 2022, Kroll, 2024). General consistency and asymptotic normality extend to non-i.i.d. (strongly mixing) data given suitable assumptions, after a vanishing bias correction. Bootstrap inference for ξ=1\xi=11 must employ the ξ=1\xi=12-out-of-ξ=1\xi=13 bootstrap, not ξ=1\xi=14-out-of-ξ=1\xi=15, to ensure valid coverage and limit approximation, especially with noncontinuous data or under dependence (Dette et al., 2023, Dalitz et al., 2023). Simple normalization of ξ=1\xi=16 by its finite-sample upper bound further reduces finite-sample bias.

3. Power, Detection Boundaries, and Comparison with Competing Methods

Chatterjee's test is consistent and distribution-free under arbitrary alternatives, but it is rate sub-optimal for local (contiguous) alternatives relative to independence: the detection threshold against local parametric alternatives scales as ξ=1\xi=17 (e.g., for local correlations ξ=1\xi=18 with ξ=1\xi=19) (Shi et al., 2020, Auddy et al., 2021). Competing methods such as Hoeffding’s ξ\xi0, Blum–Kiefer–Rosenblatt’s ξ\xi1, or Bergsma–Dassios–Yanagimoto’s ξ\xi2 achieve the ξ\xi3 detection boundary and are thus preferred for local weak alternatives. Nevertheless, for testing non-trivial levels of association (e.g., distinguishing ξ\xi4 vs ξ\xi5), the Chatterjee-based test is minimax optimal with ξ\xi6 at the ξ\xi7 rate (Auddy et al., 2021).

Recent theoretical and algorithmic advances have addressed this shortcoming:

  • Boosted versions that aggregate information over ξ\xi8 right-nearest neighbors, ξ\xi9, can achieve near-parametric detection boundaries, especially when CC0 is scaled appropriately with CC1 (Lin et al., 2021).
  • Combined tests (e.g., CC2 with CC3 Spearman's rank correlation) yield uniform type I control and substantially improved power across monotonic and nonmonotonic alternatives (Zhang, 2023, Zhang, 2024).

4. Multivariate and High-Dimensional Extensions

Extensions of Chatterjee's methodology to multivariate (vector-valued) responses and predictors have been developed at two levels:

  • Azadkia–Chatterjee correlation: Employs nearest-neighbor graph constructions for multivariate variables. Recent versions use rank-based nearest-neighbor graphs to guarantee scale invariance, consistency, and asymptotic normality (Tran et al., 2024).
  • Multi-response dependence measure CC4: For multivariate responses CC5 and predictors CC6, the measure

CC7

generalizes CC8 and is permutation-invariant under certain symmetrizations. The corresponding estimator CC9 is strongly consistent and asymptotically normal (Ansari et al., 2022).

  • Tests of joint and complete independence: In high dimensions, quadratic sum statistics and extreme-value type statistics based on pairwise ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 20 are developed for composite testing, enhanced with variable screening to address dense and sparse alternative regimes (Xia et al., 2024, Olivares et al., 27 Mar 2025). In all cases, the null distribution is derivable or accurately estimated by block-multiplier or ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 21-out-of-ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 22 bootstraps.

A further generalization employs the distance-based Chatterjee correlation, where data are mapped to real-valued “distance transformed” representations (Szekely et al. transformation), allowing ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 23-type dependence measurement and causal inference for general multivariate or even complex-valued data (Pascual-Marqui et al., 2024).

5. Mathematical Relations to Classical Rank Correlations and Structural Constraints

Chatterjee's ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 24 has fundamentally different structural and functional properties compared to Spearman’s ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 25 and Spearman’s footrule ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 26:

  • For continuous bivariate copulas ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 27, ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 28 is a quadratic functional of the copula's derivative, in contrast to ξ(C)=60101[1C(u,v)]2uvdudv2\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 u v \, du \, dv - 29 or 1C(u,v)\partial_1 C(u,v)0 which are linear functionals.
  • The attainable 1C(u,v)\partial_1 C(u,v)1 region over all (or stochastically increasing/decreasing) copulas is convex, with boundaries defined by a family of piecewise-linear, absolutely continuous, asymmetric copulas (Ansari et al., 18 Jun 2025). For stochastically monotonic copulas,

1C(u,v)\partial_1 C(u,v)2

always holds. The maximal possible gap 1C(u,v)\partial_1 C(u,v)3 is 1C(u,v)\partial_1 C(u,v)4 for a specific copula with 1C(u,v)\partial_1 C(u,v)5, 1C(u,v)\partial_1 C(u,v)6.

  • The region 1C(u,v)\partial_1 C(u,v)7 for stochastically increasing copulas is exactly 1C(u,v)\partial_1 C(u,v)8 (Rockel, 8 Sep 2025). The upper bound is uniquely achieved by the Fréchet copula, and the lower bound is approached by a newly constructed two-parameter copula family.
  • The Markov product of the copula and its transpose provides the essential link: 1C(u,v)\partial_1 C(u,v)9, connecting CC0 to CC1 via copula operations.

6. Spectral Analysis and Random Matrix Behavior

For large random vectors of independent variables, the empirical spectral distribution (ESD) of the symmetrized Chatterjee rank correlation matrix converges to the Wigner semicircle law, rather than the Marchenko–Pastur law found for Pearson, Spearman, or Kendall matrices (Dong et al., 8 Oct 2025). This deviation marks the first example of such a phenomenon among correlation matrices and is foundational for developing CLTs for linear spectral statistics and for tests of high-dimensional complete independence based on eigenvalue distributions.

7. Controversies and Statistical Limitations

A key theoretical limitation of Chatterjee’s CC2 is its lack of weak continuity (Bücher et al., 2024). That is, for any CC3 with continuous margins, one can approximate CC4 arbitrarily closely (in distribution) by random pairs for which CC5, and thus for any CC6, there exists a convergent sequence with constant CC7. Therefore,

  • No test based solely on CC8 can have nontrivial power uniformly separating CC9 from ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }0; in particular, tests based on smooth functions of the empirical statistic have trivial power against such "perfect dependence yet nearby" alternatives.
  • Uniform asymptotic confidence intervals for ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }1 must necessarily degenerate, covering the entire interval ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }2 with high probability.
  • This impossibility is not a defect particular to ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }3 but is fundamental to any rank-based association measure that attains ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }4 only for measurable functions.

These facts emphasize the importance of careful interpretation, the necessity for alternative or combined inferential strategies in certain regimes, and motivate the ongoing research in boosting, combining, and generalizing Chatterjee’s framework for practical and theoretical robustness across varied dependence structures.


Table: Comparative Aspects of Chatterjee’s Rank Correlation and Related Quantities

Quantity Null Limiting Law Power (Local Alt.) Maximal Value Functional Target
ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }5 (Chatterjee) Normal, variance ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }6 rate sub-optimal (ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }7) ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }8 functional dependence (ξ(X,Y)=RVar(E[1YyX])dPY(y)RVar[1Yy]dPY(y)\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] dP_Y(y) }9)
(X,Y)(X, Y)0 (Spearman) Non-normal (degenerate) rate-optimal ((X,Y)(X, Y)1) (X,Y)(X, Y)2 (monotone) concordance (monotonic)
(X,Y)(X, Y)3, (X,Y)(X, Y)4, (X,Y)(X, Y)5 Non-normal (degenerate) rate-optimal ((X,Y)(X, Y)6) less than (X,Y)(X, Y)7 concordance/symmetric dependence

This synthesis incorporates pivotal results on estimation, asymptotic and finite-sample performance, extension to multivariate and high-dimensional analysis, algebraic structure, and known theoretical limitations, with references to the primary literature where each point is established (Shi et al., 2020, Auddy et al., 2021, Ansari et al., 2022, Zhang, 2023, Dalitz et al., 2023, Zhang, 2024, Kroll, 2024, Xia et al., 2024, Bücher et al., 2024, Tran et al., 2024, Olivares et al., 27 Mar 2025, Ansari et al., 18 Jun 2025, Rockel, 8 Sep 2025, Dong et al., 8 Oct 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (18)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Chatterjee's Rank Correlation-Based Tests.