Papers
Topics
Authors
Recent
Search
2000 character limit reached

Rank-Based Control Charts

Updated 17 February 2026
  • Rank-based control charts are nonparametric statistical tools that use the ordering of data rather than actual values to detect shifts in process location or scale.
  • They employ methodologies like the Wilcoxon signed-rank statistic and sequential rank schemes (SSR-CUSUM/SSR-SR) for distribution-free monitoring and sensitive shift detection.
  • Recent extensions integrate deep learning to accurately estimate control limits under complex tie scenarios, enhancing robustness and reducing false alarms.

Rank-based control charts are nonparametric statistical process control tools that rely on the ranks or order statistics of process data rather than their measured values. These charts provide distribution-free methodologies for detecting location and/or scale changes and offer robustness to non-normality, outliers, and tied observations. Rank-based approaches encompass both Shewhart-type and sequential schemes and have been algorithmically extended to handle tied data and estimation complexity via deep learning techniques.

1. Fundamentals and Motivation

Traditional Shewhart control charts assume observations Xt,kX_{t,k} are normally distributed, with location and scale estimated parametrically to set control limits (typically %%%%1%%%%). However, when the process data violate normality—due to heavy tails, skewness, or unknown distributions, or in the presence of ties caused by measurement rounding—parametric designs can exhibit inflated false alarm rates or diminished sensitivity to process shifts. Rank-based (nonparametric) control charts, including those based on Wilcoxon signed-rank, sign, and runs statistics, circumvent distributional assumptions by basing decision statistics on the relative ordering and sign relative to a target or center value (Mortezanejad et al., 26 Mar 2025).

Key properties include:

  • Distribution-free performance for in-control data, valid regardless of the underlying continuous distribution (given symmetry for certain designs).
  • Robustness to outliers and non-normality, due to reliance on ranks rather than values.
  • Adaptability to tied data via explicit modeling of zero differences and randomization techniques.

2. Core Methodologies: Signed-Rank and Sequential Rank Procedures

Wilcoxon Signed-Rank Statistic

For a sample Xt,1,,Xt,nX_{t,1},\dots,X_{t,n} in subgroup tt with reference value V0V_0, one computes difference vectors dk=Xt,kV0d_k = X_{t,k} - V_0, then forms Rk=rank(dk)R_k = \operatorname{rank}(|d_k|), and signs sk=sign(dk)s_k = \operatorname{sign}(d_k). The Wilcoxon signed-rank statistic is Wt=kskRkW_t = \sum_k s_k R_k. Under no shift and no ties (Untied Observations, UO), the statistic has E(Wt)=0E(W_t)=0 and Var(Wt)=n(n+1)(2n+1)/24\operatorname{Var}(W_t)=n(n+1)(2n+1)/24 (Mortezanejad et al., 26 Mar 2025). For data with ties (Tied Observations, TO), ties are assigned sk=0s_k=0 and the statistic and its moments are recomputed after excluding zero differences, requiring a binomial reduction of effective sample size.

Sequential Rank Schemes and CUSUMs

Signed Sequential Rank (SSR) CUSUMs (Lombard et al., 2017) and Shiryaev–Roberts (SSR-SR) schemes (Zyl et al., 2019) extend rank-based ideas to sequential monitoring. Given incoming data X1,X2,X_1,X_2,\ldots, each residual Yi=Xim0Y_i = X_i - m_0 (centered at known median m0m_0) is assigned a sign sis_i and a sequential rank ri+r_i^+. The statistic is transformed using a chosen odd, square-integrable score function J(u)J(u) (e.g., Wilcoxon: JW(u)=12uJ_W(u)=\sqrt{12}u; Van der Waerden: JVdW(u)=Φ1(1+u2)J_{VdW}(u)=\Phi^{-1}(\frac{1+u}{2})). For each observation, one computes a normalized score

ξi=siJ(ri+i+1)/νi,\xi_i = s_i J\left(\frac{r_i^+}{i+1}\right) / \nu_i,

where νi2=(1/i)j=1iJ2(j/(i+1))\nu_i^2=(1/i)\sum_{j=1}^i J^2(j/(i+1)). The CUSUM is then updated recursively, e.g.,

Di+=max{0,Di1++ξiζ},D_i^+ = \max\{0, D_{i-1}^+ + \xi_i - \zeta\},

with reference constant ζ\zeta and threshold hh chosen to yield the desired in-control average run length (ARL). These procedures are both self-starting (no pre-estimate of variance needed) and distribution-free under symmetry (Lombard et al., 2017, Zyl et al., 2019).

SSR-SR schemes similarly employ the recursion

Sn=(1+Sn1)exp(θXnψ(θ)),S_n = (1 + S_{n-1})\exp(\theta X_n^* - \psi(\theta)),

with ψ(θ)\psi(\theta) being the log moment-generating function, and signal when Sn>hS_n > h (Zyl et al., 2019).

3. Distributional Approximation and Tied Data Handling

For untied data, the null distribution of signed-rank statistics is exactly known or Normal-approximable. When rounding or limited device resolution induces ties, signed-rank procedures must explicitly accommodate the increased occurrence of zero differences (sk=0s_k=0). For the Shewhart Signed-Rank Control Chart (SS-RCC), the tied case is handled as follows (Mortezanejad et al., 26 Mar 2025):

  • Define the effective nonzero sample size NN, distributed as Binomial(n,1p0)(n,1-p_0).
  • Remove zero-difference points and recompute ranks among remaining NN data.
  • Moments of the tied-statistic Wt,NW_{t,N} are derived by conditional expectation on NN.

To approximate the null distribution of Wt,NW_{t,N}, a Scaled-Normal Distribution (SND) is constructed,

fW(s)s0.25s+0.25aφ(xbc)dx,f_W(s) \approx \int_{s-0.25}^{s+0.25} a\varphi\left(\frac{x-b}{c}\right) dx,

where (a>0,b,c)(a > 0, b, c) are parameters set to match the mean, variance, skewness, and kurtosis of Wt,NW_{t,N}. Because direct moment-matching is computationally intensive for arbitrary tie patterns and data distributions, recent work applies deep learning to estimate (a,b,c)(a,b,c) rapidly for new configuration settings (Mortezanejad et al., 26 Mar 2025). An 11-feature MLP, trained on simulated tied-data scenarios across Johnson family distributions, yields sub-percent test relative errors for SND parameter estimation. Applicability remains contingent on sample sizes and tie structures seen during training.

4. Control Limit Design and Average Run Length Analysis

Control limits are set to assure a pre-specified type-I error rate α\alpha, typically leading to an in-control ARL of 370 for 3σ3\sigma-like limits. Formally,

α=P(W>Cin-control)=1[F(C)F(C)],\alpha = P(|W| > C \mid \text{in-control}) = 1 - [F(C) - F(-C)],

where FF is the (scaled) normal cdf (untied case) or the SND-based cdf (tied case) (Mortezanejad et al., 26 Mar 2025). The ARL is ARL0=1/αARL_0 = 1/\alpha in-control, and ARL1=1/(1β)ARL_1 = 1/(1-\beta) out-of-control, where β\beta is the probability that WC|W| \leq C under a shifted process. For sequential charts, ARL calculations depend on the recursion law and can be obtained via Monte Carlo or analytical normal approximations (Lombard et al., 2017, Zyl et al., 2019). The SSR-SR and SSR-CUSUM charts' run length distributions are distribution-free for continuous underlying distributions and symmetric score functions.

Empirical studies across Normal and heavy-tailed Johnson distributions show that, for n=20,50n=20,50 and moderate tie rates t=0.05t=0.05–$0.2$, the SS-RCC preserves ARL0370_0\approx370–623 and exhibits greater sensitivity to small shifts under ties: e.g., for shift δ=±0.1\delta=\pm0.1 and n=50n=50, ARL1_1(TO)12\approx12 vs ARL1_1(UO)45\approx45 (Mortezanejad et al., 26 Mar 2025).

5. Comparative Performance and Practical Recommendations

Rank-based charts demonstrate several practical advantages:

  • Superior robustness to non-normality and outlier contamination over parametric Shewhart, CUSUM, and EWMA charts, with guaranteed nominal ARL0_0 under minimal assumptions.
  • Enhanced detection performance in the presence of ties due to explicit statistical modeling.
  • For SSR-CUSUM and SSR-SR schemes, nearly equal or better out-of-control ARL compared to normal-theory charts for small to moderate shifts; the SSR-SR scheme is particularly efficient for small persistent shifts (Lombard et al., 2017, Zyl et al., 2019).
  • Simpler implementation and self-starting properties obviating the need for variance or higher-moment estimation.

Limitations exist where the data properties diverge sharply from conditions seen in training for deep learning parameterizations, or in cases of extreme tie prevalence (p01p_0\to1). The need to retrain or revalidate deep models is underscored when sampling scope or process conditions shift substantially (Mortezanejad et al., 26 Mar 2025).

Key practitioner recommendations include:

  • Prefer SS-RCC or SSR-based charts whenever normality is questionable or measurement rounding is non-negligible.
  • Validate the deep learning-based distributional approximations via simulation prior to routine control monitoring.
  • Periodically reassess device resolution and retrain machine-learning models as process or instrumentation evolves.
  • For very small persistent shifts, supplement with CUSUM or composite detection diagnostics (Mortezanejad et al., 26 Mar 2025).

Beyond classical signed-rank and sequential rank charts, ranked set sampling (RSS) and its extensions offer alternative means to incorporate order information when direct measurement is expensive or error-prone. Neoteric ranked set sampling (NRSS) control charts select kk observations per subgroup from k2k^2 candidates according to ranks derived from auxiliary variables (Silva et al., 2017). These sampling-based charts achieve improved ARL and signal detection over standard SRS and RSS charts, particularly under perfect or high-quality ranking. The variance structure is determined by theoretical or simulation-based order statistic moments, and imperfect ranking scenarios are explicitly addressed via simulation or Mallows-type error models.

Empirical evaluations across concrete-strength datasets with moderate auxiliary–interest correlation (ρ0.5\rho\approx0.5) show substantial ARL1_1 improvement of NRSS charts over classic approaches (Silva et al., 2017). A plausible implication is that, in settings where partial or proxy ordering is available, sampling-based rank methods can complement or outperform value-based and rank-based charts.

7. References and Key Literature

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Rank-Based Control Charts.