MDS Symbol-Pair Codes

Updated 10 February 2026

MDS symbol-pair codes are error-correcting codes designed for channels that read overlapping symbol pairs, achieving the Singleton-type bound.
They are constructed using methods such as classical MDS, repeated-root cyclic, simple-root cyclic, and matrix-product techniques to optimize pair-distance.
Their design supports efficient decoding and improved error correction in storage and communication systems by addressing unique pair-error challenges.

A maximum distance separable (MDS) symbol-pair code is a class of error-correcting code defined for channels that read overlapping symbol pairs rather than individual symbols. Such codes achieve the largest possible minimum pair-distance for prescribed length and dimension, meeting the Singleton-type bound. MDS symbol-pair codes combine combinatorial, algebraic, and geometric coding-theoretic techniques and generalize classical MDS codes to address the unique challenges of pair-error correction in practical storage and communication systems.

1. Symbol-Pair Channel Model and Metric

A symbol-pair channel outputs for each transmitted word $x=(x_0,\,x_1,\,\ldots,\,x_{n-1})$ the sequence of pairs $T(x) = \bigl((x_0,x_1),\,(x_1,x_2),\,\ldots,\,(x_{n-1},x_0)\bigr)$ , using indices modulo $n$ (Chee et al., 2012, Ma et al., 2019). The symbol-pair distance between $x,y\in\mathbb{F}_q^n$ is defined as

$d_p(x,y) = |\{\,i : (x_i,x_{i+1}) \neq (y_i, y_{i+1})\,\}|,$

and the symbol-pair weight of $x$ is $w_p(x) = d_p(x,0) = |\{\,i : (x_i,x_{i+1})\ne(0,0)\}|$ . The minimum symbol-pair distance of a code $C$ is $d_p(C) = \min_{x\neq y\in C} d_p(x,y)$ .

Pair-distance satisfies $d_H(x,y)+1\le d_p(x,y)\le 2d_H(x,y)$ for $0<d_H(x,y)<n$ ; if $C$ has minimum pair-distance $D$ , then $C$ can correct up to $\lfloor (D-1)/2\rfloor$ pair-errors.

2. Singleton-Type Bound and Definition of MDS Symbol-Pair Codes

The Singleton-type bound for symbol-pair codes [Chee et al., (Chee et al., 2012)]: $|C|\leq q^{n-d_p+2}$ for length $n$ , alphabet size $q$ , and minimum pair-distance $d_p$ . For linear $[n,k]$ codes, this gives $d_p\leq n-k+2$ . A symbol-pair code is called MDS if equality holds: $d_p = n-k+2$ and $|C| = q^{n-d_p+2}$ in the non-linear case. MDS symbol-pair codes thus maximize the error-correction capability for the pair-metric and generalize classical MDS codes (Ma et al., 2019, Chee et al., 2012).

3. Algebraic Constructions and Families of MDS Symbol-Pair Codes

A broad array of algebraic constructions yields MDS symbol-pair codes:

Classical MDS Codes: Any $[n,k,d=n-k+1]_q$ classical MDS code (e.g., Reed–Solomon, extended RS, Goppa codes) is an MDS symbol-pair code, with $d_p=n-k+2$ (Chee et al., 2012, Ma et al., 2019).
Repeated-Root Cyclic Codes: Families of cyclic and constacyclic codes with generator polynomials involving repeated roots (over $\mathbb{F}_q[x]/(x^{\ell p}-1)$ for primes $p$ ) give MDS symbol-pair codes, particularly for pair-distances $d_p=5,6,7,8$ (Ma et al., 2020, Ma et al., 2020, Tang et al., 2022, Tang et al., 2022). Explicit classification for degree $\leq 10$ is available in length $3p$.
Simple-Root Cyclic Codes: Parameter-optimized generator polynomials over $\mathbb{F}_q[x]/(x^n-1)$ provide infinite families with maximal attainable $n$ for given $d_p$ ; e.g., $n=4q+4$ with $d_p=7$ for $q\equiv 1\pmod{4}$ , or $n=2q+2$ with $d_p=9$ for $q$ odd (Qiu et al., 26 Mar 2025).
Matrix-Product Codes: Permuted matrix-product codes with underlying nonsingular-by-columns matrices yield families with high $d_p$ , such as $d_p=8,10$ for lengths $3n,4n$ over $q\equiv 1~\mathrm{mod}~3,4$ respectively (Zheng et al., 2024, Xu et al., 2023).
Projective and Geometric Constructions: Ovoids in projective spaces and functional AG codes from elliptic curves furnish $q$ -ary MDS symbol-pair codes of length up to $q^2+1$ ( $d_p=5,6$ ) or $q+\lfloor2\sqrt{q}\rfloor+\delta(q)-3$ ( $d_p\ge 7$ ) (Ding et al., 2016).
Constacyclic Codes and Chain Rings: Specific $\lambda$ -constacyclic codes of length $n p^s$ over finite fields and chain rings admit a complete characterization of which generator polynomials achieve the MDS bound for the pair metric (Tang et al., 2021).

A selection of families and their parameters is given below.

Family/type	Length $n$	Pair-distance $d_p$	Field(s)
Reed–Solomon/RS (classical)	$n\leq q+1$	$n-k+2$	Any $\mathbb{F}_q$
Repeated-root cyclic	$\ell p, 3p$	$5,6,7,8,9,10,12$	$p$ odd
Simple-root cyclic	$4q\pm 4, 2q+2$	$7,8,9$	$q\equiv 1,3\pmod{4}$
Matrix-product (with permutation)	$3n, 4n$	$8,10$	$q\equiv 1\pmod{3,4}$
AG from elliptic curves	$[7, q+\lfloor 2\sqrt{q}\rfloor+\delta-3]$	$d_p \ge 7$	$q$ any

4. Combinatorial and Structural Properties

The core property is that for every MDS symbol-pair code, the symbol-pair weight of a nonzero codeword $x$ satisfies

$w_H(x) + 1 \leq w_p(x) \leq 2w_H(x)$

with equality at the extremal cases. The symbol-pair weight distribution for $[n,k,d]_q$ MDS codes is computable in closed form—for $w \geq d+1$ : $B_w = 2 \sum_{i=1}^{M_1}\sum_{j=0}^{w-i-d} (-1)^j \binom{n-w+i-1}{i-1} \binom{w-i-1}{j} (q^{w-i+1-d-j} - 1) + \cdots$ where $M_1 = \min\{n-w+2, w-d\}$ , and similar for the second sum (Ma et al., 2019, Zhu et al., 2021).

The error-correction capability is ${\lfloor (d_p-1)/2 \rfloor}$ pair-errors (Ma et al., 2019). Full knowledge of the weight distribution enables estimation of error detection probabilities and informs the design of decoding algorithms tailored to the pair-metric.

5. Matrix-Theoretic and Geometric Criteria

Necessary and sufficient conditions for a linear $[n,k]$ code to be MDS in the symbol-pair metric can be phrased in terms of generator and parity-check matrices:

For symbol-pair ( $b=2$ ), $C$ is MDS if and only if for every set $J$ with $|J| = \max\{k-2,0\} + 1$ , the submatrix formed by all columns in the $2$-neighborhood of $J$ has full rank $k$ (Liu et al., 2021).
The parity-check matrix characterization involves the column rank of certain submatrices corresponding to sets of coordinates.

For some cyclic and constacyclic codes, characterization of the MDS property requires explicit exclusion of codewords with consecutive nonvanishing support or analysis of the roots of generator polynomials with respect to configured coordinate neighborhoods (Tang et al., 2021, Tang et al., 2022, Qiu et al., 26 Mar 2025).

6. Decoding, Extensions, and Open Problems

Error correction in the symbol-pair metric generalizes classical bounded-distance decoding, permitting adaptation of syndrome and list decoding variants. For Reed–Solomon and closely related codes, syndrome decoders can be modified to take into account the overlap in pair-read vectors (Ma et al., 2019). Matrix-product and cyclic code frameworks suggest further development of efficient decoding algorithms leveraging structural properties of the underlying algebra or geometry.

Current limitations include the upper bounds on achievable code lengths for fixed dimension and pair-distance, and the tightness of these bounds. For example, for pair-distance $d_p=7$ , the simple-root cyclic constructions in (Qiu et al., 26 Mar 2025) yield the longest known $q$ -ary MDS symbol-pair codes for many $q$ that are not prime. For higher $d_p$ , extending the geometric and cyclic algebraic methods is an open research direction (Ding et al., 2016).

The study of MDS symbol-pair codes also generalizes to $b$ -symbol MDS codes for $b>2$ : analogous Singleton-type bounds and algebraic characterizations hold, with structural properties reflecting generalizations of the symbol-pair metric (Liu et al., 2021, Xu et al., 2023).

7. Impact and Applications

MDS symbol-pair codes are central in the design of error-correcting systems for next-generation data storage and communication platforms where physical constraints or device limitations result in overlapping or low-resolution reads. Their algebraic and combinatorial richness—in particular, the possibility of attaining much longer lengths than classical MDS codes due to the symbol-pair metric—has driven both theoretical advances and motivates ongoing research into code families, decoding theory, and applications to practical nonvolatile storage architectures (Chee et al., 2012, Ding et al., 2016, Qiu et al., 26 Mar 2025).

References

(Chee et al., 2012) Chee et al., Maximum Distance Separable Codes for Symbol-Pair Read Channels
(Ma et al., 2019) Ma, Luo, Symbol-pair Weight Distributions of Some Linear Codes
(Ding et al., 2016) Ding, Ge, Zhang, New Constructions of MDS Symbol-Pair Codes
(Ma et al., 2020, Ma et al., 2020, Tang et al., 2022, Tang et al., 2022) (repeated-root cyclic code constructions)
(Qiu et al., 26 Mar 2025) New constructions of MDS symbol-pair codes via simple-root cyclic codes
(Zheng et al., 2024, Xu et al., 2023) (matrix-product code constructions)
(Zhu et al., 2021) Zhu, Liao, The $b$ -weight distribution for MDS codes
(Tang et al., 2021) A Characterization of MDS Symbol-pair Codes over Two Types of Alphabets