Lossy Common Information in Source Coding

Updated 31 January 2026
  • Lossy common information is a measure that quantifies the minimal rate of a common message in Gray–Wyner networks under prescribed distortion constraints.
  • It extends Wyner’s and Gács–Körner’s frameworks by incorporating fidelity requirements, offering insights into rate trade-offs for both discrete and Gaussian sources.
  • Practical implementations leverage explicit constructions like polar codes and learnable neural codecs, enabling distributed representation learning in modern signal processing.

Lossy common information generalizes classical information-theoretic characterizations of shared structure among correlated sources to contexts involving fidelity constraints, specifically within the Gray–Wyner network. It quantifies the minimal required rate of a common message that, combined with optimal private side-channels, enables reconstruction of the sources under prescribed distortion levels. This operationalizes the concept of “commonality” in lossy multiterminal source coding and allows for rigorous analyses of rate trade-offs in both discrete and continuous (notably, Gaussian) settings. The framework subsumes both Wyner’s and Gács–Körner’s common information as extremes, delineates plateau regions where common information is distortion-invariant, and now extends to learnable architectures for distributed representation learning in signal processing and machine learning.

1. Fundamental Definitions and Gray–Wyner Network Model

The Gray–Wyner network models two or more correlated sources $X_1, X_2, \ldots, X_N$, which are compressed by an encoder into a common message and private messages $S_i$, with rates $R_0$ and $R_i$ respectively. Each decoder reconstructs its respective target using the common and private messages, subject to per-letter distortion constraints $D_i$. The achievable rate region $\mathcal{R}_{GW}(D_1, D_2)$ is characterized by the existence of an auxiliary variable $U$ and reconstructions $(\hat X, \hat Y)$ such that

$$R_0 \geq I(X, Y; U), \quad R_1 \geq I(X; \hat X \mid U), \quad R_2 \geq I(Y; \hat Y \mid U),$$

with $\mathbb{E}[d_X(X, \hat X)] \leq D_1$ and $\mathbb{E}[d_Y(Y, \hat Y)] \leq D_2$ (Viswanatha et al., 2014, Andrade et al., 29 Jan 2026). The sum-rate minimization $R_0 + R_1 + R_2$ is fundamentally linked to the joint rate-distortion function $R_{XY}(D_1, D_2)$.
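
The three mutual-information bounds above can be evaluated numerically from any candidate joint distribution over $(X, Y, U, \hat X, \hat Y)$. The following NumPy sketch is illustrative only (the function names and the array-axis convention are ours, not from the cited papers):

```python
import numpy as np

def entropy(p):
    """Shannon entropy in bits of a (possibly multi-dimensional) pmf array."""
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def cond_mutual_info(p_abc):
    """I(A;B|C) in bits for a pmf array indexed [a, b, c]."""
    return (entropy(p_abc.sum(axis=1)) + entropy(p_abc.sum(axis=0))
            - entropy(p_abc) - entropy(p_abc.sum(axis=(0, 1))))

def gray_wyner_lower_bounds(p):
    """Lower bounds (R0, R1, R2) for a joint pmf array indexed [x, y, u, xh, yh]."""
    # R0 >= I(X,Y;U): merge (X,Y) into one axis, then I((X,Y);U) = H(XY) + H(U) - H(XY,U).
    p_xyu = p.sum(axis=(3, 4))
    p_w_u = p_xyu.reshape(-1, p_xyu.shape[2])
    r0 = entropy(p_w_u.sum(axis=1)) + entropy(p_w_u.sum(axis=0)) - entropy(p_w_u)
    # R1 >= I(X; Xhat | U) and R2 >= I(Y; Yhat | U).
    r1 = cond_mutual_info(np.transpose(p.sum(axis=(1, 4)), (0, 2, 1)))  # axes [x, xh, u]
    r2 = cond_mutual_info(np.transpose(p.sum(axis=(0, 3)), (0, 2, 1)))  # axes [y, yh, u]
    return r0, r1, r2
```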

Wyner’s lossy common information $C_W(X, Y; D_1, D_2)$ is defined as the minimum possible common rate $R_0$ such that the total coding rate equals the joint rate-distortion bound:

$$C_W(X, Y; D_1, D_2) = \inf \{ R_0 : \exists\, R_1, R_2 \ \text{with}\ (R_0, R_1, R_2) \in \mathcal{R}_{GW}(D_1, D_2), \ R_0 + R_1 + R_2 = R_{XY}(D_1, D_2) \}.$$

This infimum is achieved under the Markov constraints $(X, Y) - (\hat X, \hat Y) - U$ and $\hat X \leftrightarrow U \leftrightarrow \hat Y$, with $(\hat X, \hat Y)$ optimal for $R_{XY}(D_1, D_2)$ (Viswanatha et al., 2014, Andrade, 6 Jul 2025, Andrade et al., 29 Jan 2026).

2. Lossy Extensions: Wyner and Gács–Körner Notions

The two dominant notions—Wyner’s and Gács–Körner’s—are extended to lossy settings via distinct operational criteria in the Gray–Wyner region:

  • Wyner’s Lossy Common Information: Corresponds to the operating point achieving minimum sum transmit rate. The single-letter characterization is:

$$C_W(X, Y; D_1, D_2) = \inf_{P} I(X, Y; U)$$

where the infimum is over joint distributions $P$ satisfying the Markov and optimality constraints above (Viswanatha et al., 2014, Xu et al., 2013, Andrade et al., 29 Jan 2026).

  • Gács–Körner’s Lossy Common Information: Maximizes the extractable common rate when each source is encoded at its individual rate-distortion bound. The characterization is:

$$C_{GK}(X, Y; D_1, D_2) = \sup_{Q} I(X, Y; V)$$

subject to $P(\hat X \mid X)$ (resp. $P(\hat Y \mid Y)$) achieving $R_X(D_1)$ (resp. $R_Y(D_2)$), and appropriate Markov constraints (Viswanatha et al., 2014, Andrade, 6 Jul 2025, Andrade et al., 29 Jan 2026).

The relationship between these quantities and the mutual information of the reconstructed variables $(\hat Z_1, \hat Z_2)$ is bounded as

$$C_{GK}(X_1, X_2; D_1, D_2) \leq I(\hat Z_1; \hat Z_2) \leq C_W(X_1, X_2; D_1, D_2),$$

with equality throughout only when a "perfect common part" $W$ exists, separating all mutual dependence (Andrade, 6 Jul 2025).

3. Rate-Distortion Characterization and Plateaus

The solution to the optimization for $C_W$ can exhibit a plateau: for distortions $(D_1, D_2)$ within a nontrivial region, the lossy common information is constant and coincides with the lossless (zero-distortion) Wyner common information. That is,

$$C_W(X, Y; D_1, D_2) = C_W(X, Y)$$

so long as $(D_1, D_2)$ are sufficiently small (the so-called "Wyner plateau") (Xu et al., 2013, Shi et al., 2016, Charalambous et al., 2019). Outside this region, $C_W(X, Y; D_1, D_2)$ generally decreases as the distortions grow, and it can vanish if the sources are effectively uncorrelated at the required resolution.

For a pair of unit-variance Gaussian sources with correlation coefficient $\rho$, this plateau is explicit: for $D_i \leq 1 - \rho$, $C_W(X, Y; D_1, D_2) = \frac{1}{2}\log\frac{1+\rho}{1-\rho}$ (Xu et al., 2013, Charalambous et al., 2019, Shi et al., 2016). For multivariate Gaussian sources, the explicit canonical-variable construction and weak-realization theory provide a complete parametrization of conditional-independence-inducing latent variables $W$, and a closed-form expression for the minimal common rate in the quadratic-Gaussian case (Charalambous et al., 2019).
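
As a worked example (assuming the standard unit-variance normalization and rates measured in bits), taking $\rho = 0.5$ and any distortions $D_1, D_2 \leq 0.5$ gives the plateau value

$$C_W(X, Y; D_1, D_2) = \tfrac{1}{2}\log_2\frac{1 + 0.5}{1 - 0.5} = \tfrac{1}{2}\log_2 3 \approx 0.79 \ \text{bits}.$$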

4. Operational and Structural Properties

Lossy common information precisely characterizes the boundary between efficient joint compression and source-specific refinements. The transmit rate $R_t = R_0 + R_1 + R_2$ is minimized at the Wyner operating point, while the receive rate $R_r = 2R_0 + R_1 + R_2$ is minimized at the Gács–Körner point. The transmit–receive trade-off is continuous across the Gray–Wyner region; $C_W$ and $C_{GK}$ represent its extremes (Viswanatha et al., 2014, Andrade et al., 29 Jan 2026).
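
To make the two operating points concrete, the following small helper (an illustrative sketch with names of our choosing, not code from the cited papers) selects the minimum-transmit-rate and minimum-receive-rate triples from a finite set of achievable rate triples:

```python
def operating_points(triples):
    """Given achievable (R0, R1, R2) triples, return the Wyner point
    (minimum transmit rate R0+R1+R2) and the Gacs-Korner point
    (minimum receive rate 2*R0+R1+R2)."""
    wyner = min(triples, key=lambda r: r[0] + r[1] + r[2])
    gacs_korner = min(triples, key=lambda r: 2 * r[0] + r[1] + r[2])
    return wyner, gacs_korner

# Example with made-up candidate points on a Gray-Wyner boundary.
print(operating_points([(0.5, 0.3, 0.3), (0.2, 0.5, 0.5), (0.8, 0.1, 0.1)]))
```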

Key theorems establish:

  • Convexity and monotonicity of the common information as a function of "excess rate";
  • The operational significance of the Pangloss plane and its intersection with the Gray–Wyner region as yielding $C_W$;
  • The necessity of certain Markov factorizations among $(X, Y, \hat X, \hat Y, U)$ for achievability (Viswanatha et al., 2014, Andrade, 6 Jul 2025).

For lossless sources, $C_{GK}(X, Y) \leq I(X; Y) \leq C_W(X, Y)$, with equality throughout when all shared information can be deterministically separated (Andrade, 6 Jul 2025).
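
A concrete check of this ordering uses the doubly symmetric binary source (DSBS) with crossover probability $a_0 < 1/2$, for which $C_{GK} = 0$, $I(X;Y) = 1 - h(a_0)$, and Wyner's classical closed form gives $C_W = 1 + h(a_0) - 2h(a_1)$ with $a_1 = \tfrac{1}{2}(1 - \sqrt{1 - 2a_0})$; the formulas below are standard results stated here as a sketch, not taken from the cited papers:

```python
import numpy as np

def h2(p):
    """Binary entropy in bits."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return float(-p * np.log2(p) - (1 - p) * np.log2(1 - p))

a0 = 0.1                                # Pr[X != Y] for the DSBS
a1 = 0.5 * (1 - np.sqrt(1 - 2 * a0))    # X = U xor N1, Y = U xor N2, N_i ~ Bern(a1)

c_gk = 0.0                              # the DSBS has no nontrivial common part
mi = 1 - h2(a0)                         # I(X;Y)
c_w = 1 + h2(a0) - 2 * h2(a1)           # Wyner's closed form for the DSBS

assert c_gk <= mi <= c_w
print(f"C_GK = {c_gk:.3f} <= I(X;Y) = {mi:.3f} <= C_W = {c_w:.3f}")
```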

5. Explicit Constructions and Computation

Polar codes (for discrete sources) and polar lattices (for Gaussian sources) allow explicit extraction of Wyner's lossy common information (Shi et al., 2016). The strategy for the doubly symmetric binary source (DSBS) is to polar-quantize under the joint test channel, extract the common part as a high-entropy block, and compress the private deviations. In the Gaussian case, the problem reduces to optimal quantization of a single latent $W$; the common information plateaus for distortion levels below $1 - \rho$.

An explicit algorithm for the Gaussian case proceeds as follows; a code sketch of these steps is given after the list:

  1. Canonicalization via the Hotelling SVD.
  2. Parameter extraction: $D = \operatorname{diag}(d_1, \ldots, d_n)$.
  3. Check $D_i \leq n(1 - d_1)$.
  4. Compute $C_W = \frac{1}{2}\sum_j \log\frac{1 + d_j}{1 - d_j}$ (Charalambous et al., 2019).
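
A minimal NumPy sketch of steps 1, 2, and 4 follows (assuming the distortion check of step 3 already holds and reporting $C_W$ in bits); it is an illustration under these assumptions, not the exact procedure of the cited paper:

```python
import numpy as np

def wyner_ci_gaussian(sigma_xx, sigma_yy, sigma_xy):
    """Wyner common information (bits) for jointly Gaussian vectors via the
    canonical correlations d_1, ..., d_n; valid on the distortion plateau."""
    # Step 1: whiten each source (Hotelling canonical form) and take the SVD of
    # the whitened cross-covariance; its singular values are the canonical correlations.
    lx = np.linalg.cholesky(sigma_xx)
    ly = np.linalg.cholesky(sigma_yy)
    m = np.linalg.solve(lx, np.linalg.solve(ly, sigma_xy.T).T)
    # Step 2: D = diag(d_1, ..., d_n).
    d = np.clip(np.linalg.svd(m, compute_uv=False), 0.0, 1.0 - 1e-12)
    # Step 4: closed-form sum over canonical correlations.
    return float(0.5 * np.sum(np.log2((1 + d) / (1 - d))))

# Scalar sanity check: correlation 0.8 gives (1/2) log2(1.8 / 0.2) ~ 1.585 bits.
print(wyner_ci_gaussian(np.array([[1.0]]), np.array([[1.0]]), np.array([[0.8]])))
```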

The discrete Gaussian approximation and the explicit coding constructions are proven to achieve the common information to within a vanishing gap (Shi et al., 2016).

6. Learnable Networks and Applications

Recent advances operationalize Gray–Wyner theory via learnable neural codecs for multitask computer vision problems (Andrade et al., 29 Jan 2026). These architectures instantiate three-channel (common and private) codes with structured neural transforms and entropy models. The Lagrangian-relaxed loss jointly optimizes rate allocation and distortion, automatically discovering the optimal splitting of common and private rates as predicted by theory. Empirical results verify that the learned codes attain the predicted rate savings on transmit–receive frontiers, with shared channels saturating theoretical bounds in strong-dependence regimes. Noteworthy effects include:

  • Dominantly shared codes when input PMFs coincide,
  • Zero shared rate for independent tasks,
  • Adaptive bit allocation for mixed dependence.
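
To make the Lagrangian-relaxed training objective concrete, the following PyTorch sketch builds a toy three-branch codec with a crude Gaussian-entropy rate surrogate. It is a hypothetical illustration: the module names, layer sizes, and rate proxy are ours, not the architecture or entropy model of Andrade et al. (29 Jan 2026).

```python
import math
import torch
import torch.nn as nn

class GrayWynerCodec(nn.Module):
    """Toy three-branch (one common, two private) codec trained with a Lagrangian loss."""
    def __init__(self, dim_x, dim_y, dim_common=8, dim_private=8, hidden=64):
        super().__init__()
        def mlp(d_in, d_out):
            return nn.Sequential(nn.Linear(d_in, hidden), nn.ReLU(), nn.Linear(hidden, d_out))
        self.enc_common = mlp(dim_x + dim_y, dim_common)            # common message U
        self.enc_x, self.enc_y = mlp(dim_x, dim_private), mlp(dim_y, dim_private)
        self.dec_x = mlp(dim_common + dim_private, dim_x)
        self.dec_y = mlp(dim_common + dim_private, dim_y)

    @staticmethod
    def rate_proxy(z):
        # Crude differentiable rate surrogate: Gaussian entropy of each latent dim (bits).
        var = z.var(dim=0) + 1e-6
        return 0.5 * torch.log2(2 * math.pi * math.e * var).sum()

    def forward(self, x, y, lam=(1.0, 1.0)):
        u = self.enc_common(torch.cat([x, y], dim=-1))
        sx, sy = self.enc_x(x), self.enc_y(y)
        x_hat = self.dec_x(torch.cat([u, sx], dim=-1))
        y_hat = self.dec_y(torch.cat([u, sy], dim=-1))
        r0, r1, r2 = self.rate_proxy(u), self.rate_proxy(sx), self.rate_proxy(sy)
        d1, d2 = ((x - x_hat) ** 2).mean(), ((y - y_hat) ** 2).mean()
        # Lagrangian-relaxed Gray-Wyner objective: total rate plus weighted distortions.
        loss = r0 + r1 + r2 + lam[0] * d1 + lam[1] * d2
        return loss, {"R0": r0, "R1": r1, "R2": r2, "D1": d1, "D2": d2}
```

In practice the cited work uses learned entropy models rather than this Gaussian surrogate; the sketch only illustrates how the common and private rate terms enter the objective alongside the two distortion terms.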

Lossy common information interrelates with multiple research axes:

  • Limited common randomness: The minimum common-randomness rate for constrained distortion, single-letter achievable region, and its optimization as a convex program (Saldi et al., 2014).
  • Mutual information bounds: $I(\hat Z_1; \hat Z_2)$ forms a tight sandwich between the lossy Wyner and Gács–Körner common informations for all achievable reconstructions (Andrade, 6 Jul 2025).
  • Generalizations: Extensions to $N$-tuples, arbitrary alphabets, and output distribution constraints, with the unified perspective of the Gray–Wyner rate region (Xu et al., 2013).
  • Unified transmit/receive trade-off: The locus of achievable $(R_0, R_1, R_2)$ traces contours on the Gray–Wyner surface, interpolating between fully-shared and fully-private extreme points (Viswanatha et al., 2014, Andrade et al., 29 Jan 2026).

Table: Summary of Characterizations

Notion | Definition | Markov constraints
Lossy Wyner CI $C_W$ | $\inf I(X, Y; U)$ | $(X, Y) - (\hat X, \hat Y) - U$, $\hat X - U - \hat Y$
Lossy Gács–Körner CI $C_{GK}$ | $\sup I(X, Y; V)$ | $Y - X - V$, $X - Y - V$, $X - \hat X - V$, $Y - \hat Y - V$
Mutual information bound | $C_{GK} \leq I(\hat Z_1; \hat Z_2) \leq C_W$ | N/A

Wyner’s and Gács–Körner’s notions represent fundamental bounds in multiterminal source coding and are critical for understanding redundancy, sequential refinability, and practical codec design, in both classical and modern machine learning systems. Their generalizations to arbitrary sources, distortion regimes, and learnable representations continue to inform theoretical analysis and applied algorithm development across several disciplines.
