Papers
Topics
Authors
Recent
Search
2000 character limit reached

Propagation of Chaos in Rényi Divergences

Updated 22 January 2026
  • The paper establishes a sharp O(d q²/N²) convergence rate for propagation of chaos using log-Sobolev inequalities and a Donsker–Varadhan splitting approach.
  • It extends the entropic framework beyond KL divergence by rigorously characterizing fluctuations and concentration phenomena in weakly interacting diffusions.
  • The methodology relies on strong isoperimetry and weak interaction assumptions, with illustrative Gaussian examples confirming the optimality of the derived scaling laws.

Propagation of chaos in Rényi divergences quantifies the extent to which finite-particle approximations to mean-field interacting diffusion systems decorrelate as the system size NN grows, using the qq-Rényi divergence as a measure of marginal independence. Recent work establishes sharp O(dq2/N2)O(d q^2/N^2) rates for the stationary measures of weakly interacting diffusions, extending the entropic framework previously developed for Kullback–Leibler (KL) divergence. This approach rigorously characterizes fluctuations and concentration phenomena in high-dimensional and high-particle-number regimes, under conditions of strong isoperimetry and weak particle interaction (Zhang, 15 Jan 2026).

1. qq-Rényi Divergence and Its Role

For probability measures μ,ν\mu,\nu on Rd\mathbb{R}^d with μν\mu\ll\nu and real q>1q > 1, the qq-Rényi divergence is defined as

Rq(μν)=1q1log(Eν[(dμdν)q]).\mathsf R_q(\mu\Vert\nu) = \frac{1}{q-1} \log\Bigl( \mathbb E_\nu\Big[(\frac{d\mu}{d\nu})^q \Big] \Bigr).

Setting ρ=dμ/dν\rho = d\mu/d\nu, this simplifies to

Rq(μν)=1q1log(ν[ρq]),limq1Rq(μν)=KL(μν).\mathsf R_q(\mu\Vert\nu) = \frac{1}{q-1} \log(\nu[\rho^q]), \quad \lim_{q \downarrow 1} \mathsf R_q(\mu\Vert\nu) = \mathrm{KL}(\mu\Vert\nu).

Rényi divergences interpolate between various risk-sensitive divergences and play a central role in non-asymptotic information-theoretic analysis. In the context of interacting diffusions, propagation of chaos in Rényi divergence precisely captures the rate at which empirical marginals approach statistical independence as NN \to \infty.

2. Main Theorem: Sharp Rényi Propagation of Chaos Rate

For an NN-particle interacting diffusion system at stationarity, let μ[k]\mu^{[k]} denote the kk-marginal of the stationary law and π\pi the mean-field Gibbs minimizer. The main result asserts that, under specified regularity and weak interaction assumptions, there exist constants C,c>0C,c > 0—independent of d,N,qd,N,q—such that

NΩ~(dq)    Rq(μ1π)CO~(dq2N2),N \ge \widetilde\Omega(\sqrt{d}\, q) \implies \mathsf R_q(\mu^1 \Vert \pi) \le C\, \widetilde O\left(\frac{d q^2}{N^2}\right),

where hidden constants depend only on the log-Sobolev constant CLSIC_{\rm LSI} and interaction smoothness parameter βW\beta_W, and O~\widetilde O notation suppresses polylogarithmic terms. The result establishes an optimal O(dq2/N2)O(d q^2/N^2) rate for first marginals; the arguments also yield bounds for general kk-marginals of the form O(dk3q2/N2)O(d k^3 q^2/N^2) under analogous conditions.

3. Technical Conditions and Setting

The result holds for stationary solutions of interacting diffusion systems characterized by:

  • Potentials and Interactions: Confinement potential V:RdRV:\mathbb R^d \to \mathbb R and interaction kernel W:RdRW:\mathbb R^d \to \mathbb R, with mean-field Gibbs law πexp(VWπ)\pi \propto \exp(-V - W*\pi).
  • Assumption 2.1 (Smoothness): W(x)W(y)βWxy\|\nabla W(x) - \nabla W(y)\| \le \beta_W\|x-y\| for all x,yRdx,y \in \mathbb R^d (interaction gradient Lipschitz with parameter βW\beta_W).
  • Assumption 2.2 (Uniform log-Sobolev/Isoperimetry): There exists CˉLSI<\bar C_{\rm LSI} < \infty such that for all fCcf \in C_c^\infty,

Entπ(f)CˉLSIEπ[f2],\mathrm{Ent}_\pi(f) \le \bar C_{\rm LSI} \mathbb E_\pi[\|\nabla f\|^2],

and this holds uniformly for all conditional measures of μ1:N\mu^{1:N}.

  • Assumption 2.3 (Very Weak Interaction): βWCˉLSI1\beta_W \bar C_{\rm LSI} \ll 1; i.e., the interaction is a small perturbation on the log-Sobolev scale.

These assumptions guarantee uniqueness of the relevant Gibbs measures and well-posedness of both McKean–Vlasov and NN-particle stationary dynamics.

4. Proof Structure and Key Components

The analysis proceeds through several layers:

  1. LSI-to-Rényi Inequality: For any ν\nu satisfying a log-Sobolev inequality, Lemma 3.1 shows

Rq(μν)qCLSI2RF ⁣Iq(μν),\mathsf R_q(\mu\Vert\nu) \le \frac{q\,C_{\rm LSI}}{2} \mathsf{RF\!I}_q(\mu\Vert\nu),

where the Rényi–Fisher information is

RF ⁣Iq(μν)=qEμ[ψqlogρ2],ψqρq1.\mathsf{RF\!I}_q(\mu\Vert\nu) = q\, \mathbb E_\mu\Bigl[\psi^q\,\|\nabla\log\rho\|^2\Bigr], \quad \psi^q \propto \rho^{q-1}.

  1. Donsker–Varadhan Splitting: The Rényi–Fisher information can be decomposed as

RF ⁣IqA+B+C,\mathsf{RF\!I}_q \le A+B+C,

with

A=FI(μν) B=KL(P~μ) C=logEμ[eζ]\begin{array}{rl} A & = \mathrm{FI}(\mu\Vert\nu) \ B & = \mathrm{KL}(\widetilde P \Vert \mu) \ C & = \log \mathbb E_\mu[e^{\zeta}] \end{array}

where P~\widetilde P is a tilted law and ζ\zeta captures variance-type fluctuation.

  1. Propagation-of-Chaos in Fisher Information: Lemma 3.4 yields

FI(μ[k]πk)=O(dk3N2)\mathrm{FI}(\mu^{[k]}\Vert\pi^{\otimes k}) = O\left(\frac{d\,k^3}{N^2}\right)

under stationarity.

  1. Control of Tilted-KL Term: Applying a second LSI-to-Rényi type argument:

KL(P~μ)(q1)CLSIRF ⁣Iq12RF ⁣Iq+O(dk3q2N2),\mathrm{KL}(\widetilde P\Vert\mu) \lesssim (q-1)C_{\rm LSI} \mathsf{RF\!I}_q \le \tfrac{1}{2} \mathsf{RF\!I}_q + O\left(\frac{d\,k^3 q^2}{N^2}\right),

absorbing half of the RF ⁣Iq\mathsf{RF\!I}_q term to the left.

  1. Exponential Moment Bound: Via concentration of Lipschitz functionals under the stationary law and LSI,

logEμ[k][eζ]=O~(dk3q2N2)\log \mathbb E_{\mu^{[k]}}[e^\zeta] = \widetilde O\left(\frac{d\,k^3 q^2}{N^2}\right)

with a hierarchical coupling controlling the relevant Lipschitz constant as O(k/N)O(k/N).

Aggregating these, one concludes the sharp

Rq=O~(dk3q2N2)\mathsf R_q = \widetilde O\left(\frac{d\,k^3 q^2}{N^2}\right)

for general kk-marginals, so for k=1k=1,

Rq(μ1π)=O(dq2N2).\mathsf R_q(\mu^1\Vert\pi) = O\left(\frac{d q^2}{N^2}\right).

5. Dependence on System Parameters and Sharpness

The established bound reveals several sharp regimes:

  • Scaling Laws: The bound scales as q2/N2q^2/N^2 up to polylog factors. In the Gaussian case, the 1/N21/N^2 and dd dependences are sharp, with an explicit expansion yielding Rq=Θ(dq/N2)\mathsf R_q = \Theta(dq/N^2).
  • Optimality: The only apparent suboptimality is the q2q^2 dependence; the true scaling is conjectured to be linear in qq.
  • Reduction to KL: In the limit q1q\downarrow 1, the bound recovers the O(1/N2)O(1/N^2) KL divergence propagation-of-chaos law.
  • Concentration Transfer: The corollary

μ[k](A)2[πk(A)]1/2\mu^{[k]}(A) \le 2\, [\pi^{\otimes k}(A)]^{1/2}

for Ndk3/2N \gtrsim \sqrt{d}\,k^{3/2} demonstrates high-probability transfer of concentration phenomena.

The stationary setting is essential for (i) closed-form score functions, (ii) uniform application of LSI to all conditional marginals, and (iii) sub-Gaussian concentration under the stationary law. Extensions to time-dependent (non-stationary) dynamics remain open.

6. Illustrative Gaussian Example

Consider V(x)=12x2V(x) = \frac{1}{2}\|x\|^2, W(x)=λ2x2W(x) = \frac{\lambda}{2}\|x\|^2. The stationary law μ1:N\mu^{1:N} is Gaussian with explicit block-covariance. For NkN \gg k,

Rq(μ1:kπk)=dqλ24(1+λ)2k[k(1+λ)2(2λ+1)]N2+O(dN3).\mathsf R_q\left(\mu^{1:k}\Vert\pi^{\otimes k}\right) = \frac{d\,q\,\lambda^2}{4(1+\lambda)^2}\, \frac{k[k(1+\lambda)^2-(2\lambda+1)]}{N^2} + O(d N^{-3}).

For k=1k=1, this results in Rq(μ1π)=Θ(dq/N2)\mathsf R_q(\mu^1\Vert\pi) = \Theta(d q / N^2), certifying the optimality of the 1/N21/N^2 and dd dependence, with the only discrepancy in the qq exponent relative to the general bound.

7. Broader Context and Implications

Propagation of chaos in Rényi divergence generalizes classical mean-field limit results beyond KL or Wasserstein metrics, providing a fine-grained, risk-sensitive quantification of independence in high-dimensional particle systems. The entropic and functional-analytic approach employed relies crucially on log-Sobolev inequalities, Donsker–Varadhan large deviations, and hierarchical couplings, synthesizing ideas from kinetic theory, concentration of measure, and information theory. These results further clarify the rates and mechanisms underlying empirical measure convergence in statistical physics and related domains (Zhang, 15 Jan 2026). High-dimensional sharpness and explicit scaling laws lay the foundation for subsequent investigations into temporal dynamics and non-stationary extensions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Propagation of Chaos in Rényi Divergences.