Papers
Topics
Authors
Recent
Search
2000 character limit reached

Erdős–Rényi Subgraph Pair Model

Updated 15 January 2026
  • The Erdős–Rényi Subgraph Pair Model is a formal framework that couples random graphs via subgraph extraction and vertex correspondence for network matching.
  • It establishes sharp information-theoretic phase transitions for both exact and partial recovery, guiding practical and adversarial network de-anonymization.
  • The model underpins studies in graph alignment and community detection using methods like brute-force MAP estimators and tail degree signature algorithms.

The Erdős–Rényi Subgraph Pair Model is a formal framework for studying random graphs coupled through subgraph extraction, vertex correspondence, and, crucially, for quantifying the information-theoretic limits of subgraph alignment and network matching. It forms the mathematical substrate for a wide array of statistical and computational analyses on network alignment, planted subgraph recovery, and correlated random graph pairs. This framework now plays a central role in the rigorous treatment of exact and partial graph alignment, particularly for assessing the feasibility and optimality of recovery in both practical and adversarial regimes (Shiu et al., 8 Jan 2026, Du, 17 Feb 2025, Bozorg et al., 2019).

1. Formal Model Definitions

1.1 Subgraph-Pair Model (Alignment/Recovery Setting)

Let nNn\in\mathbb{N}, m=m(n)<nm=m(n)<n, and p=p(n)[0,1]p=p(n)\in[0,1]. A base random graph GER(n,p)G\sim ER(n,p) is sampled on vertex set [n]={1,,n}[n]=\{1,\ldots,n\}. An mm-subset S[n]S\subset [n] is chosen uniformly; H=G[S]H=G[S] is the induced subgraph, which is then anonymized by a uniformly random permutation π:S[m]\pi:S\to[m] to produce HπH_\pi. The observer sees (G,Hπ)(G,H_\pi) but neither SS nor π\pi, and aims to recover SS (set recovery) and/or π\pi (permutation recovery) (Shiu et al., 8 Jan 2026).

1.2 Correlated Erdős–Rényi Subgraph Pair (Graph Matching Setting)

A "parent" graph G0G(n,p)G_0\sim \mathcal{G}(n,p) is generated. Two edge-subsampled graphs G1,G2G_1,G_2 are created by independently including each parent edge with probability ss. One of the graphs is vertex-permuted by an unknown bijection πSn\pi^*\in S_n. The analyst receives (G1,π(G2))(G_1,\pi^*(G_2)) and aims to recover π\pi^* (Du, 17 Feb 2025, Bozorg et al., 2019).

1.3 Agglomerated Subgraph-Pair (Super-vertex Construction)

Given a partition of [n][n] into NN disjoint nonempty subsets ("super-vertices"), a subgraph–pair model is defined on the super-vertex set: two super-vertices are connected iff at least one edge exists between their constituent nodes in the original G(n,p)G(n,p) graph. This construction creates an effective inhomogeneous random graph on the super-vertex level, with edge probabilities depending on the subset sizes (Kang et al., 2013).

2. Information-Theoretic Recovery Thresholds

Sharp information-theoretic phase transitions delimit when exact or partial recovery is possible:

2.1 Exact Subgraph Set and Permutation Recovery

  • Set Recovery: Achievable iff 12mh(p)lnn+\frac{1}{2} m h(p) - \ln n \to +\infty, impossible (converse) if 12mh(p)ln(n/m)\frac{1}{2} m h(p) - \ln(n/m) \to -\infty, where h(p)=plnp(1p)ln(1p)h(p) = -p\ln p - (1-p)\ln(1-p). Under mild conditions, the sharp threshold is 12mh(p)lnn\frac{1}{2}m h(p)\asymp \ln n (Shiu et al., 8 Jan 2026).
  • Permutation Recovery: Requires, in addition, mplnm+mp - \ln m \to +\infty (unique labeling). Fails if either the set recovery converse applies or mplnmmp-\ln m\to -\infty.

2.2 Partial Recovery in Correlated Graphs

For correlated pairs with p=nα+o(1)p = n^{-\alpha+o(1)}, α(0,1]\alpha\in(0,1], and nps2=λ=O(1)nps^2=\lambda=O(1), one cannot recover all vertices, but the fraction of recoverable correspondences is bounded tightly in terms of a limiting "balanced-load" distribution μλ\mu_\lambda:

  • The maximal fraction of accurately aligned vertices approaches Fλ(1/α)F_\lambda(1/\alpha), with Fλ(x)=μλ((x,))F_\lambda(x) = \mu_\lambda((x,\infty)) (Du, 17 Feb 2025).

These thresholds delineate computational and information-theoretic feasibility in subgraph alignment and network de-anonymization.

3. Structural and Statistical Properties

3.1 Degree and Clustering Structure

  • For two independent G(n,p1),G(n,p2)G(n,p_1),G(n,p_2) on a common vertex set, their union is G(n,punion)G(n,p_\text{union}) with punion=1(1p1)(1p2)p_\text{union}=1-(1-p_1)(1-p_2). Degree distributions are binomial, clustering coefficient is C(G)=punionC(G)=p_\text{union} (Wen et al., 2012).
  • Agglomerated super-vertex models produce inhomogeneous graphs, where connection probability between super-vertices of sizes i,ji,j is 1(1p)ij1-(1-p)^{ij}, enabling explicit degree and connectivity computations at the super-vertex level (Kang et al., 2013).

3.2 Emergence of Community and Heavy-Tailed Structures

When community sizes are heavy-tailed (e.g., piCiαp_i \sim C i^{-\alpha}), the induced super-vertex network has a scale-free (power-law) degree distribution, depending on the partition (Kang et al., 2013).

4. Methodologies and Algorithms

4.1 Brute-force (MAP) Estimator

For subgraph alignment, the optimal MAP estimator tests all mm-subsets S[n]S\subset[n] and bijections σ:S[m]\sigma:S\to[m], returning those for which relabeling G[S]G[S] by σ\sigma reproduces HπH_\pi. This is computationally intractable but achieves the information-theoretic threshold (Shiu et al., 8 Jan 2026).

4.2 Tail Degree Signature (TDS)

TDS is a polynomial-time, seedless matching algorithm exploiting the robustness of tail-degree statistics in correlated ER graphs. Feature vectors consist of sorted extremes of neighbor degree distributions across multiple neighborhood shells. Theoretical analysis shows it achieves the information-theoretic threshold ps2=Ω(logn/n)ps^2=\Omega(\log n/n) in regime p=Θ(logn/n)p=\Theta(\log n/n) (Bozorg et al., 2019).

Complexity

Algorithm Time Complexity Achieves IT Threshold
Brute-force MAP Exponential (m!(nm)m!{n\choose m}) Yes (exact recovery), not practical for large nn
TDS–h (Hungarian) O(n3)O(n^3) Yes (matching threshold for G(n,p)G(n,p), sparse regime)
TDS–g (Greedy) O(n2)O(n^2) Yes, with high probability under threshold conditions

5. Phase Transitions and Limit Theorems

5.1 Phase Diagrams in Alignment

Define α=[12mh(p)]/lnn\alpha = [\frac{1}{2}mh(p)]/\ln n. Set recovery is feasible for α>1+o(1)\alpha>1+o(1), infeasible for α<1o(1)\alpha<1-o(1), with a grey zone in between. Sharp phase transitions demarcate algorithmic possibility from impossibility (Shiu et al., 8 Jan 2026).

5.2 Community Graph Phase Transitions

For agglomerated super-vertex graphs, thresholds for connectivity and giant component emergence follow from inhomogeneous random graph (IRG) theory (Kang et al., 2013). The key parameter is cs~2c\tilde s_2, the average squared community size times edge probability:

  • Largest component vanishes if cs~21c\tilde s_2\leq1, occupies Θ(N)\Theta(N) super-vertices if cs~2>1c\tilde s_2>1.

6. Connections to Broader Random Graph Models

The ER subgraph-pair model is a special case of subgraph generated models (SUGMs), where the only generated subgraphs are links (k=1k=1 type), with SUGM reducing exactly to ER(n,pn,p). More general SUGMs encode dependency on motifs such as triangles, stars, and cliques, bridging ER structure and higher-order motif-based randomness (Chandrasekhar et al., 2016).

By tuning the types and rates of subgraph "atoms," the model generalizes ER, permitting tractable closed-form expressions for expectations, variances, and parameter inference.

7. Applications and Implications

The ER subgraph-pair model underpins rigorous analysis of biological network alignment, privacy and de-anonymization of social networks, and statistical models of network community structure. Its phase diagrams and thresholds provide foundational guarantees for algorithmic graph matching and motif-based inference. Recent advances demonstrate that truly seedless and polynomial-time algorithms can saturate the fundamental information-theoretic limits via robust local statistics, revealing new pathways for tractable recovery in high-noise regimes (Shiu et al., 8 Jan 2026, Du, 17 Feb 2025, Bozorg et al., 2019).


References:

(Shiu et al., 8 Jan 2026) Information-Theoretic Limits on Exact Subgraph Alignment Problem (Du, 17 Feb 2025) Optimal recovery of correlated Erdős-Rényi graphs (Bozorg et al., 2019) Seedless Graph Matching via Tail of Degree Distribution for Correlated Erdos-Renyi Graphs (Wen et al., 2012) Edge Union of Networks on the Same Vertex Set (Chandrasekhar et al., 2016) A Network Formation Model Based on Subgraphs (Kang et al., 2013) Evolution of a modified binomial random graph by agglomeration

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Erdos-Renyi Subgraph Pair Model.