Papers
Topics
Authors
Recent
Search
2000 character limit reached

Triple-Dyad Ratio Estimator

Updated 17 January 2026
  • The paper introduces a triple-dyad ratio estimator that leverages algebraic identities from dyad triplets to infer global and nodal parameters in directed graphs.
  • It employs explicit ratio formulas and bias correction techniques, ensuring computational tractability and asymptotic consistency with normality.
  • The method offers practical testing procedures and scalable implementations, outperforming traditional MLE in large network applications.

The triple-dyad ratio estimator is a parametric estimation method for the p1p_1 model of directed graphs, enabling statistically principled inference of global and nodal parameters including density and reciprocity. This estimator leverages algebraic identities satisfied by triple patterns of dyad indicators, yielding explicit, scalable estimators with guaranteed asymptotic consistency and normality. Its computational tractability, combined with rigorous bias correction and testing methods, makes it a foundational tool for the analysis of large directed networks (Feng et al., 10 Jan 2026).

1. Mathematical Foundation and Model Context

The p1p_1 model concerns a directed graph GnG_n with nn labeled vertices. The adjacency matrix X=(Xij)X=(X_{ij}) encodes directed edges with Xij=1X_{ij}=1 if an edge exists from ii to jj and Xii=0X_{ii}=0. For each unordered node pair {i,j}\{i, j\}, the dyad is Dij=(Xij,Xji){0,1}2D_{ij}=(X_{ij}, X_{ji}) \in \{0,1\}^2, leading to four dyad-types: (0,0)(0,0), (0,1)(0,1), (1,0)(1,0), and (1,1)(1,1), with binary indicators Iijab=1{Dij=(a,b)}I_{ij}^{ab} = \mathbf{1}\{D_{ij} = (a, b)\}.

The p1p_1 model posits independent dyads with probabilities parameterized as

P(Dij=(a,b))=pijab=exp{a(θ+αi+βj)+b(θ+αj+βi)+abρ}kij\mathbb{P}(D_{ij} = (a, b)) = p_{ij}^{ab} = \frac{\exp\{a(\theta + \alpha_i + \beta_j) + b(\theta + \alpha_j + \beta_i) + ab\rho\}}{k_{ij}}

where kijk_{ij} normalizes over all (a,b)(a,b), θ\theta governs density, ρ\rho governs reciprocity, and αi,βj\alpha_i, \beta_j are nodal “expansiveness” and “popularity” parameters subject to iαi=0\sum_{i}\alpha_i=0, jβj=0\sum_{j}\beta_j=0 (Feng et al., 10 Jan 2026).

2. Triple-Dyad Indicator Construction

For estimation, the triple-dyad indicator Titj(a1b1),(a2b2),(a3b3)=Iita1b1Itja2b2Ijia3b3T_{itj}^{(a_1b_1),(a_2b_2),(a_3b_3)} = I_{it}^{a_1b_1} I_{tj}^{a_2b_2} I_{ji}^{a_3b_3} is defined over triplets (i,t,j)(i, t, j) of distinct nodes and their dyad configurations. The expectation factorizes: ETitj(a1b1),(a2b2),(a3b3)=pita1b1ptja2b2pjia3b3\mathbb{E}T_{itj}^{(a_1b_1),(a_2b_2),(a_3b_3)} = p_{it}^{a_1b_1} p_{tj}^{a_2b_2} p_{ji}^{a_3b_3} Strategic selection of dyad patterns and ratios enables isolation of target parameter combinations through empirical counts.

3. Explicit Estimator Formulas

Density Estimation

For fixed tt, define \begin{align*} A_t &= \sum_{i \neq t} \sum_{j \neq t, i} I_{it}{01} I_{tj}{01} I_{ij}{00} \ B_t &= \sum_{i \neq t} \sum_{j \neq t, i} I_{it}{00} I_{tj}{00} I_{ij}{01} \end{align*} The estimator for θ+αt+βt\theta + \alpha_t + \beta_t is log(At/Bt)\log(A_t / B_t). Averaging over tt and exploiting identifiability yields

θ^=1nt=1nlog(AtBt)\hat{\theta} = \frac{1}{n}\sum_{t=1}^n \log\left(\frac{A_t}{B_t}\right)

Reciprocity Estimation

For another triple pattern, isolating ρ+θ+αt+βt\rho + \theta + \alpha_t + \beta_t and subtracting θ^\hat{\theta} gives

ρ^=1nt=1nlog(i,jtIit11Itj11Iij10i,jtIit10Itj10Iij11)θ^\hat{\rho} = \frac{1}{n}\sum_{t=1}^n \log\left(\frac{\sum_{i,j \neq t} I_{it}^{11} I_{tj}^{11} I_{ij}^{10}}{\sum_{i,j \neq t} I_{it}^{10} I_{tj}^{10} I_{ij}^{11}}\right) - \hat{\theta}

Nodal Parameters

For each ii \begin{align*} \hat{\alpha}i &= \frac{1}{n}\sum{t \neq i} \log\left(\frac{\sum_{k,l \neq i, t} I_{ki}{01} I_{kl}{00} I_{tl}{01}}{\sum_{k,l \neq i, t} I_{ki}{00} I_{kl}{01} I_{tl}{00}}\right) - \hat{\theta} \ \hat{\beta}j &= \frac{1}{n}\sum{t \neq j} \log\left(\frac{\sum_{k,l \neq j, t} I_{kt}{01} I_{kl}{00} I_{jl}{01}}{\sum_{k,l \neq j, t} I_{kt}{00} I_{kl}{01} I_{jl}{00}}\right) - \hat{\theta} \end{align*} Collectively, Θ^=(ρ^,θ^,{α^i},{β^j})\widehat{\Theta} = (\hat{\rho}, \hat{\theta}, \{ \hat{\alpha}_i \}, \{ \hat{\beta}_j \}) defines the triple-dyad ratio estimator (Feng et al., 10 Jan 2026).

4. Statistical Guarantees and Bias Correction

Consistency and Asymptotics

Under bounded parameter norms and non-sparsity,

Θ^Θ=O(lognn)\|\widehat{\Theta} - \Theta\|_\infty = O\left(\sqrt{\frac{\log n}{n}}\right)

Sharper rates are available in regimes with specified dyad probability bounds.

Central Limit Theorem

For large nn,

θ^θθσθdN(0,1),ρ^ρρσρdN(0,1)\frac{\hat{\theta} - \theta - \theta^*}{\sigma_\theta} \xrightarrow{d} \mathcal{N}(0,1), \quad \frac{\hat{\rho} - \rho - \rho^*}{\sigma_\rho} \xrightarrow{d} \mathcal{N}(0,1)

where the explicit bias terms θ,ρ\theta^*, \rho^* are O(1/n)O(1/n) and variances are analytic sums over probabilities and mixing weights. Nodal parameters admit a joint asymptotic normality with explicit block-diagonal covariance formulas.

Bias Correction

Plug-in empirical versions of all dyad probabilities yield θ^,ρ^\hat{\theta}^*, \hat{\rho}^*; bias-corrected estimators are

θ^BC=θ^θ^,ρ^BC=ρ^ρ^\hat{\theta}_{\mathrm{BC}} = \hat{\theta} - \hat{\theta}^*, \quad \hat{\rho}_{\mathrm{BC}} = \hat{\rho} - \hat{\rho}^*

5. Testing Procedures and Inferential Extensions

To test for reciprocity (H0:ρ=0H_0: \rho = 0), the asymptotic normality and bias-correction yield the statistic

T=ρ^ρ^σ^ρT = \frac{\hat{\rho} - \hat{\rho}^*}{\hat{\sigma}_\rho}

H0H_0 is rejected at significance α\alpha if T>z1α/2|T| > z_{1-\alpha/2}. Confidence intervals and two-sample tests comparing graphs are constructed via analogous formulas, with coverage frequencies empirically verified to be near nominal values.

6. Computational Complexity and Implementation

Naïve evaluation of all node triples is prohibitive at O(n3)O(n^3) time. The estimator is efficiently computed via matrix-product chains. Configuration matrices AabA^{ab} encode dyad indicators; products such as A01A00A01A^{01}A^{00}A^{01} allow extraction of all necessary triple sums for every tt via diagonal entries, computable in O(nω)O(n^\omega) time (ω2.373\omega \approx 2.373), or O(n2)O(n^2) for sparse graphs. For nn up to 5,000, the triple-dyad estimator executes in under 1.5 seconds, compared to 8–90 seconds for MLE fitting.

7. Empirical Performance and Real-World Applications

Simulation studies across multiple density regimes and node counts show that the triple-dyad ratio estimator matches the accuracy of the MLE for global parameters and approaches it for nodal parameters as the network size grows. In a real-world example on the Sina Weibo following network (n=4077n=4077), the estimator recovered strong reciprocity (ρ^7.56\hat{\rho} \approx 7.56) and heterogeneous node degrees, yielding statistically significant reciprocity test results. The estimator’s speed and scalability are confirmed in extensive numerical experiments (Feng et al., 10 Jan 2026).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Triple-Dyad Ratio Estimator.