Triple-Dyad Ratio Estimator
- The paper introduces a triple-dyad ratio estimator that leverages algebraic identities from dyad triplets to infer global and nodal parameters in directed graphs.
- It employs explicit ratio formulas and bias correction techniques, ensuring computational tractability and asymptotic consistency with normality.
- The method offers practical testing procedures and scalable implementations, outperforming traditional MLE in large network applications.
The triple-dyad ratio estimator is a parametric estimation method for the model of directed graphs, enabling statistically principled inference of global and nodal parameters including density and reciprocity. This estimator leverages algebraic identities satisfied by triple patterns of dyad indicators, yielding explicit, scalable estimators with guaranteed asymptotic consistency and normality. Its computational tractability, combined with rigorous bias correction and testing methods, makes it a foundational tool for the analysis of large directed networks (Feng et al., 10 Jan 2026).
1. Mathematical Foundation and Model Context
The model concerns a directed graph with labeled vertices. The adjacency matrix encodes directed edges with if an edge exists from to and . For each unordered node pair , the dyad is , leading to four dyad-types: , , , and , with binary indicators .
The model posits independent dyads with probabilities parameterized as
where normalizes over all , governs density, governs reciprocity, and are nodal “expansiveness” and “popularity” parameters subject to , (Feng et al., 10 Jan 2026).
2. Triple-Dyad Indicator Construction
For estimation, the triple-dyad indicator is defined over triplets of distinct nodes and their dyad configurations. The expectation factorizes: Strategic selection of dyad patterns and ratios enables isolation of target parameter combinations through empirical counts.
3. Explicit Estimator Formulas
Density Estimation
For fixed , define \begin{align*} A_t &= \sum_{i \neq t} \sum_{j \neq t, i} I_{it}{01} I_{tj}{01} I_{ij}{00} \ B_t &= \sum_{i \neq t} \sum_{j \neq t, i} I_{it}{00} I_{tj}{00} I_{ij}{01} \end{align*} The estimator for is . Averaging over and exploiting identifiability yields
Reciprocity Estimation
For another triple pattern, isolating and subtracting gives
Nodal Parameters
For each \begin{align*} \hat{\alpha}i &= \frac{1}{n}\sum{t \neq i} \log\left(\frac{\sum_{k,l \neq i, t} I_{ki}{01} I_{kl}{00} I_{tl}{01}}{\sum_{k,l \neq i, t} I_{ki}{00} I_{kl}{01} I_{tl}{00}}\right) - \hat{\theta} \ \hat{\beta}j &= \frac{1}{n}\sum{t \neq j} \log\left(\frac{\sum_{k,l \neq j, t} I_{kt}{01} I_{kl}{00} I_{jl}{01}}{\sum_{k,l \neq j, t} I_{kt}{00} I_{kl}{01} I_{jl}{00}}\right) - \hat{\theta} \end{align*} Collectively, defines the triple-dyad ratio estimator (Feng et al., 10 Jan 2026).
4. Statistical Guarantees and Bias Correction
Consistency and Asymptotics
Under bounded parameter norms and non-sparsity,
Sharper rates are available in regimes with specified dyad probability bounds.
Central Limit Theorem
For large ,
where the explicit bias terms are and variances are analytic sums over probabilities and mixing weights. Nodal parameters admit a joint asymptotic normality with explicit block-diagonal covariance formulas.
Bias Correction
Plug-in empirical versions of all dyad probabilities yield ; bias-corrected estimators are
5. Testing Procedures and Inferential Extensions
To test for reciprocity (), the asymptotic normality and bias-correction yield the statistic
is rejected at significance if . Confidence intervals and two-sample tests comparing graphs are constructed via analogous formulas, with coverage frequencies empirically verified to be near nominal values.
6. Computational Complexity and Implementation
Naïve evaluation of all node triples is prohibitive at time. The estimator is efficiently computed via matrix-product chains. Configuration matrices encode dyad indicators; products such as allow extraction of all necessary triple sums for every via diagonal entries, computable in time (), or for sparse graphs. For up to 5,000, the triple-dyad estimator executes in under 1.5 seconds, compared to 8–90 seconds for MLE fitting.
7. Empirical Performance and Real-World Applications
Simulation studies across multiple density regimes and node counts show that the triple-dyad ratio estimator matches the accuracy of the MLE for global parameters and approaches it for nodal parameters as the network size grows. In a real-world example on the Sina Weibo following network (), the estimator recovered strong reciprocity () and heterogeneous node degrees, yielding statistically significant reciprocity test results. The estimator’s speed and scalability are confirmed in extensive numerical experiments (Feng et al., 10 Jan 2026).