Joint Degree-Leafdegree Distribution
- Joint degree–leafdegree distribution is a two-parameter measure that characterizes network vertices by combining overall degree and the count of leaf neighbors.
- It uses generating function techniques and two-index recursion relations to derive explicit formulas and asymptotic behaviors across various tree models.
- The framework enhances analysis of network robustness, percolation, and motif frequency by providing refined insights beyond traditional degree metrics.
The joint degree-leafdegree distribution provides a two-parameter statistical characterization of vertices in discrete trees and certain random network models, combining information about a vertex's total degree and the number of its nearest neighbors that are leaves. Formally, for a given graph or growing network, denotes the limiting fraction of vertices with degree and leafdegree (i.e., those with exactly neighbors, among which are of degree $1$). This joint distribution refines and generalizes the classical degree distribution and captures fine-structure local correlations relevant for both structural and dynamical network properties.
1. Definitions and Fundamental Properties
Let denote a finite graph (typically a tree or a sparse network) with vertices. For each vertex :
- The degree is the number of neighbors of .
- The leafdegree is the number of neighbors of that are leaves (i.e., have degree $1$).
The empirical count gives the number of vertices with . For broad classes of growing random trees (preferential attachment, random recursive trees, etc.), it can be shown via martingale concentration arguments that for each fixed ,
where is the limiting (nonrandom) joint degree-leafdegree distribution. The normalization
holds since every vertex is accounted for. Marginal distributions are given by
with (Hartle et al., 1 Feb 2026).
2. Generating Functions and Recursion Relations
The joint degree-leafdegree distribution admits a bivariate generating function
For the classical preferential attachment (PA) tree, the limiting satisfy a two-index recursion:
valid for . Transforming by the generating function gives a first-order linear PDE:
The unique analytic solution regular at with is
This generating-function framework encodes not only but also fluctuations, marginals, and structural ratios. The method generalizes to other growing tree models, such as the random recursive tree (RRT) and redirection models, each yielding its own recursion and integral expression for (Hartle et al., 1 Feb 2026).
3. Explicit Formulas and Asymptotics
From the integral form of , an explicit expression for is obtained:
Extremal values include:
- (protected nonleaves with no leaf neighbors)
For large , the conditional mean and variance of the leafdegree given degree are
Thus for high-degree vertices, the leafdegree concentrates sharply around , i.e., half of the neighbors of large- vertices are leaves with vanishingly small relative fluctuations (Hartle et al., 1 Feb 2026).
4. Marginal Distributions and Protected Vertices
The marginal leafdegree distribution,
exhibits an algebraic tail, for large . The protected vertex fraction (vertices that are not leaves and have leafdegree zero) is encoded in . Summing, the overall fraction is
with mean degree among protected vertices (Hartle et al., 1 Feb 2026).
5. Fluctuations and Limit Theorems
For fixed and large ,
and the normalized counts are asymptotically (multivariate) normal. For example:
- Number of leaves ,
- Number of degree-2 vertices ,
- Number of degree-2, leafdegree-1 vertices .
Joint distributions of these counts converge to bivariate normal with explicitly computable covariances (e.g., , ). All higher cumulants are (Hartle et al., 1 Feb 2026, Baldassarri et al., 2021).
6. Extensions and Related Models
The bivariate recursion/generating-function method is applicable to a range of tree growth mechanisms:
- Random Recursive Tree (RRT): The recursion simplifies, and the unique solution is . Degree and leafdegree distributions, as well as protected fractions, can be recovered explicitly; for the protected fraction, .
- Redirection Model: The recursion alters, leading to a more complicated, but tractable, integral representation and varying marginals and fluctuation laws.
- Pólya Trees: The joint profile of (degree-, leaf) vertices at a given level in random Pólya trees of fixed size has a generating function-based (cycle-index) construction. For , both the expectation and covariance of node counts admit asymptotic expansions, with joint normalized processes converging to linearly dependent multiples of Brownian excursion local time (Gittenberger et al., 2011).
7. Connection to Degree-Degree and Assortativity Statistics
The joint degree-leafdegree framework is related, but distinct from, the classical joint degree distribution and neighbor-degree conditional distributions as studied in degree-correlation and assortativity literature (Fotouhi et al., 2013, Samuel et al., 2020). Unlike those, the joint degree-leafdegree distribution directly counts the configuration of local neighborhoods in terms of leaf status, offering refined resolution for studying robustness, percolation, and motif frequency in networks.
In edge-based degree-degree frameworks, the probability that a random edge joins nodes of degrees and is , with marginals and conditionals given by:
The probability to reach a degree- neighbor from a leaf (degree $1$ node) is
a statistic resembling, but not equivalent to, the -marginals of (Samuel et al., 2020).
8. Significance and Applications
The joint degree-leafdegree distribution enables a comprehensive local (and nearly local) statistical characterization of the topology of random trees and related networks. Applications include:
- Quantitative analysis of motif abundance and protected vertices,
- Rigorous fluctuation (central limit) theorems for empirical substructure counts,
- Parameterization and comparison of generative tree models,
- Input to network robustness, percolation, and dynamics studies, especially in the analysis of epidemic thresholds and resilience under failures,
- Refinement beyond traditional degree and edge-based correlation statistics (Hartle et al., 1 Feb 2026, Fotouhi et al., 2013, Samuel et al., 2020).
The methodology provides both explicit formulas and asymptotics, with generating-function machinery that is tractable for both analytic and algorithmic purposes. The approach is general and adaptable to new models, supporting ongoing research into network microstructure and its macroscopic implications.