Scaled-Attachment Random Recursive Trees
- SARRTs are a class of random recursive tree models defined by nonuniform, random scaling attachment rules that generalize traditional recursive trees.
- They exhibit explicit logarithmic scaling for typical depth and height, with constants derived from renewal theory and large deviation techniques.
- The models extend to continuum limits where rescaling produces real trees, linking discrete structures to objects like Aldous's Brownian CRT.
Scaled-Attachment Random Recursive Trees (SARRTs) are a general class of random recursive tree models in which the location to which each new node attaches is determined by a random scaling process rather than uniform selection. This construction generalizes the classical random recursive tree (RRT) and admits a much broader array of depth and metric behaviors, encompassing nonuniform attachment rules, inhomogeneous tree growth, and rich continuum limits (Devroye et al., 2012, &&&1&&&). Key features include explicit logarithmic scaling constants for depth extremes and convergence to real trees under appropriate rescaling.
1. Discrete Model Definition and Attachment Rule
A SARRT is defined via a sequential growth process on vertices . For , node is attached to a parent node with label given by the rule
where is a sequence of i.i.d.\ random variables with common distribution on . The attachment process is Markovian but modulated by the random scaling inherent in the .
This construction allows for non-uniform attachment: the attachment probability depends on the realization of . The traditional RRT is recovered when , yielding uniform attachment among available parents.
A related combinatorial model employs a parameter . At each discrete time step, a vertex is added by splitting an edge chosen uniformly at random; every steps, a new leaf is appended. This yields a sequence of unlabeled rooted trees with size growing in both vertices and leaves and provides a framework for rigorous scaling limits (Ross et al., 2016).
2. Asymptotic Depth and Height Parameters
The depth of node denotes the distance (in edges) from to the root. Three canonical depth parameters characterize the large regime:
- Typical depth : depth of the last-inserted node.
- Tree height : .
- Minimum depth among youngest half : .
Their asymptotic behaviors are governed by logarithmic laws: for explicit constants , , depending only on the law of . These results hold whenever has a density, ensuring nondegeneracy of the asymptotics (Devroye et al., 2012).
The computation of these constants proceeds as follows. Set and denote its mean and variance by , . Define the log-moment generating function and its Cramér/Laplace transform , then
The constants are then given by
with , when is nondegenerate.
3. Special Case: Uniform Distribution and Explicit Constants
For , one has , and . The Laplace and Cramér transforms simplify to
yielding
The equation is solved by , so and thus the maximal tree height satisfies . Matching lower and upper bounds for height can be obtained without recourse to branching random walks, using Chernoff bounds, union bounds, and renewal-theoretic arguments (Devroye et al., 2012).
4. Scaling Limits and Continuum Real Trees
A continuum limit for SARRTs is established by embedding the discrete tree in a rescaled metric space and passing to the limit in the Gromov–Hausdorff–Prokhorov (GHP) topology. The limiting object is constructed via a line-breaking process on :
- Consider an inhomogeneous Poisson process on with rate , .
- Its jump times determine branch lengths.
- At each step, a new segment of length is attached at a point chosen uniformly with respect to length measure on the current tree.
- The projective limit yields a compact real tree with an intrinsic metric, and a canonical uniform leaf measure is inherited as the weak limit of uniform measures on leaves at each finite stage (Ross et al., 2016).
For , the discrete process coincides with Rémy's algorithm and the continuum limit is Aldous's Brownian Continuum Random Tree (CRT), with Poisson line-breaking rate $2t dt$.
5. Detailed Proof Techniques and Renewal Theories
The depth and height results leverage several probabilistic tools:
- The evolution of labels along the ancestral line of a node can be linearized as .
- The typical depth is then a hitting time for a sum of i.i.d.\ increments, facilitating the use of renewal theory and large deviation techniques.
- Chernoff bounds and union bounds are employed to obtain high-probability upper bounds for the tree height.
- Lower bounds on tree height (and, analogously, minimum depths) are derived via the second-moment (Chung–Erdős) argument and precise tail estimates from Cramér's theorem.
- In the continuum setting, couplings with Beta–Gamma and Dirichlet fragmentations justify the matching of discrete tree skeletons to the limit real tree.
- Coupling arguments with inhomogeneous Pólya urns control the number of discrete steps along arcs of the continuum tree, with sharp moment and maximal subtree size estimates (Devroye et al., 2012, Ross et al., 2016).
6. Generalizations, Special Cases, and Structural Phenomena
SARRTs encompass several natural and deterministic tree structures as special or limiting cases:
- If almost surely, the process yields a deterministic -ary complete tree with height .
- Choosing or , where are independent , interpolates between "greedy" distance trees and uniform random DAGs, with explicit formulas for the scaling exponents.
- Power-of-choice models can be implemented by sampling multiple independent values and attaching to the minimizer, producing nontrivial effects on tree distances.
The choice of generates a spectrum of scaling constants, enabling richer phase-transition behavior for tree height and minimum depths than observed in the classical uniform RRT.
7. Connection to Rémy's Algorithm, CRTs, and Related Models
When in the edge-splitting formulation, the discrete process coincides precisely with Rémy's construction of uniform (leaf-labeled) binary trees. In this case, the scaling limit is the Brownian CRT constructed via Aldous's Poisson line-breaking process. The parameter can be interpreted as controlling the exponent of the attachment process in the continuum limit, with producing the CRT and higher yielding inhomogeneous generalizations (Ross et al., 2016). For , the limiting object is star-like and not generally included within the SARRT framework.
These connections highlight the role of SARRTs as a unifying framework for discrete and continuous models of random tree growth with broad applicability in probability, combinatorics, and statistical physics.
Selected References:
- Devroye, Fawzi, Fraiman, "Depth properties of scaled attachment random recursive trees" (Devroye et al., 2012)
- Ross, Wen, "Scaling limits for some random trees constructed inhomogeneously" (Ross et al., 2016)