Papers
Topics
Authors
Recent
Search
2000 character limit reached

Scaled-Attachment Random Recursive Trees

Updated 14 February 2026
  • SARRTs are a class of random recursive tree models defined by nonuniform, random scaling attachment rules that generalize traditional recursive trees.
  • They exhibit explicit logarithmic scaling for typical depth and height, with constants derived from renewal theory and large deviation techniques.
  • The models extend to continuum limits where rescaling produces real trees, linking discrete structures to objects like Aldous's Brownian CRT.

Scaled-Attachment Random Recursive Trees (SARRTs) are a general class of random recursive tree models in which the location to which each new node attaches is determined by a random scaling process rather than uniform selection. This construction generalizes the classical random recursive tree (RRT) and admits a much broader array of depth and metric behaviors, encompassing nonuniform attachment rules, inhomogeneous tree growth, and rich continuum limits (Devroye et al., 2012, &&&1&&&). Key features include explicit logarithmic scaling constants for depth extremes and convergence to real trees under appropriate rescaling.

1. Discrete Model Definition and Attachment Rule

A SARRT is defined via a sequential growth process on vertices V={0,1,2,,N}V = \{0,1,2,\ldots,N\}. For i=1,,Ni=1,\ldots,N, node ii is attached to a parent node with label given by the rule

Parent(i):=iXi\text{Parent}(i) := \lfloor i X_i \rfloor

where (X0,X1,,XN)(X_0, X_1, \ldots, X_N) is a sequence of i.i.d.\ random variables with common distribution on [0,1)[0,1). The attachment process is Markovian but modulated by the random scaling inherent in the XiX_i.

This construction allows for non-uniform attachment: the attachment probability depends on the realization of XiX_i. The traditional RRT is recovered when XiUnif[0,1)X_i \sim \mathrm{Unif}[0,1), yielding uniform attachment among available parents.

A related combinatorial model employs a parameter N+\ell \in \mathbb{N}^+. At each discrete time step, a vertex is added by splitting an edge chosen uniformly at random; every \ell steps, a new leaf is appended. This yields a sequence of unlabeled rooted trees T()(n)T^{(\ell)}(n) with size growing in both vertices and leaves and provides a framework for rigorous scaling limits (Ross et al., 2016).

2. Asymptotic Depth and Height Parameters

The depth DiD_i of node ii denotes the distance (in edges) from ii to the root. Three canonical depth parameters characterize the large nn regime:

  • Typical depth DnD_n: depth of the last-inserted node.
  • Tree height HnH_n: max1inDi\max_{1 \leq i \leq n} D_i.
  • Minimum depth among youngest half MnM_n: minn/2inDi\min_{n/2 \leq i \leq n} D_i.

Their asymptotic behaviors are governed by logarithmic laws: Dn1μlogn,Hnαmaxlogn,MnαminlognD_n \sim \frac{1}{\mu} \log n, \qquad H_n \sim \alpha_{\mathrm{max}} \log n, \qquad M_n \sim \alpha_{\mathrm{min}} \log n for explicit constants μ\mu, αmax\alpha_{\mathrm{max}}, αmin\alpha_{\mathrm{min}} depending only on the law of X0X_0. These results hold whenever X0X_0 has a density, ensuring nondegeneracy of the asymptotics (Devroye et al., 2012).

The computation of these constants proceeds as follows. Set Y=logX0Y = -\log X_0 and denote its mean and variance by μ=E[Y]\mu = \mathbb{E}[Y], σ2=Var(Y)\sigma^2 = \operatorname{Var}(Y). Define the log-moment generating function Λ(λ)=logE[eλY]\Lambda(\lambda) = \log \mathbb{E}[e^{\lambda Y}] and its Cramér/Laplace transform Λ(z)=supλR{λzΛ(λ)}\Lambda^*(z) = \sup_{\lambda \in \mathbb{R}}\{\lambda z - \Lambda(\lambda)\}, then

Ψ(c)=cΛ(1/c)\Psi(c) = c \Lambda^*(-1/c)

The constants are then given by

αmax=inf{c>1/μ:Ψ(c)>1},αmin=sup{0c<1/μ:Ψ(c)>1}\alpha_{\mathrm{max}} = \inf\{c > 1/\mu : \Psi(c) > 1\}, \qquad \alpha_{\mathrm{min}} = \sup\{0 \leq c < 1/\mu : \Psi(c) > 1\}

with 1/μ<αmax<1/\mu < \alpha_{\mathrm{max}} < \infty, 0αmin<1/μ0 \leq \alpha_{\mathrm{min}} < 1/\mu when X0X_0 is nondegenerate.

3. Special Case: Uniform Distribution and Explicit Constants

For X0Unif[0,1)X_0 \sim \mathrm{Unif}[0,1), one has YExp(1)Y \sim \mathrm{Exp}(1), μ=1\mu=1 and σ2=1\sigma^2=1. The Laplace and Cramér transforms simplify to

Λ(λ)=log(1λ),Λ(z)=z1logz\Lambda(\lambda) = -\log(1-\lambda), \quad \Lambda^*(z) = z - 1 - \log z

yielding

Ψ(c)=1cclogc\Psi(c) = 1 - c - c \log c

The equation Ψ(c)=1\Psi(c)=1 is solved by c=ec=e, so αmax=e\alpha_{\mathrm{max}}=e and thus the maximal tree height satisfies HnelognH_n \sim e\log n. Matching lower and upper bounds for height can be obtained without recourse to branching random walks, using Chernoff bounds, union bounds, and renewal-theoretic arguments (Devroye et al., 2012).

4. Scaling Limits and Continuum Real Trees

A continuum limit for SARRTs is established by embedding the discrete tree in a rescaled metric space and passing to the limit in the Gromov–Hausdorff–Prokhorov (GHP) topology. The limiting object is constructed via a line-breaking process on R+\mathbb{R}^+:

  • Consider an inhomogeneous Poisson process on (0,)(0,\infty) with rate (+1)tdt(\ell+1) t^\ell dt, 1\ell \geq 1.
  • Its jump times C0=0<C1<C2<C_0 = 0 < C_1 < C_2 < \cdots determine branch lengths.
  • At each step, a new segment of length Ck+1CkC_{k+1} - C_k is attached at a point chosen uniformly with respect to length measure on the current tree.
  • The projective limit yields a compact real tree TT with an intrinsic metric, and a canonical uniform leaf measure μ\mu is inherited as the weak limit of uniform measures on leaves at each finite stage (Ross et al., 2016).

For =1\ell=1, the discrete process coincides with Rémy's algorithm and the continuum limit is Aldous's Brownian Continuum Random Tree (CRT), with Poisson line-breaking rate $2t dt$.

5. Detailed Proof Techniques and Renewal Theories

The depth and height results leverage several probabilistic tools:

  • The evolution of labels along the ancestral line of a node can be linearized as logL(n,j)logn(Y0+Y1++Yj1)\log L(n,j) \approx \log n - (Y_0 + Y_1 + \ldots + Y_{j-1}).
  • The typical depth DnD_n is then a hitting time for a sum of i.i.d.\ increments, facilitating the use of renewal theory and large deviation techniques.
  • Chernoff bounds and union bounds are employed to obtain high-probability upper bounds for the tree height.
  • Lower bounds on tree height (and, analogously, minimum depths) are derived via the second-moment (Chung–Erdős) argument and precise tail estimates from Cramér's theorem.
  • In the continuum setting, couplings with Beta–Gamma and Dirichlet fragmentations justify the matching of discrete tree skeletons to the limit real tree.
  • Coupling arguments with inhomogeneous Pólya urns control the number of discrete steps along arcs of the continuum tree, with sharp moment and maximal subtree size estimates (Devroye et al., 2012, Ross et al., 2016).

6. Generalizations, Special Cases, and Structural Phenomena

SARRTs encompass several natural and deterministic tree structures as special or limiting cases:

  • If X0θ(0,1)X_0 \equiv \theta \in (0,1) almost surely, the process yields a deterministic mm-ary complete tree with height (1/μ)logn\approx (1/\mu)\log n.
  • Choosing X0=max{U1,,Uk}X_0 = \max\{U_1, \ldots, U_k\} or X0=min{U1,,Uk}X_0 = \min\{U_1, \ldots, U_k\}, where UiU_i are independent Unif[0,1)\mathrm{Unif}[0,1), interpolates between "greedy" distance trees and uniform random DAGs, with explicit formulas for the scaling exponents.
  • Power-of-choice models can be implemented by sampling multiple independent XX values and attaching to the minimizer, producing nontrivial effects on tree distances.

The choice of X0X_0 generates a spectrum of scaling constants, enabling richer phase-transition behavior for tree height and minimum depths than observed in the classical uniform RRT.

When =1\ell=1 in the edge-splitting formulation, the discrete process coincides precisely with Rémy's construction of uniform (leaf-labeled) binary trees. In this case, the scaling limit is the Brownian CRT constructed via Aldous's Poisson line-breaking process. The parameter \ell can be interpreted as controlling the exponent of the attachment process in the continuum limit, with =1\ell=1 producing the CRT and higher \ell yielding inhomogeneous generalizations (Ross et al., 2016). For =0\ell = 0, the limiting object is star-like and not generally included within the SARRT framework.

These connections highlight the role of SARRTs as a unifying framework for discrete and continuous models of random tree growth with broad applicability in probability, combinatorics, and statistical physics.


Selected References:

  • Devroye, Fawzi, Fraiman, "Depth properties of scaled attachment random recursive trees" (Devroye et al., 2012)
  • Ross, Wen, "Scaling limits for some random trees constructed inhomogeneously" (Ross et al., 2016)
Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Scaled-Attachment Random Recursive Trees (SARRTs).