Papers
Topics
Authors
Recent
Search
2000 character limit reached

Robust 3DGS: Delayed Gaussian Growth

Updated 5 December 2025
  • Delayed Gaussian Growth Strategy is a technique for robust 3D reconstruction that postpones splitting or cloning Gaussians until a reliable static scene representation is achieved.
  • It employs a principled densification schedule, using an explicit delay parameter and gradient-based criteria to target only consistently static regions.
  • Empirical results demonstrate significant improvements in PSNR and SSIM by mitigating overfitting to transient artifacts and illumination variations.

The delayed Gaussian growth strategy is a core component of RobustSplat++, a system for robust 3D Gaussian Splatting (3DGS) in challenging, in-the-wild visual scenarios characterized by transient objects and illumination variations. This approach directly addresses failure modes observed in baseline 3DGS when densification—via splitting or cloning Gaussians—is not sufficiently constrained. It introduces a principled schedule: deferring all densification until a static scene representation has been established, and subsequently refining only regions consistently identified as static. Empirical analysis across diverse datasets demonstrates that this strategy substantially mitigates overfitting to transients and illumination artifacts, yielding more stable and accurate reconstructions (Fu et al., 4 Dec 2025).

1. Motivation and Problem Setting

Standard 3DGS pipelines enable densification early in training (e.g., after 500 iterations, then every 10,000 iterations). While critical for modeling fine geometric detail, premature densification during training on real-world data exacerbates two issues: (a) spawning new Gaussians tracks transient artifacts (moving objects, shadows, highlights), as the photometric loss is dominated by outliers before scene geometry stabilizes; (b) the model overfits these artifacts, producing "floater" artifacts and temporal/photometric instability. The delayed Gaussian growth schedule addresses this by ensuring that initial optimization exclusively recovers static scene structure and appearance, precluding the possibility of adapting model capacity to non-static effects early on.

2. Algorithmic Structure and Training Regime

Delayed Gaussian growth modifies the densification schedule by introducing an explicit growth delay parameter istarti_\text{start} (e.g., istart=10, ⁣000i_\text{start}=10,\!000 iterations). No splitting or cloning of Gaussians is permitted until this threshold. Afterward, densification is triggered regularly with interval Δ\Delta (e.g., every 10,000 iterations), conditioned on additional criteria:

  • For each Gaussian gjg_j, compute its accumulated position gradient over static-weighted rays:

Δpj:=rays rMt(r)Lrgb/pj(r)2\Delta p_j := \sum_{\text{rays } r} M_t(r) \cdot \Vert \partial L_\text{rgb}/\partial p_j(r) \Vert_2

where Mt(r)M_t(r) is the mask value denoting static region confidence.

  • If Δpj>τpos\Delta p_j > \tau_\text{pos} (positional-gradient threshold, typically $0.1$–$0.2$), split or clone gjg_j.
  • Optionally prune Gaussians with low opacity post-splitting.

This entire schedule is encapsulated by the indicator function

D(i)={1if iistart(iistart)modΔ=0 0otherwiseD(i) = \begin{cases} 1 & \text{if } i \geq i_\text{start} \land (i-i_\text{start}) \bmod \Delta = 0 \ 0 & \text{otherwise} \end{cases}

ensuring densification only when D(i)=1D(i)=1.

3. Mathematical Formulation

In 3DGS, each Gaussian gig_i is parameterized by center pip_i, covariance Σi\Sigma_i, opacity αi\alpha_i, and spherical harmonics coefficients cic_i. Rendering employs ordered, compositional splatting across projected 2D Gaussians:

c(p)=i=1NciαiGi2D(p)j=1i1[1αjGj2D(p)]c(p) = \sum_{i=1}^N c_i \alpha_i G_i^{2D}(p) \prod_{j=1}^{i-1} [1-\alpha_j G_j^{2D}(p)]

with a reconstruction loss

Lrgb=(1λ)CCgt1+λD-SSIM(C,Cgt)L_\text{rgb} = (1-\lambda) \cdot \Vert C - C_\text{gt} \Vert_1 + \lambda \cdot D\text{-SSIM}(C, C_\text{gt})

RobustSplat++ modifies the densification trigger mechanism to sum gradients only over rays with Mt(r)1M_t(r) \approx 1. The mask MtM_t is predicted by an MLP over DINOv2 features, and heavy static regularization is applied to MtM_t before istarti_\text{start} so Mt1M_t \to 1 everywhere in early iterations.

4. Interaction with Static-Scene Optimization

For the first istarti_\text{start} iterations, the model is restricted to the initial, sparse set of Gaussians recovered from SfM. During this phase:

  • The network recovers low-frequency structure in geometry.
  • Appearance embeddings (2D/3D) for illumination effects are learned in a smoothly varying, noise-reduced context.
  • Mask-MLP is trained with strong regularization, encouraging masks to predict all regions as static.

Since densification is completely inhibited, no model capacity is allocated to transient or ill-posed regions. After static scene geometry is locked in, the densification process begins but only operates in spatial regions and directions strongly supported by multi-view consistency, as encoded in the learned mask.

5. Empirical Evaluations

Quantitative and qualitative analyses in (Fu et al., 4 Dec 2025) provide evidence for the efficacy of delayed Gaussian growth:

  • PSNR and SSIM degrade rapidly for standard 3DGS when densification fits moving people and shadows (cf. Fig. 2), while models with delayed growth maintain higher, more stable metrics.
  • Ablations varying istarti_\text{start} (Fig. 4) show that later densification yields greater PSNR stability, both with and without mask supervision.
  • On "NeRF On-the-go" benchmarks (Table III), adding delayed growth ("+DG") improves low-occlusion PSNR from 20.2820.9220.28 \to 20.92 and SSIM from 0.6960.7110.696 \to 0.711. The full model (mask+bootstrapping+DG) materially advances results ($21.08$, $0.719$).
  • On "NeRF-OSR" (Table VI), removing delayed growth causes marked PSNR drops (from 19.1318.2219.13 \to 18.22) and SSIM (0.7070.6870.707 \to 0.687).
  • Qualitative results document clearer reconstructions of static structure, elimination of floaters, and suppression of transient artifacts.

6. Implementation Guidance and Hyperparameters

Key integration procedures and recommended values are as follows:

  • Use existing 3DGS codebase (e.g., graphdeco's official).
  • Set maximum iterations Itotal30, ⁣000I_\text{total} \approx 30,\!000.
  • Growth delay istart=10, ⁣000i_\text{start} = 10,\!000; adjust to 12, ⁣00015, ⁣00012,\!000-15,\!000 for noisy/large datasets, 8, ⁣0008,\!000 for clean ones.
  • Positional threshold τpos\tau_\text{pos}: default 3DGS, adjust ±1020%\pm10{-}20\% as needed.
  • Mask bootstrapping: low-res (224×224) DINOv2 until istarti_\text{start}, high-res (504×504) afterward.
  • Mask-MLP: two linear layers, Adam lr 10310^{-3}, mask loss weights λres=λcos=0.5\lambda_\text{res} = \lambda_\text{cos} = 0.5, λreg=2.0\lambda_\text{reg} = 2.0, warm-up βreg=2000\beta_\text{reg} = 2000.
  • Appearance-MLP: three linear layers, Adam lr 5×1045 \times 10^{-4}, 2D embedding $48$, 3D embedding $30$.
  • DINOv2 ViT-S/14 provides mask-MLP input features.

Adopting delayed Gaussian growth requires minimal code changes—typically O(1)O(1) additional lines in the training loop—and is compatible with either static or appearance-augmented 3DGS pipelines.

7. Significance and Implications

The delayed Gaussian growth strategy provides a generalizable approach for improving robustness in 3DGS-derived scene representations subject to dynamic real-world phenomena. By structurally preventing early overfitting to transients and local photometric outliers, it enables higher-fidelity reconstructions and more accurate geometry, especially in unconstrained environments. This suggests a broader principle: for geometric MLP-based models where dynamic and illumination confounders are present, initial static-structure optimization followed by capacity expansion prevents deleterious overadaptation and improves final model utility (Fu et al., 4 Dec 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Delayed Gaussian Growth Strategy.