Papers
Topics
Authors
Recent
Search
2000 character limit reached

Multi-Scale SIREN-PINN Architecture

Updated 20 January 2026
  • The paper demonstrates that Multi-Scale SIREN-PINN uses dual SIREN branches with frequency-scaled sinusoidal activations to capture both macro- and micro-scale dynamics.
  • It integrates physics-informed loss functions and coupled geometry-state learning to mitigate spectral bias and resolve high-frequency oscillations.
  • The architecture achieves robust convergence and lower error metrics, outperforming conventional PINNs in applications like heat transfer and reaction-diffusion systems.

The Multi-Scale SIREN-PINN architecture is a neural PDE solver that combines sinusoidal representation networks and physics-informed loss functions to accurately model multiscale spatiotemporal dynamics, particularly on complex manifolds or in media with highly oscillatory coefficients. Distinct from conventional PINN frameworks employing smooth activations (e.g., Tanh, ReLU), Multi-Scale SIREN-PINN leverages frequency-diverse sinusoidal activations and network initialization to resolve both macro- and micro-scale physics, and introduces coupled geometry-state learning. It demonstrates utility in domains from multiscale heat transfer (Hintermüller et al., 2023) to stochastic reaction-diffusion in curved geometries (Chrisnanto et al., 13 Jan 2026).

1. Architectural Principles and Network Design

Multi-Scale SIREN-PINN uses dual fully-connected SIREN branches, each leveraging periodic activations h(l)=sin(ω0W(l)h(l1)+b(l))h^{(l)} = \sin(\omega_0 W^{(l)} h^{(l-1)} + b^{(l)}) with frequency scaling factors ω0\omega_0 specific to each layer and branch (Chrisnanto et al., 13 Jan 2026). The architecture typically comprises:

  • Geometry Branch (Ψϕ\Psi_\phi): 5-layer SIREN mapping x=(x,y)x = (x, y) to curvature field κ(x)\kappa(x) with ω0=30\omega_0 = 30 in the first layer, followed by lower-frequency layers.
  • State Branch (Φθ\Phi_\theta): 8-layer SIREN mapping (x,y,t)(x, y, t) to complex system state A(x,t)CA(x, t) \in \mathbb{C}, initialized at ω0=10\omega_0 = 10 for the input layer.
  • Initialization follows Sitzmann et al. (2020): weights W(l)W^{(l)} and biases b(l)b^{(l)} are drawn from Uniform(6/nl1/ω0,+6/nl1/ω0)\text{Uniform}\left(-\sqrt{6/n^{l-1}}/\omega_0, +\sqrt{6/n^{l-1}}/\omega_0\right) and Uniform(π/ω0,+π/ω0)\text{Uniform}(-\pi/\omega_0, +\pi/\omega_0) respectively.

This configuration is designed to mitigate spectral bias and enable representation of both low-frequency envelopes and high-frequency defect structures.

2. Multiscale Physical Modeling via PDE-Constrained Loss

The surrogate model approximates physical states and latent geometry. In the context of a curvature-perturbed Ginzburg-Landau equation, the physics residual for the PINN is

Fθ,ϕ(x,t):=tAθμAθ(1+ib)[D0(1+ακϕ(x))Aθ]+(1+ic)Aθ2AθF_{\theta,\phi}(x, t) := \partial_t A_\theta - \mu A_\theta - (1 + ib) \nabla \cdot [D_0(1 + \alpha \kappa_\phi(x)) \nabla A_\theta] + (1 + ic) |A_\theta|^2 A_\theta

where Aθ(x,t)A_\theta(x, t) is the predicted amplitude field and κϕ(x)\kappa_\phi(x) is the learned curvature. Analogously, in heat-transfer settings, the fine-scale and coarse-scale problems are formulated as (Hintermüller et al., 2023):

  • Fine-scale PDE: Aεuε=fεA^\varepsilon u^\varepsilon = f^\varepsilon in HH, with Aε[u]=(Kε(x)u)A^\varepsilon[u] = -\nabla \cdot (K^\varepsilon(x) \nabla u) and Kε(x)K^\varepsilon(x) rapidly oscillating.
  • Coarse-scale PDE: Bilinear form bL[u](y,v)=L[u]y,vY,Yb_{L[u]}(y, v) = \langle L[u] y, v \rangle_{Y^*, Y}, with L[u]L[u] defining upscaled physics.

3. Loss Definitions and Multi-Objective Training

Multi-Scale SIREN-PINN employs a multi-objective loss integrating data fidelity, PDE residual, boundary conditions, spectral structure, and geometry regularization (Chrisnanto et al., 13 Jan 2026):

  • Data loss: LdataL_{\text{data}} over sparse observations.
  • PDE loss: LPDEL_{\text{PDE}} on collocation points sampling the domain.
  • Boundary loss: LBCL_{\text{BC}} enforces periodicity or Dirichlet constraints.
  • Spectral loss: LSpecL_{\text{Spec}} aligns predicted and empirical Fourier spectra.
  • Curvature regularization: LRegL_{\text{Reg}} via total variation of κϕ(x)\kappa_\phi(x).

The total loss is:

Ltotal=λdataLdata+λPDELPDE+λBCLBC+λSLSpec+λRLRegL_{\text{total}} = \lambda_{\text{data}} L_{\text{data}} + \lambda_{\text{PDE}} L_{\text{PDE}} + \lambda_{\text{BC}} L_{\text{BC}} + \lambda_S L_{\text{Spec}} + \lambda_R L_{\text{Reg}}

Weights λi\lambda_i are adaptively balanced using the gradient-norm ratio method, and coarse-scale regularization terms are used to accelerate convergence and act as preconditioners for low-frequency solution components (Hintermüller et al., 2023).

4. Training Protocols and Computational Workflow

Training begins with a warm-up phase (epochs $1$–$500$) where physics loss is omitted (λPDE=0\lambda_{\text{PDE}} = 0), followed by a coupled physics phase ($501$–$3000$ epochs) where all objectives are activated and balanced adaptively (Chrisnanto et al., 13 Jan 2026). Optimization is performed using Adam during exploration and L-BFGS for convergence refinement. Collocation strategies employ up to 100,000 spatiotemporal samples per epoch, with boundary and regularization points drawn randomly.

In hybrid PDE-constrained settings (Hintermüller et al., 2023), the workflow alternates between adjoint-based gradient computation (leveraging automatic differentiation for neural objectives and sparse matrix solves for finite element state and adjoint systems) and neural parameter updates. Periodic re-weighting of loss terms ensures stability through training.

5. Spectral Phase Transitions and Optimization Dynamics

A defining feature is the observed Spectral Phase Transition, typically occurring near epoch 2,100\sim 2,100 (Chrisnanto et al., 13 Jan 2026). Prior to transition, loss stagnates at 100\sim 10^0; during transition, cooperative minimization of physics and geometry losses drives LtotalL_{\text{total}} down four orders of magnitude (10010510^0 \rightarrow 10^{-5}), signifying alignment of high-frequency modes and Pareto-optimal convergence. Beyond transition, minor oscillatory behavior is present as spectral alignment is refined.

In multiscale heat transfer, inclusion of coarse-scale regularization prevents overfitting to coarse gaps and improves the convergence rate and solution fidelity, as measured by residual norms and L2L^2 errors (Hintermüller et al., 2023).

6. Quantitative Performance and Ablation Results

Performance metrics include (Chrisnanto et al., 13 Jan 2026):

  • Relative L2L_2 error: εL2=AθAGT2/AGT2\varepsilon_{L_2} = \|A_\theta - A_{GT}\|_2 / \|A_{GT}\|_2.
  • Defect-count error: ΔNdefects=NpredNGT\Delta N_{\text{defects}} = |N_{\text{pred}} - N_{\text{GT}}|.
  • Curvature reconstruction: εκ\varepsilon_\kappa, Pearson correlation ρ\rho.

In chemical turbulence experiments,

Architecture εL2\varepsilon_{L_2} ΔNdefects\Delta N_{\text{defects}} ρ\rho
Multi-Scale SIREN-PINN 1.92×1021.92 \times 10^{-2} <1< 1 $0.965$
ReLU-PINN 4.12×1014.12 \times 10^{-1} $24.5$
Tanh-PINN 1.56×1011.56 \times 10^{-1} $12.2$
Fourier-PINN 8.45×1028.45 \times 10^{-2} $5.8$
Single-Scale SIREN-PINN 9.15×1029.15 \times 10^{-2} $0.82$

In multiscale heat transfer (Hintermüller et al., 2023), hybrid PINN methods lower L2L^2 errors to O(102)O(10^{-2})O(103)O(10^{-3}) versus stalled convergence (O(101)O(10^{-1})O(100)O(10^{0})) in non-hybrid baselines. The regularization term preconditions low-frequency solution components and leads to sharper, steadier convergence.

7. Applications and Broader Implications

Multi-Scale SIREN-PINN enables direct resolution of spatiotemporal chaos in reaction-diffusion systems, reconstruction of hidden curvature fields from wave dynamics, and accurate solution of ill-posed inverse pinning problems on latent manifolds (Chrisnanto et al., 13 Jan 2026). In heat transfer, it renders efficient upscaling in media with rapidly varying properties, leveraging hybrid neural/finite-element solvers (Hintermüller et al., 2023). The framework facilitates mesh-free, data-driven discovery of physical and geometric system properties, and supports engineering of passive control strategies via geometry-informed catalytic design.

This methodology suggests a paradigm shift for multiscale modeling, allowing for the synthesis of emergent physics and complex geometry, and improved solver convergence properties in the presence of multiple scales and high-frequency features.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Multi-Scale SIREN-PINN Architecture.