MSTAR AR-CHAIN Integrator
- MSTAR AR-CHAIN integrator is a high-precision, massively parallel algorithm for gravitational N-body problems, achieving machine-precision accuracy in close encounters.
- It utilizes a minimum spanning tree-based coordinate construction to significantly reduce numerical errors, outperforming classical chain methods by an order of magnitude.
- The integrator combines Gragg–Bulirsch–Stoer extrapolation with two-level parallelisation, enabling efficient simulations of galactic nuclei and SMBH binaries with high particle counts.
The MSTAR AR-CHAIN integrator is a high-precision, massively parallel algorithm for integrating gravitational -body systems, specifically engineered to achieve machine-precision accuracy () in configurations dominated by close encounters. Building on the algorithmic regularisation (AR) framework that underlies the AR-CHAIN method, MSTAR introduces a combinatorial coordinate construction based on minimum spanning trees (MST), together with advanced extrapolation schemes and two-level parallelisation that together circumvent computational bottlenecks of earlier approaches. The method enables direct summation integrations of regularised subsystems at a scale () and with a level of accuracy previously unattainable in galactic dynamics and stellar cluster modelling, facilitating next-generation simulations of galaxy nuclei and supermassive black hole (SMBH) binaries (Rantala et al., 2020).
1. Algorithmic Regularisation: Hamiltonian Structure and Time Transformation
MSTAR, like AR-CHAIN, operates by restructuring the classical Newtonian -body Hamiltonian
into a regularised form by extending phase space. Physical time becomes a canonical coordinate with conjugate momentum , and a fictitious regularisation time is introduced via the transformation
For both MSTAR and AR-CHAIN, the logarithmic Hamiltonian prescription yields , eliminating the singular behaviour of as pairs approach. The transformed Hamiltonian becomes
permitting a splitting into drift (momentum only) and kick (position only) operators, on which a second-order leapfrog in is exact for Keplerian orbits. As , ensures that "physical time stalls at collisions," rendering the equations of motion manifestly regular and free of force divergences (Rantala et al., 2020, Wang et al., 2021).
2. MST-Based Coordinate Construction and Stability Advantages
Unlike AR-CHAIN, which employs a single, sequential chain of coordinate differences between adjacent particles, MSTAR constructs a minimum spanning tree on the particle positions, selecting edges that connect all particles with minimal total length via (e.g.) Prim's algorithm. Each edge is directed from a child to a unique parent , with MST-chain coordinates defined as
The original coordinates are recovered by path summation to a root (com, or arbitrary particle). For evaluating vector differences , if both nodes are "close" in the tree topology (within edges via the lowest common ancestor), their MST-path is used directly; otherwise, direct coordinates are referenced.
Key advantages of the MST decomposition relative to chain coordinates include:
- Average depth , compared to chain length ; this considerably reduces floating-point accumulation errors and the arithmetic required for coordinate transforms.
- In practice, the energy drift induced by coordinate roundtrips is an order of magnitude smaller () than in the classical chain.
- The MST structure generalizes to arbitrary without pathological dependence on initial ordering or hierarchical clustering (Rantala et al., 2020).
3. Gragg–Bulirsch–Stoer Extrapolation and Adaptive Integration
MSTAR employs the Gragg–Bulirsch–Stoer (GBS) extrapolation method atop the regularised -time leapfrog integration, achieving rapid convergence toward the zero step-size solution. For a global timestep , a sequence of substep counts (Deuflhard sequence ) is chosen, with physical step , leading to a composition: where and denote drift and kick phases. Extrapolation is realized through the Neville-Aitken triangular scheme, with error estimates as
and convergence accepted when
Step size is then adapted for the next interval according to
with . As the underlying leapfrog is symmetric, the extrapolated local error acquires scaling, promoting extremely high conservation of energy, angular momentum, and Laplace–Runge–Lenz vector, e.g., , radians (Rantala et al., 2020).
4. Parallelisation Architecture and Scalability
MSTAR integrates force calculation and extrapolation parallelism using the Message Passing Interface (MPI):
- Force-loop parallelism: pairwise force computations are divided among MPI processes. Linear strong scaling is observed up to , above which communication overheads dominate.
- GBS subdivision parallelism: The substeps for each GBS order are partitioned among groups. Each group integrates its assigned substeps independently prior to global extrapolation.
- Combined efficiency: The optimal product is,
which delivers near-ideal scaling; the law arises from maximizing overall speed-up before communication and synchronisation penalties set in (Rantala et al., 2020).
5. Performance, Benchmarks, and Comparison with AR-CHAIN
| Metric | MSTAR | AR-CHAIN |
|---|---|---|
| Energy conservation | ||
| run | Weeks on 400 CPUs (1 Gyr) | Many months |
| Serial speed | $2$– faster | — |
| Parallel speedup | $55$– (MSTAR vs. serial); up to (AR-CHAIN baseline) | Saturates at CPUs, then declines |
| Scalability | Ideal to CPUs, saturates by | — |
Serial MSTAR outperforms AR-CHAIN by $2$– due to improved cache and simpler GBS logic. Parallel MSTAR gains $55$– over its serial case and, crucially, up to over AR-CHAIN for –. Scalability is robust to CPUs, compared to AR-CHAIN's sub- plateau, making MSTAR suitable for simulations with unprecedented particle counts and precision (Rantala et al., 2020).
6. API-Level Coupling with Tree and Fast Multipole Integrators
MSTAR is intended as an embedded subsystem integrator within hybrid -body codes using tree or fast multipole solvers (e.g., GADGET-3). In this role:
- Small, high-density "collisional" regions (typically near SMBHs) containing – particles are passed to MSTAR for regularised integration.
- Positions and velocities are returned to the global solver at synchronised intervals.
- MSTAR’s capacity to integrate without softening allows galactic simulations with – stars and collisional subsystems to be computed self-consistently, a key advance for capturing phenomena such as SMBH binary evolution and LISA-band gravitational wave source populations (Rantala et al., 2020).
7. Further Context: Comparison and Evolution from AR-CHAIN
The AR-CHAIN class of integrators, originally formalized by Mikkola & Merritt and subsequently extended by toolkit implementations such as SpaceHub, regularises close encounters using a linear chain of coordinate differences and combines algorithmic regularisation with compensational floating-point arithmetic and high-order symplectic steps (Wang et al., 2021). MSTAR’s MST-based coordinate system extends AR-CHAIN by reducing coordinate construction error over the chain, removing linear-order constraints, and supporting two-level parallelism required for modern high-performance computing environments.
A plausible implication is that, for future multi-physics simulations requiring accurate collisional dynamics in multi-million particle systems, MSTAR not only removes the AR-CHAIN bottleneck but defines a new scaling regime for regularised -body computation. Energy, angular momentum, and LRL invariants track reference orbits at or near machine precision across long timescales, making the technique foundational for next-generation galactic dynamics, nuclear star clusters, and gravitational wave source studies (Rantala et al., 2020, Wang et al., 2021).