Hypernetwork-Modulated Neural Quantum States
- The paper demonstrates that hypernetwork modulation in RBMs efficiently reconstructs an entire family of ground states, achieving fidelities of up to 99% across phase diagrams.
- It uses FiLM-style transformations to dynamically generate RBM bias vectors as smooth functions of the Hamiltonian parameter, enabling direct extraction of physical response functions.
- This unified, differentiable framework offers scalable quantum state tomography applicable to various many-body systems, eliminating the need for retraining at each parameter point.
Hypernetwork-modulated neural quantum states integrate neural-network quantum state ansätze with hypernetworks to create compact, differentiable models for quantum many-body wavefunctions conditioned on external control parameters. This approach allows a single model to represent an entire family of ground states for parametrized Hamiltonians, achieving efficient quantum state tomography (QST) across continuous regions of phase diagrams without the need for point-wise retraining. The framework has been validated on paradigmatic quantum models, offering high-fidelity reconstructions and enabling direct extraction of phase transition diagnostics from measurement data (Tonner et al., 28 Jan 2026).
1. Foundations: Neural-Network Quantum State Parametrization
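Neural-network quantum states (NQS) employ neural architectures to parameterize pure-state wavefunctions over discrete basis configurations. The Restricted Boltzmann Machine (RBM) defines a wavefunction for $N$ spins $\sigma = (\sigma_1, \dots, \sigma_N)$ and $M$ binary hidden units $h = (h_1, \dots, h_M)$ through the unnormalized marginal
$$\tilde{p}(\sigma) = \sum_{h \in \{0,1\}^M} \exp\left(a^\top \sigma + b^\top h + h^\top W \sigma\right),$$
where $a$ and $b$ denote visible and hidden biases, respectively, and $W$ are the RBM weights. In practical QST applications, the probability $p(\sigma) = \tilde{p}(\sigma)/Z$ is normalized by the partition function $Z = \sum_{\sigma} \tilde{p}(\sigma)$ and used to represent stoquastic ground states via $\psi(\sigma) = \sqrt{p(\sigma)}$.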
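The RBM parametrization described above can be made concrete for a system small enough to enumerate. The following is a minimal sketch, assuming a $\{0,1\}$ encoding of the visible units and the free-energy form $F(\sigma) = -a^\top\sigma - \sum_j \mathrm{softplus}(b_j + W_j \cdot \sigma)$, so that $p(\sigma) = e^{-F(\sigma)}/Z$. The function names and the toy parameter values are illustrative and are not taken from the paper.

```python
import itertools
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def free_energy(sigma, a, b, W):
    """RBM free energy F(sigma) = -a.sigma - sum_j softplus(b_j + W_j.sigma)."""
    visible_term = sum(a_i * s_i for a_i, s_i in zip(a, sigma))
    hidden_term = sum(
        softplus(b_j + sum(w_ji * s_i for w_ji, s_i in zip(W_j, sigma)))
        for b_j, W_j in zip(b, W)
    )
    return -visible_term - hidden_term


def wavefunction(a, b, W):
    """Return {sigma: psi(sigma)} with psi = sqrt(p) and p = exp(-F) / Z."""
    configs = list(itertools.product((0, 1), repeat=len(a)))
    weights = [math.exp(-free_energy(s, a, b, W)) for s in configs]
    Z = sum(weights)  # brute-force partition function, feasible only for small N
    return {s: math.sqrt(w / Z) for s, w in zip(configs, weights)}


# Toy RBM (assumed values): 3 visible units, 2 hidden units, W is hidden x visible.
a = [0.1, -0.2, 0.3]
b = [0.0, 0.5]
W = [[0.4, -0.1, 0.2], [-0.3, 0.6, 0.1]]

psi = wavefunction(a, b, W)
norm = sum(amp ** 2 for amp in psi.values())
```

Because the amplitudes are square roots of a normalized distribution, `norm` equals one up to rounding. The exhaustive sum over configurations is only a device for this toy example; it scales as $2^N$ and is not how larger systems would be handled.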
Classical RBM-based QST is limited to single problem instances: each ground state corresponding to a different Hamiltonian parameter must be reconstructed via independent training. This point-wise strategy leads to inefficiencies when quantum systems are studied over an extended parameter regime, such as during phase transition analysis.
2. Hypernetwork Architecture for Quantum State Families
To overcome this limitation, hypernetwork-modulated RBMs ("HyperRBMs") incorporate a hypernetwork that dynamically generates certain RBM parameters as smooth functions of a control parameter $g$ (e.g., the transverse field in the transverse-field Ising model).
Specifically, the hypernetwork is a multilayer perceptron (MLP) mapping $g$ to feature-wise scale and shift vectors $\gamma(g)$ and $\beta(g)$. Only the RBM bias vectors, the visible biases $a$ and hidden biases $b$, are modulated: $a(g) = \gamma_a(g) \odot a + \beta_a(g)$ and $b(g) = \gamma_b(g) \odot b + \beta_b(g)$, with the weights $W$ shared across all $g$. The expressive power derives from FiLM-style ("feature-wise linear modulation") transformations, which keep the hypernetwork's output dimension low ($O(N+M)$ for $N$ visible and $M$ hidden units) and scalable. The parameter map is $g \mapsto \theta(g) = (a(g), b(g), W)$, yielding the conditional family $\psi_{\theta(g)}(\sigma)$.
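The bias modulation can be illustrated with a small sketch. It assumes a one-hidden-layer MLP with a scalar control parameter `g` as input and the plain FiLM form `gamma * bias + beta`; the exact FiLM variant, network width, and initialization used in the paper are not specified here, and all names below (`mlp_hypernetwork`, `film_biases`, `init_params`) are hypothetical.

```python
import math
import random


def mlp_hypernetwork(g, params):
    """One-hidden-layer MLP mapping a scalar g to a vector of length 2 * n_bias."""
    hidden = [math.tanh(w * g + c) for w, c in zip(params["w1"], params["c1"])]
    return [
        sum(w_kj * h_j for w_kj, h_j in zip(row, hidden)) + c_k
        for row, c_k in zip(params["w2"], params["c2"])
    ]


def film_biases(g, base_bias, params):
    """FiLM modulation: bias(g) = gamma(g) * base_bias + beta(g), elementwise."""
    out = mlp_hypernetwork(g, params)
    n = len(base_bias)
    gamma, beta = out[:n], out[n:]  # first half scales, second half shifts
    return [gm * x + bt for gm, x, bt in zip(gamma, base_bias, beta)]


def init_params(n_bias, width, seed=0):
    """Random MLP parameters; output offsets start near gamma = 1, beta = 0."""
    rng = random.Random(seed)
    return {
        "w1": [rng.gauss(0.0, 1.0) for _ in range(width)],
        "c1": [rng.gauss(0.0, 1.0) for _ in range(width)],
        "w2": [[rng.gauss(0.0, 0.1) for _ in range(width)] for _ in range(2 * n_bias)],
        "c2": [1.0] * n_bias + [0.0] * n_bias,
    }


n_visible, n_hidden = 3, 2
# Concatenated visible and hidden base biases (assumed toy values).
base_bias = [0.1, -0.2, 0.3, 0.0, 0.5]
params = init_params(len(base_bias), width=4)

biases_low = film_biases(0.5, base_bias, params)   # biases at g = 0.5
biases_high = film_biases(2.0, base_bias, params)  # biases at g = 2.0
```

The hypernetwork output has `2 * (n_visible + n_hidden)` entries, one scale and one shift per bias, while the weight matrix is left untouched. That is why the modulation cost grows with the number of units rather than with the number of weights.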
3. Training Objectives and Optimization Strategies
The model is trained for parametric QST using projective measurements taken at multiple values of the Hamiltonian parameter $g$. For each support value $g_i$, a measurement dataset $\mathcal{D}_i$ is collected. The loss function sums negative log-likelihoods across all support fields:
$$\mathcal{L} = -\sum_{i} \sum_{\sigma \in \mathcal{D}_i} \log p_{\theta(g_i)}(\sigma),$$
where $p_{\theta(g_i)}$ is the RBM distribution with the parameters generated at $g_i$. Minimizing this loss requires approximating the gradient of the log-partition function, which is done via Contrastive Divergence (CD-$k$); two settings of the step count $k$ are used for the negative phase. The Gibbs chains used in negative-phase sampling include a small fraction of randomly initialized visible states to improve mixing. Parameters are optimized jointly using Adam with an inverse-sigmoid learning-rate schedule that decays from an initial to a final learning rate.
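To make the summed negative log-likelihood concrete, the sketch below evaluates it exactly for a two-spin toy model, using $-\log p(\sigma) = F(\sigma) + \log Z$. This is an exact-enumeration evaluation for illustration only, not the Contrastive Divergence estimator used in training. The `toy_bias_fn` stand-in for the hypernetwork, the $\{0,1\}$ encoding, and all numerical values are assumptions.

```python
import itertools
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def free_energy(sigma, a, b, W):
    """RBM free energy F(sigma) = -a.sigma - sum_j softplus(b_j + W_j.sigma)."""
    visible_term = sum(a_i * s_i for a_i, s_i in zip(a, sigma))
    hidden_term = sum(
        softplus(b_j + sum(w_ji * s_i for w_ji, s_i in zip(W_j, sigma)))
        for b_j, W_j in zip(b, W)
    )
    return -visible_term - hidden_term


def log_partition(a, b, W):
    """Exact log Z by enumerating all visible configurations (log-sum-exp)."""
    neg_f = [
        -free_energy(s, a, b, W)
        for s in itertools.product((0, 1), repeat=len(a))
    ]
    m = max(neg_f)
    return m + math.log(sum(math.exp(x - m) for x in neg_f))


def joint_nll(datasets, bias_fn, W):
    """Sum over support values g of -sum_{sigma in D_g} log p_g(sigma)."""
    total = 0.0
    for g, samples in datasets.items():
        a_g, b_g = bias_fn(g)  # parameter-dependent biases, shared weights W
        log_z = log_partition(a_g, b_g, W)
        for sigma in samples:
            total += free_energy(sigma, a_g, b_g, W) + log_z
    return total


def toy_bias_fn(g):
    """Hypothetical stand-in for the hypernetwork: visible biases shrink with g."""
    return [1.0 / (1.0 + g)] * 2, [0.0, 0.0]


W = [[0.5, 0.5], [-0.5, 0.5]]  # hidden x visible, shared across all g
datasets = {0.5: [(1, 1), (1, 1), (0, 0)], 2.0: [(0, 1), (1, 0)]}
loss = joint_nll(datasets, toy_bias_fn, W)
```

All support points contribute to one scalar loss through the same weight matrix, which is what allows a single optimization run to cover the whole parameter range. For realistic system sizes `log_partition` is intractable, which is the reason a sampling-based gradient estimate such as CD-$k$ is needed.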
Physical symmetries are imposed during training when needed; for example, spin-flip symmetry in the ferromagnetic regime is enforced by a symmetrized free-energy ansatz, under which a configuration and its globally spin-flipped partner receive the same free energy.
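One common way to write such a symmetrization, assumed here rather than taken from the paper, is $F_{\mathrm{sym}}(\sigma) = -\log\left(e^{-F(\sigma)} + e^{-F(\bar{\sigma})}\right)$, where $\bar{\sigma}$ is the globally flipped configuration. The sketch below implements this form for a $\{0,1\}$ encoding, in which flipping means $\sigma_i \to 1 - \sigma_i$; the function names and parameter values are illustrative.

```python
import itertools
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def free_energy(sigma, a, b, W):
    """RBM free energy F(sigma) = -a.sigma - sum_j softplus(b_j + W_j.sigma)."""
    visible_term = sum(a_i * s_i for a_i, s_i in zip(a, sigma))
    hidden_term = sum(
        softplus(b_j + sum(w_ji * s_i for w_ji, s_i in zip(W_j, sigma)))
        for b_j, W_j in zip(b, W)
    )
    return -visible_term - hidden_term


def symmetrized_free_energy(sigma, a, b, W):
    """F_sym(sigma) = -log(exp(-F(sigma)) + exp(-F(flipped sigma)))."""
    flipped = tuple(1 - s for s in sigma)  # global spin flip in {0,1} encoding
    f1 = free_energy(sigma, a, b, W)
    f2 = free_energy(flipped, a, b, W)
    m = min(f1, f2)  # factor out the dominant term for numerical stability
    return m - math.log(math.exp(-(f1 - m)) + math.exp(-(f2 - m)))


# Toy RBM (assumed values): 3 visible units, 2 hidden units.
a = [0.1, -0.2, 0.3]
b = [0.0, 0.5]
W = [[0.4, -0.1, 0.2], [-0.3, 0.6, 0.1]]

configs = list(itertools.product((0, 1), repeat=3))
f_sym = {s: symmetrized_free_energy(s, a, b, W) for s in configs}
```

By construction `f_sym` takes the same value on a configuration and on its flipped partner, so the induced distribution is invariant under the global flip regardless of the underlying RBM parameters.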
4. Physical Results: Fidelity, Response Functions, Entanglement
The HyperRBM's performance was established for the transverse-field Ising model on both a 1D chain and 2D lattices:
- State Fidelity: Overlaps between the reconstructed wavefunction and ground states from Lanczos exact diagonalization (ED) yield high fidelities across the phase diagram at both of the reported shot budgets per support point.
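- Fidelity Susceptibility: The model produces a smooth, differentiable family of states $\psi(g)$ in the control parameter $g$, enabling direct calculation of the fidelity susceptibility
$$\chi_F(g) = -\left.\frac{\partial^2}{\partial \delta^2} \ln \left|\langle \psi(g) | \psi(g+\delta) \rangle\right| \right|_{\delta = 0}.$$
Peaks in $\chi_F(g)$ accurately identify the critical field $g_c$ without prior input.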
- Rényi-2 Entropy Surfaces: For the 1D chain, the second Rényi entropy $S_2$ is computed via the swap trick and RBM free energies, matching ED results across the subsystem sizes considered, with smooth dependence on the control parameter and correct asymptotics in the ferromagnetic and paramagnetic phases.
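The fidelity-susceptibility diagnostic listed above can be sketched with a central finite difference, using $\chi_F(g) \approx -2 \ln \left|\langle \psi(g - \delta/2) | \psi(g + \delta/2) \rangle\right| / \delta^2$. The two-component state family `toy_state`, its crossover point `G_C`, and its width `WIDTH` are hypothetical stand-ins for a trained conditional model; they are chosen so that the exact answer, $(d\theta/dg)^2$, is known and peaks at `G_C`.

```python
import math

G_C = 1.0     # assumed location of the crossover in the toy family
WIDTH = 0.2   # assumed width of the crossover


def toy_state(g):
    """Real, normalized two-component state that rotates rapidly near G_C."""
    theta = math.atan((g - G_C) / WIDTH)
    return [math.cos(theta), math.sin(theta)]


def overlap(psi_a, psi_b):
    """Inner product of two real amplitude vectors."""
    return sum(x * y for x, y in zip(psi_a, psi_b))


def fidelity_susceptibility(state_fn, g, delta=1e-3):
    """Central finite-difference estimate: -2 ln|<psi(g-d/2)|psi(g+d/2)>| / d^2."""
    psi_minus = state_fn(g - 0.5 * delta)
    psi_plus = state_fn(g + 0.5 * delta)
    fidelity = abs(overlap(psi_minus, psi_plus))
    return -2.0 * math.log(fidelity) / delta ** 2


# Scan the control parameter and locate the peak of the susceptibility.
grid = [0.5 + 0.05 * i for i in range(21)]
chi = [fidelity_susceptibility(toy_state, g) for g in grid]
g_peak = grid[chi.index(max(chi))]
chi_at_peak = max(chi)
```

For this toy family the exact susceptibility at the crossover is $1/\mathrm{WIDTH}^2 = 25$, and the scan recovers the peak at `G_C` without being told where it is. With a differentiable model the same quantity could instead be obtained from derivatives with respect to `g`, avoiding the choice of `delta`.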
5. Advantages, Scalability, and Generalizations
Hypernetwork-modulated NQS provide a unified, differentiable ansatz capable of interpolating smoothly over an entire parameter regime, bypassing the need for retraining at every parameter value. Key methodological advantages include:
- Low-dimensional hypernetwork modulation (scaling with the number of visible and hidden biases rather than with the number of weights), enabling practical training for sizable systems.
- Direct computation of physical response functions and entanglement diagnostics through gradients, supporting phase transition studies.
- Scalability to larger systems is feasible by replacing ED ground-truth data with quantum Monte Carlo (QMC) or tensor-network samples. The framework naturally generalizes to non-stoquastic models through parametrizations that represent both amplitude and phase.
- Extension to autoregressive or convolutional NQS is a potential avenue to enhance sampling efficiency and stability in high-dimensional Hilbert spaces.
6. Implications for Quantum Tomography and Phase Diagram Studies
The development of HyperRBMs establishes that a single neural-network ansatz can reconstruct quantum ground states and their phase transitions throughout the entire phase diagram from tomographic data, in contrast to conventional QST methods limited by exponential scaling and per-point retraining. The differentiability in the control parameter enables a suite of physical analyses—including fidelity susceptibility and entanglement entropy surfaces—within a single, unified framework. This suggests broad applicability to quantum device validation, quantum simulation, and many-body physics, particularly as data modalities and system sizes grow.
7. Outlook and Future Directions
Potential future developments include the use of QMC and tensor-network data for training on larger systems, the application to non-stoquastic and complex-valued quantum states via generalized RBMs, and integration with autoregressive or convolutional architectures to further improve scalability and performance in quantum state tomography and phase diagram diagnostics (Tonner et al., 28 Jan 2026).