Fidelity-Aware Feature Modulation (FAFM) in GeoOpt-Net

Updated 6 February 2026

Fidelity-Aware Feature Modulation (FAFM) modulates hidden feature vectors with learnable scale and shift parameters so that a network's outputs can be adapted to a higher-fidelity quantum chemical level of theory.
In GeoOpt-Net it is built into a multi-branch SE(3)-equivariant architecture that combines radial, angular, and dihedral features for molecular geometry refinement.
The resulting model shortens quantum-chemical workflows by reducing the number of DFT optimization steps needed to reach DFT-quality geometries from inexpensive force-field conformers.

GeoOpt-Net is a multi-branch SE(3)-equivariant deep learning architecture designed for rapid and accurate refinement of molecular geometries, targeting single-shot prediction of density functional theory (DFT)-quality structures at the B3LYP/TZVP level directly from inexpensive, force-field-generated starting conformers. Implemented as an integrated graph neural operator with fidelity-aware calibration, GeoOpt-Net enables high-throughput, physically consistent geometry preparation for downstream quantum chemical workflows, substantially accelerating the pre-DFT optimization process without compromising on energetic or structural fidelity (Liu et al., 30 Jan 2026).

1. SE(3)-Equivariant Multi-Branch Network Architecture

GeoOpt-Net accepts as input a molecular graph $G = (V, E)$ and an initial coordinate matrix $R_\text{initial}$, such as one produced by RDKit’s ETKDG+MMFF94 pipeline. Its architecture features three explicit message-passing streams, each encoding geometric invariants of a different body order:

2-body stream: Encodes bond lengths $r_{ij}$.
3-body stream: Encodes angles $\theta_{ijk}$.
4-body stream: Encodes dihedrals $\phi_{ijkl}$.

Scalar ($\ell=0$) features are represented using radial basis expansions of distances, while directional ($\ell\geq1$) features use real spherical harmonics $Y^{(\ell)}(\hat r_{ij})$ to represent geometric orientation. These features are combined in each stream by Clebsch–Gordan projections:

$m_{ij}^{(\ell)} = \sum_{\ell_1, \ell_2} \left[ h_i^{(\ell_1)} \otimes Y^{(\ell_2)}(\hat r_{ij}) \right]_{CG} \cdot \phi(r_{ij})$

where “$\otimes$” denotes the tensor product, the subscript $_{CG}$ denotes Clebsch–Gordan projection onto order $\ell$, and $\phi(r_{ij})$ is a learnable radial filter (distinct from the dihedral angle $\phi_{ijkl}$).
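As a rough illustration of the message construction above, the following NumPy sketch builds the simplest coupling path: a scalar ($\ell_1=0$) node feature combined with the $\ell_2=1$ harmonic of the bond direction, weighted by a radial filter. The Gaussian basis, the linear filter weights `w_radial`, the cutoff, and all function names are assumptions made for this example; they are not details from the source, which uses e3nn for these operations.

```python
import numpy as np

def gaussian_rbf(r, num_basis=8, r_cut=5.0):
    """Expand distances r (shape [E]) on Gaussian basis functions -> [E, num_basis]."""
    centers = np.linspace(0.0, r_cut, num_basis)
    width = centers[1] - centers[0]
    return np.exp(-((r[:, None] - centers[None, :]) ** 2) / (2.0 * width ** 2))

def l1_messages(h_scalar, pos, edges, w_radial):
    """Messages for the path (l1=0 feature) x (l2=1 harmonic) -> l=1.

    h_scalar: [N] scalar node features; pos: [N, 3] coordinates;
    edges: [E, 2] integer (i, j) pairs; w_radial: [num_basis] weights of a
    linear radial filter phi(r) (a stand-in for a learned filter).
    """
    i, j = edges[:, 0], edges[:, 1]
    vec = pos[j] - pos[i]
    r = np.linalg.norm(vec, axis=1)
    # The l=1 real spherical harmonics are the unit-vector components,
    # up to a constant factor and an ordering convention.
    unit = vec / r[:, None]
    phi = gaussian_rbf(r, num_basis=w_radial.shape[0]) @ w_radial  # [E]
    return h_scalar[i, None] * unit * phi[:, None]                 # [E, 3]

rng = np.random.default_rng(0)
pos = rng.normal(size=(4, 3))
edges = np.array([[0, 1], [1, 2], [2, 3]])
h = rng.normal(size=4)
w = rng.normal(size=8)

# A random orthogonal matrix applied to every atom position.
q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
m = l1_messages(h, pos, edges, w)
m_rot = l1_messages(h, pos @ q.T, edges, w)
print(np.allclose(m_rot, m @ q.T))
```

Because the distances are unchanged by the transformation and the unit vectors rotate with the coordinates, the $\ell=1$ messages rotate in the same way, which is the property the Clebsch–Gordan construction generalizes to higher orders.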

Nonlinearities (e.g., GELU) and LayerNorm are applied strictly to scalar channels, while vector channels are updated only through linear maps and gating, which preserves equivariance. The three streams’ equivariant embeddings are fused via a lightweight Transformer decoder to yield a global latent $F_\theta(G, R_\text{initial}, d)$, which acts as the coordinate correction. The refined geometry is given by

$R_\text{refined} = R_\text{initial} + F_\theta(G, R_\text{initial}, d)$

Because the correction is built from equivariant features, applying a rotation or translation to the input coordinates produces the same transformation of the output.
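The equivariance of the residual update can be written out in one step, under the assumption (implied by the residual form but not stated explicitly in the source) that the correction rotates with the input and is unaffected by translations. For a rotation $Q$ and a translation $t$ applied to every atom position, $r_i \mapsto Q r_i + t$, this assumption reads

$F_\theta(G, Q R_\text{initial} + t, d) = Q\, F_\theta(G, R_\text{initial}, d)$

and therefore

$(Q R_\text{initial} + t) + F_\theta(G, Q R_\text{initial} + t, d) = Q \left[ R_\text{initial} + F_\theta(G, R_\text{initial}, d) \right] + t = Q R_\text{refined} + t$

so the refined geometry of the transformed input is the transformed refined geometry.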

2. Fidelity-Aware Feature Modulation (FAFM) and Two-Stage Training

GeoOpt-Net incorporates Fidelity-Aware Feature Modulation (FAFM) to inject theory- and basis-set-specific responses. Each hidden feature vector $h$ within the message-passing layers is modulated:

$\tilde h = h \odot (1 + g_d) + b_d$

where $\odot$ is the element-wise product, $d$ is a one-hot domain embedding (e.g., “6-31G(2df,p)” or “TZVP”), and $g_d$ and $b_d$ are the learnable scale and shift vectors for that domain. FAFM enables rapid re-calibration to higher-fidelity conditions through a two-stage procedure:

Stage 1 (Pre-training): The network is trained on $\sim$290k molecules from QM9+QM40 at B3LYP/6-31G(2df,p), with $g_d = b_d = 0$, so the modulation acts as the identity.
Stage 2 (Fine-tuning): Weights are warm-started from Stage 1 and FAFM is turned on for “TZVP” (high-fidelity). Only $g_d$, $b_d$, and the output layers are optimized, on $\sim$180k molecules from the QMe14S dataset at B3LYP/TZVP.

This mechanism allows efficient specialization without full retraining, capturing systematic shifts required by a larger basis set while preserving generalizability.
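The modulation and the two-stage parameter selection can be sketched as follows. This is a minimal NumPy illustration rather than the authors' PyTorch/e3nn code: the class `FAFM`, the helper `trainable_in_stage`, the `DOMAINS` mapping, and the parameter-name prefixes (`backbone.`, `fafm.`, `output.`) are all hypothetical.

```python
import numpy as np

class FAFM:
    """Per-fidelity scale and shift: h_tilde = h * (1 + g_d) + b_d."""

    def __init__(self, num_domains, num_features):
        # Zero initialisation makes the layer an identity map,
        # matching g_d = b_d = 0 during Stage 1.
        self.g = np.zeros((num_domains, num_features))
        self.b = np.zeros((num_domains, num_features))

    def __call__(self, h, domain):
        """h: [N, num_features] hidden features; domain: index of the one-hot d."""
        return h * (1.0 + self.g[domain]) + self.b[domain]

# Hypothetical index assignment for the two fidelities named in the text.
DOMAINS = {"6-31G(2df,p)": 0, "TZVP": 1}

def trainable_in_stage(stage, param_names):
    """Pick the parameters that receive gradient updates in each stage."""
    if stage == 1:
        # Pre-training: everything except the (zeroed) FAFM parameters.
        return [n for n in param_names if not n.startswith("fafm.")]
    # Fine-tuning: only FAFM scale/shift and the output layers.
    return [n for n in param_names if n.startswith(("fafm.", "output."))]

fafm = FAFM(num_domains=2, num_features=4)
h = np.arange(8.0).reshape(2, 4)

same = fafm(h, DOMAINS["6-31G(2df,p)"])   # identity while g = b = 0

# Pretend fine-tuning has learned a 10% scale-up and a -0.5 shift for TZVP.
fafm.g[DOMAINS["TZVP"]] = 0.1
fafm.b[DOMAINS["TZVP"]] = -0.5
shifted = fafm(h, DOMAINS["TZVP"])        # equals 1.1 * h - 0.5

names = ["backbone.w", "fafm.g", "fafm.b", "output.w"]
stage2 = trainable_in_stage(2, names)
```

One design point worth noting: adding a constant shift to $\ell\geq1$ channels would break rotational equivariance, so in an equivariant network the shift $b_d$ would presumably be restricted to scalar channels. The source does not state this explicitly.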

3. Training Protocols and Loss Functions

The loss function is a composite over multiple geometric targets:

$L = L_\text{rmsd} + \lambda_b L_\text{bond} + \lambda_a L_\text{angle} + \lambda_d L_\text{dihedral} + \lambda_r L_\text{bond\_range}$

The terms are defined as follows:

$L_\text{rmsd}$: Root-mean-square deviation between predicted and reference coordinates.
$L_\text{bond}$, $L_\text{angle}$, $L_\text{dihedral}$: Mean squared error for bond lengths, angles, and dihedrals.
$L_\text{bond\_range}$: Soft bond-range constraint via softplus penalties outside physically plausible intervals.
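The composite loss can be sketched in NumPy as below. The weights (`lam_*`), the bond-length interval (`r_min`, `r_max`), the softplus sharpness `beta`, and the periodic wrapping of dihedral differences are assumptions chosen for illustration; the source does not report these values or details.

```python
import numpy as np

def softplus(x, beta=20.0):
    """Smooth hinge, close to max(x, 0) for large beta."""
    return np.logaddexp(0.0, beta * x) / beta

def bond_lengths(pos, bonds):
    return np.linalg.norm(pos[bonds[:, 0]] - pos[bonds[:, 1]], axis=1)

def bond_angles(pos, triples):
    """Angle at the middle atom j of each (i, j, k) triple, in radians."""
    a = pos[triples[:, 0]] - pos[triples[:, 1]]
    b = pos[triples[:, 2]] - pos[triples[:, 1]]
    cos = np.sum(a * b, axis=1) / (np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1))
    return np.arccos(np.clip(cos, -1.0, 1.0))

def dihedrals(pos, quads):
    """Signed dihedral of each (i, j, k, l) quadruple, in radians (up to sign convention)."""
    p0, p1, p2, p3 = (pos[quads[:, k]] for k in range(4))
    b0, b1, b2 = p1 - p0, p2 - p1, p3 - p2
    n1, n2 = np.cross(b0, b1), np.cross(b1, b2)
    m1 = np.cross(n1, b1 / np.linalg.norm(b1, axis=1, keepdims=True))
    return np.arctan2(np.sum(m1 * n2, axis=1), np.sum(n1 * n2, axis=1))

def wrap(delta):
    """Map angle differences to [-pi, pi) so periodic dihedrals compare correctly."""
    return (delta + np.pi) % (2.0 * np.pi) - np.pi

def composite_loss(pred, ref, bonds, triples, quads,
                   lam_b=1.0, lam_a=1.0, lam_d=1.0, lam_r=1.0,
                   r_min=0.9, r_max=1.8):
    l_rmsd = np.sqrt(np.mean(np.sum((pred - ref) ** 2, axis=1)))
    r_pred = bond_lengths(pred, bonds)
    l_bond = np.mean((r_pred - bond_lengths(ref, bonds)) ** 2)
    l_angle = np.mean((bond_angles(pred, triples) - bond_angles(ref, triples)) ** 2)
    l_dihedral = np.mean(wrap(dihedrals(pred, quads) - dihedrals(ref, quads)) ** 2)
    # Penalise predicted bonds shorter than r_min or longer than r_max.
    l_range = np.mean(softplus(r_min - r_pred) + softplus(r_pred - r_max))
    return l_rmsd + lam_b * l_bond + lam_a * l_angle + lam_d * l_dihedral + lam_r * l_range

# Toy four-atom chain with bond lengths of roughly 1.5 to 1.6 Angstrom.
ref = np.array([[0.0, 0.0, 0.0], [1.5, 0.0, 0.0], [2.0, 1.4, 0.0], [3.4, 1.6, 0.8]])
bonds = np.array([[0, 1], [1, 2], [2, 3]])
triples = np.array([[0, 1, 2], [1, 2, 3]])
quads = np.array([[0, 1, 2, 3]])
pred = ref + 0.05 * np.random.default_rng(0).normal(size=ref.shape)

loss_exact = composite_loss(ref, ref, bonds, triples, quads)
loss_pred = composite_loss(pred, ref, bonds, triples, quads)
```

Since softplus is never exactly zero, `loss_exact` is a small positive number even for a perfect prediction; a larger `beta` pushes it closer to zero at the cost of a sharper penalty edge.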

Optimization uses AdamW with a learning rate of $10^{-3}$ and a batch size of 64, together with multistep learning-rate decay and gradient clipping. The implementation uses PyTorch, with e3nn for the equivariant operations and a custom Transformer decoder.
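For readers unfamiliar with the two training utilities named above, here is a small framework-free sketch of what they compute. Only the base learning rate of $10^{-3}$ comes from the text; the milestones, decay factor `gamma`, and clipping threshold `max_norm` are placeholder assumptions.

```python
import numpy as np

def multistep_lr(epoch, base_lr=1e-3, milestones=(50, 80), gamma=0.1):
    """Learning rate after multiplying by gamma at every milestone already reached."""
    return base_lr * gamma ** sum(epoch >= m for m in milestones)

def clip_by_global_norm(grads, max_norm=1.0):
    """Rescale a list of gradient arrays so their joint L2 norm is at most max_norm."""
    total = np.sqrt(sum(float(np.sum(g ** 2)) for g in grads))
    scale = min(1.0, max_norm / (total + 1e-12))
    return [g * scale for g in grads], total

clipped, norm_before = clip_by_global_norm([np.array([3.0, 4.0])], max_norm=1.0)
```

In PyTorch the corresponding built-ins are `torch.optim.lr_scheduler.MultiStepLR` and `torch.nn.utils.clip_grad_norm_`.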

4. Geometric, Energetic, and Electronic Performance

GeoOpt-Net achieves sub-milliångström all-atom RMSD for most molecules in the ZINC20 test set ($N=1000$), with the $\log_{10}(\text{RMSD})$ distribution sharply peaked at $-4$. Baseline methods (UMA, xTB, Auto3D, RDKit) show broader distributions between 0.1 and 1 Å. Single-point energy deviations at B3LYP/TZVP are centered near zero for GeoOpt-Net ($\sigma < 0.05$ kcal/mol), compared to multi-kcal/mol errors for baselines.

Error decomposition:

Metric	GeoOpt-Net	Best Baseline Range
Bonds (Å)	$10^{-4}$	0.01–0.05
Angles (°)	< 0.05	0.5–2
Dihedrals (°)	$\sim$ 0.1	5–30

Dipole moments ($\mu$, in Debye) computed at B3LYP/TZVP are preserved: 3.167 D for GeoOpt-Net versus 3.165 D for the reference, whereas baselines deviate by $\sim$0.37–0.5 D.

5. DFT Convergence and Workflow Acceleration

GeoOpt-Net’s refined geometries satisfy the individual DFT convergence criteria (maximum force, RMS force, maximum displacement, RMS displacement) in 40–58% of cases, versus $\sim$0% for baselines. The “All-YES” convergence rate (all four criteria satisfied simultaneously) is 65.0% under loose and 33.4% under default criteria; UMA, xTB, Auto3D, and RDKit all attain 0%. Using GeoOpt-Net as a pre-DFT initial guess reduces the average number of DFT geometry optimization steps by $\sim$50% ($\sim$14 for GeoOpt-Net vs. $\sim$30–35 for baselines), which corresponds to wall-clock speedups of 2×–2.5×. This streamlines quantum-chemical workflows and reduces failure rates and manual intervention.
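The four-criterion check behind the “All-YES” rate can be sketched as follows. The threshold numbers are illustrative placeholders in atomic units and the component-wise definitions of the maximum and RMS values are assumptions; the source does not list the thresholds or the exact definitions it used.

```python
import numpy as np

# Illustrative thresholds (atomic units); not taken from the source.
DEFAULT = {"max_force": 4.5e-4, "rms_force": 3.0e-4, "max_disp": 1.8e-3, "rms_disp": 1.2e-3}
LOOSE = {"max_force": 2.5e-3, "rms_force": 1.667e-3, "max_disp": 1.0e-2, "rms_disp": 6.667e-3}

def convergence_flags(forces, step, thresholds):
    """Return a YES/NO flag per criterion.

    forces: [N, 3] forces on the atoms; step: [N, 3] predicted next displacement.
    """
    values = {
        "max_force": np.max(np.abs(forces)),
        "rms_force": np.sqrt(np.mean(forces ** 2)),
        "max_disp": np.max(np.abs(step)),
        "rms_disp": np.sqrt(np.mean(step ** 2)),
    }
    return {name: bool(values[name] < thresholds[name]) for name in thresholds}

def all_yes(flags):
    """A geometry counts as converged only if all four criteria pass."""
    return all(flags.values())

# Toy geometry whose forces and displacements are all 1e-3 in magnitude.
forces = np.full((3, 3), 1.0e-3)
step = np.full((3, 3), 1.0e-3)
flags_default = convergence_flags(forces, step, DEFAULT)
flags_loose = convergence_flags(forces, step, LOOSE)
```

With these placeholder numbers the toy geometry passes all four loose criteria but fails the force criteria under the tighter set, which illustrates why the All-YES rate depends on the threshold set.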

6. Scalability, Robustness, and Practical Considerations

GeoOpt-Net generalizes robustly to drug-like molecules with up to 20 rotatable bonds and 40 heavy atoms, maintaining $\Delta E < 0.1$ kcal/mol, while baseline errors grow to several kcal/mol. The network’s SE(3)-equivariant design ensures geometric and energetic consistency under spatial symmetry operations. The method is implemented for neutral, closed-shell organic molecules; extension to open-shell or transition-metal systems would require additional training data. Memory and compute demands scale with the angular cutoff $\ell_{\max}$ and molecule size, and may become nontrivial for molecules with more than 50 heavy atoms.
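To make the dependence on $\ell_{\max}$ concrete: a feature channel that carries every order $\ell = 0, \dots, \ell_{\max}$ has $\sum_{\ell}(2\ell+1) = (\ell_{\max}+1)^2$ components, so the per-atom feature size grows quadratically with the cutoff. This is a general property of spherical-harmonic features, not a figure reported for GeoOpt-Net, and the helper name below is hypothetical.

```python
def irrep_components(l_max):
    """Number of components in one channel holding every order l = 0..l_max."""
    return sum(2 * l + 1 for l in range(l_max + 1))

for l_max in (0, 1, 2, 3):
    print(l_max, irrep_components(l_max))
# prints:
# 0 1
# 1 4
# 2 9
# 3 16
```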

7. Limitations and Extensions

GeoOpt-Net’s current domain is restricted to neutral, closed-shell organic structures. Integration of FAFM for additional quantum chemical fidelities beyond DFT (e.g., MP2, CCSD) would require further extension and calibration of the modulation mechanism. Model and hardware optimizations will be necessary for routine applications to very large biomolecules or inorganic clusters. Nonetheless, the approach transforms molecular geometry refinement preceding DFT from a multi-step bottleneck into a mesh-invariant, one-shot operation (Liu et al., 30 Jan 2026).

References

Liu et al. (2026). A Cross-Domain Graph Learning Protocol for Single-Step Molecular Geometry Refinement.
