Papers
Topics
Authors
Recent
Search
2000 character limit reached

Fourier-MIONet: Neural Operator for GCS

Updated 16 December 2025
  • The paper introduces Fourier-MIONet, a neural surrogate model that embeds Fourier Neural Operator layers within a multi-input framework to map complex inputs to nonlinear PDE solutions.
  • It employs dedicated branch and trunk networks with temporal inputs and U-FNO decoding to efficiently capture high-dimensional parameter-to-solution mappings for CO2 and brine flow.
  • The study shows significant improvements with about 3.7M parameters, 3.5× faster training, and enhanced physical consistency over 30-year simulations compared to traditional models.

Fourier-MIONet is a Fourier-enhanced multiple-input neural operator (MIONet) designed as a neural surrogate model for multiphase flow in porous media, particularly for simulating the evolution of CO2_2 and brine in geological carbon sequestration (GCS). It achieves significant computational efficiency and generalization advantages by embedding the Fourier Neural Operator (FNO) architecture within a multiple-input operator learning framework, enabling rapid, data-efficient, and physically consistent predictions of high-dimensional parameter-to-solution maps governed by nonlinear partial differential equations (PDEs) (Jiang et al., 2023).

1. Mathematical Formulation and Operator Learning Framework

Fourier-MIONet is constructed as a parametric operator learning architecture aimed at mapping multiple spatial and scalar input fields to solution fields of interest, such as CO2_2 saturation and pressure. Given nn input fields a1,a2,…,ana_1, a_2, \dots, a_n over corresponding domains DiD_i, the goal is to approximate the solution operator G\mathcal{G} such that

G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,

where uu is a field over the target domain D′D'. This is achieved through the MIONet structure, where nn branch networks 2_20 encode each input, and a trunk network 2_21 encodes target coordinates 2_22: 2_23 Temporal continuity is enforced by treating time 2_24 as an input to a trunk net, promoting physically consistent evolution across 2_25 for 2_26.

2. Network Architecture and Layer Design

Fourier-MIONet integrates FNO layers for learning global, mesh-free operator representations and U-FNO decoding to enhance expressive capacity:

  • Branch Networks:
    • Field branch: 2D convolutional encoder with three 2_27 convolution layers (20 channels each, ReLU activations, paddings), producing latent tensors of shape (batch, 2_28, 2_29) with nn0 retained Fourier modes.
    • Scalar branch: Four-layer multilayer perceptron (MLP) with widths [10, 20, 20, 20], mapping scalars to (batch, nn1).
  • Trunk Net: Three-layer MLP (widths 1→20→20→20) taking time nn2 and returning (batch, nn3).
  • Fusion Operations:
    • Branch merge: elementwise sum nn4
    • Branch–trunk merge: elementwise product nn5, yielding nn6 for each time step
  • Decoder (U-FNO): Four alternating FNO and U-Fourier layers in 2D, with each U-Fourier layer augmented by a small 2D U-Net. The decoder concludes with a nn7 convolution to map to the output channel.
  • Parameter Count: Approximately nn8 trainable parameters—one order-of-magnitude fewer than U-FNO's nn9.

FNO layers exploit the efficiency of the discrete Fourier transform: for a latent tensor a1,a2,…,ana_1, a_2, \dots, a_n0, modes a1,a2,…,ana_1, a_2, \dots, a_n1 are processed with learned multipliers a1,a2,…,ana_1, a_2, \dots, a_n2 (acting on low-frequency modes), with the spatial representation recovered by inverse transform and pointwise linear-activation additions: a1,a2,…,ana_1, a_2, \dots, a_n3

3. Governing Physical Model and Problem Setting

Fourier-MIONet addresses the nonlinear coupled PDE system describing two-phase (COa1,a2,…,ana_1, a_2, \dots, a_n4/water) flow in porous geologic formations:

  • Mass conservation: For phase a1,a2,…,ana_1, a_2, \dots, a_n5,

a1,a2,…,ana_1, a_2, \dots, a_n6

where a1,a2,…,ana_1, a_2, \dots, a_n7 is porosity, a1,a2,…,ana_1, a_2, \dots, a_n8 is phase saturation.

  • Phase flux (Darcy’s Law):

a1,a2,…,ana_1, a_2, \dots, a_n9

with absolute permeability DiD_i0, relative permeability DiD_i1, viscosity DiD_i2, and other standard parameters.

  • Capillary pressure coupling: DiD_i3.
  • Constitutive relations: DiD_i4 and DiD_i5 follow van Genuchten or Brooks–Corey models.

This formulation supports a wide range of input heterogeneity—including anisotropic and highly variable permeability fields, variable injection configurations, and process parameters—requiring an operator learning approach with strong generalization characteristics.

4. Training Regime, Resource Efficiency, and Performance

The network is trained on simulated datasets comprising 4,500 training instances and 500 test cases, with inputs discretized on a DiD_i6 grid and 24 logarithmically spaced time snapshots (DiD_i7 day to DiD_i8 years):

  • Inputs: 2D spatial fields (DiD_i9), scalar parameters (initial pressure G\mathcal{G}0, temperature G\mathcal{G}1, injection rate G\mathcal{G}2, viscosity ratio G\mathcal{G}3, irreducible water saturation G\mathcal{G}4, perforation depths).
  • Loss Function: G\mathcal{G}5 loss as in U-FNO,

G\mathcal{G}6

balancing fieldwise error and spatial gradient error.

  • Optimization: Adam optimizer (initial learning rate G\mathcal{G}7 with decay). Batch size: G\mathcal{G}8 (cases G\mathcal{G}9 time snapshots) per update.
  • Resource Usage Comparison:
Model Params CPU Mem (GiB) GPU Mem (GiB) Training Time Inference (s/case) Speedup
U-FNO G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,0M G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,1 G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,2 G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,3 h G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,4 1G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,5
Fourier-MIONet G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,6M G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,7 G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,8 G:(a1,a2,...,an)↦u,\mathcal{G}:(a_1, a_2, ..., a_n) \mapsto u,9 h uu0 uu1 (train), uu2 (infer)
  • Test Accuracy: uu3, MAE uu4, matching U-FNO.

5. Generalization, Data Efficiency, and Physical Consistency

Fourier-MIONet explicitly encodes temporal continuity in the operator by treating time uu5 as an input coordinate to the trunk network rather than as a discrete channel. This enables the model to interpolate and extrapolate uu6 smoothly over time, allowing accurate forecasting of the full 30-year COuu7 evolution from as few as 6 (nonuniformly chosen) training snapshots (test uu8, MAE uu9). This is in contrast to architectures relying on fixed time-channel discretization (e.g., U-FNO), which are unable to generalize to intermediate or unseen time points unless many snapshots are observed in training.

The approach has been validated under demanding scenarios, including permeability heterogeneity spanning four orders of magnitude, anisotropy ratios up to 10, and wide ranges of injection rates and capillary scales, consistently maintaining robust performance (Jiang et al., 2023).

6. Advantages, Limitations, and Practical Considerations

Advantages:

  • One order-of-magnitude fewer parameters than U-FNO while maintaining accuracy.
  • D′D'0 reduction in training time, D′D'1 faster inference.
  • Substantially reduced memory footprint: 85% less CPU memory, 64% less GPU memory.
  • Extreme data efficiency, supporting accurate 30-year predictions from only six data snapshots.
  • Intrinsically enforces temporal continuity and physical consistency.

Limitations:

  • The architecture relies on regular spatial grids to facilitate the FFT; extension to unstructured meshes is nontrivial.
  • Fourier convolution’s global nature may impact scalability in extremely large or high-dimensional spatial domains.
  • Hyperparameter selection (e.g., number of Fourier modes, network widths) remains application-specific.

A plausible implication is that, as with FNO-based models generally, pushing Fourier-MIONet toward very large or irregular domains may necessitate new domain decomposition or hybrid mesh strategies.

Fourier-MIONet exemplifies the integration of operator learning (MIONet) and Fourier-domain modeling (FNO), aligning with recent advances in mesh-free, global-receptive-field deep architectures for scientific machine learning. In operator learning, FNO layers enable efficient learning of parametric solution maps for PDE-governed systems, and the multi-input/trunk design allows explicit inclusion of both field and scalar process variables.

The architectural principles of Fourier-MIONet are applicable to a broad class of problems requiring solution of high-dimensional, parameterized PDEs—extending beyond geological carbon sequestration to possible applications in subsurface hydrology, reactive transport, and time-continuous simulation surrogates (Jiang et al., 2023).

Fourier neural operators have also begun to demonstrate potential in physical-layer modeling for complex electromagnetic and communications systems, highlighting the generality of the core convolutional operator-learning paradigm (Xiao et al., 6 Oct 2025). However, Fourier-MIONet's contributions are characterized by its tailored multi-input framework, physical consistency over time, and demonstrated empirical gains in data efficiency and computational resource usage for carbon storage simulation.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Fourier-MIONet.