Metriplector: From Field Theory to Neural Architecture

Published 31 Mar 2026 in cs.AI and cs.LG | (2603.29496v1)

Abstract: We present Metriplector, a neural architecture primitive in which the input configures an abstract physical system -- fields, sources, and operators -- and the dynamics of that system is the computation. Multiple fields evolve via coupled metriplectic dynamics, and the stress-energy tensor $T^{\mu\nu}$, derived from Noether's theorem, provides the readout. The metriplectic formulation admits a natural spectrum of instantiations: the dissipative branch alone yields a screened Poisson equation solved exactly via conjugate gradient; activating the full structure -- including the antisymmetric Poisson bracket -- gives field dynamics for image recognition and language modeling. We evaluate Metriplector across four domains, each using a task-specific architecture built from this shared primitive with progressively richer physics: F1=1.0 on maze pathfinding, generalizing from 15x15 training grids to unseen 39x39 grids; 97.2% exact Sudoku solve rate with zero structural injection; 81.03% on CIFAR-100 with 2.26M parameters; and 1.182 bits/byte on language modeling with 3.6x fewer training tokens than a GPT baseline.

Authors (2)

Summary

  • The paper introduces Metriplector, which models neural computation as field dynamics governed by metriplectic equations, unifying vision, reasoning, and language tasks.
  • The methodology integrates physics-inspired operators, including stress-energy tensor readouts via Noether's theorem and a learned Poisson bracket, to achieve competitive accuracy with fewer parameters.
  • Empirical results demonstrate robust generalization across domains such as maze solving, Sudoku, CIFAR-100, and language modeling, outperforming conventional architectures in efficiency and accuracy.

Metriplector: Physics-Grounded Neural Architecture via Metriplectic Dynamics

Overview

The paper "Metriplector: From Field Theory to Neural Architecture" (2603.29496) introduces Metriplector, an architecture primitive that instantiates computations as dynamics over abstract physical fields governed by the metriplectic (GENERIC) formalism. The key idea is that the input configures an abstract physical system (fields, sources, and operators), and the evolution of this system under metriplectic dynamics is itself the computation. Readout is performed via the stress-energy tensor $T^{\mu\nu}$, a conserved quantity derived from Noether's theorem.

Metriplector is deployed across multiple domains—maze pathfinding, Sudoku constraint satisfaction, CIFAR-100 image recognition, and causal language modeling—with each task utilizing only a subset of the metriplectic spectrum (diffusive, Hamiltonian, or full dynamics). The architecture achieves strong parameter efficiency, competitive accuracy, and robust structural generalization across tasks, notably solving Sudoku with zero structural injection and outperforming conventional architectures at similar parameter counts for CIFAR-100.

Architectural Foundations

Metriplector is built on variational mechanics and the GENERIC framework. In classical physics, the metriplectic equation unifies reversible (Hamiltonian) and irreversible (dissipative) dynamics:

$$\dot{z} = L(z) \cdot \nabla E(z) + M(z) \cdot \nabla S(z)$$

where $L$ is a skew-symmetric Poisson bracket (energy conserving), $M$ is a symmetric positive semi-definite metric bracket (entropy producing), $E$ is the energy functional, and $S$ is the entropy functional. Crucially, the degeneracy conditions ($M \cdot \nabla E = 0$, $L \cdot \nabla S = 0$) enforce that Hamiltonian dynamics cannot produce entropy and dissipation cannot alter energy.
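To make the degeneracy conditions concrete, here is a minimal sketch on a textbook toy system: a damped oscillator whose friction heat is stored in an entropy-like variable. The state `z = (q, p, s)`, the friction coefficient `GAMMA`, and the specific choices of `E`, `S`, `L`, and `M` are illustrative assumptions, not taken from the paper. The point is only that $\nabla E \cdot \dot{z} = 0$ and $\nabla S \cdot \dot{z} \ge 0$ follow from the structure of the two brackets.

```python
import numpy as np

GAMMA = 0.3  # hypothetical friction coefficient for this toy system


def energy(z):
    q, p, s = z
    return 0.5 * q * q + 0.5 * p * p + s  # mechanical energy plus stored heat


def grad_E(z):
    q, p, _ = z
    return np.array([q, p, 1.0])


def grad_S(z):
    return np.array([0.0, 0.0, 1.0])  # entropy S(z) = s


def poisson_L(z):
    # Skew-symmetric and annihilates grad_S (third row and column are zero).
    return np.array([[0.0, 1.0, 0.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 0.0]])


def metric_M(z):
    # Rank-one PSD matrix built from v with v . grad_E = 0, so M @ grad_E = 0.
    _, p, _ = z
    v = np.array([0.0, 1.0, -p])
    return GAMMA * np.outer(v, v)


def rhs(z):
    # Metriplectic vector field: reversible part plus dissipative part.
    return poisson_L(z) @ grad_E(z) + metric_M(z) @ grad_S(z)


def integrate(z0, dt=1e-3, steps=5000):
    # Explicit Euler; energy is conserved only up to O(dt) discretisation error.
    z = np.array(z0, dtype=float)
    traj = [z.copy()]
    for _ in range(steps):
        z = z + dt * rhs(z)
        traj.append(z.copy())
    return np.array(traj)


z_probe = np.array([0.7, -1.2, 0.4])
energy_rate = grad_E(z_probe) @ rhs(z_probe)   # zero up to rounding
entropy_rate = grad_S(z_probe) @ rhs(z_probe)  # GAMMA * p**2, never negative
traj = integrate([1.0, 0.0, 0.0])
```

Written out, the dynamics are `dq/dt = p`, `dp/dt = -q - GAMMA*p`, `ds/dt = GAMMA*p**2`: the energy lost to friction reappears in `s`, so the total energy stays constant while the entropy only grows.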

This formalism is adapted to neural computation: input embeddings configure a physics operator (via learned projections), fields evolve according to the metriplectic equation, and features are extracted via Noether's conserved currents. Figure 1

Figure 1: Visualization of Metriplector field interaction, depicting field evolution, gradient energies, cross-field correlations, and vorticity computation for readout.

Domain-Specific Instantiations

Dissipative Branch: Screened Poisson

For spatial reasoning (maze, Sudoku), Metriplector reduces to solving a screened Poisson equation on a graph, representing equilibrium of the dissipative dynamics:

$$(L_W + \Lambda)\,\phi = \mathbf{b}$$

Here, $L_W$ is a learned graph Laplacian built from learned edge conductances, $\Lambda$ is the per-node damping, and $\mathbf{b}$ is a source vector. The system is solved via conjugate gradient, with efficient implicit differentiation for gradients.
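The following sketch shows what solving such a system can look like on a small 4-neighbour grid. It is illustrative only: the conductances are random rather than learned, the damping is uniform, and the helper names `grid_laplacian` and `conjugate_gradient` are hypothetical. Because $L_W$ is positive semi-definite and $\Lambda$ is positive, the matrix is symmetric positive definite, which is what conjugate gradient requires.

```python
import numpy as np


def grid_laplacian(h, w, rng):
    """Weighted Laplacian of an h x w grid with random positive conductances."""
    n = h * w
    lap = np.zeros((n, n))
    for r in range(h):
        for c in range(w):
            i = r * w + c
            for dr, dc in ((0, 1), (1, 0)):  # right and down neighbours
                rr, cc = r + dr, c + dc
                if rr < h and cc < w:
                    j = rr * w + cc
                    g = rng.uniform(0.5, 1.5)  # stand-in for a learned conductance
                    lap[i, i] += g
                    lap[j, j] += g
                    lap[i, j] -= g
                    lap[j, i] -= g
    return lap


def conjugate_gradient(A, b, tol=1e-10, max_iter=None):
    """Textbook CG for a symmetric positive definite matrix A."""
    max_iter = 10 * len(b) if max_iter is None else max_iter
    x = np.zeros_like(b)
    r = b - A @ x
    p = r.copy()
    rs = r @ r
    for _ in range(max_iter):
        if np.sqrt(rs) < tol:
            break
        Ap = A @ p
        alpha = rs / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        rs_new = r @ r
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x


rng = np.random.default_rng(0)
H, W = 5, 5
A = grid_laplacian(H, W, rng) + 0.1 * np.eye(H * W)  # L_W + Lambda
source = 12                                          # centre node of the grid
b = np.zeros(H * W)
b[source] = 1.0
phi = conjugate_gradient(A, b)
```

With a single positive source, the solution `phi` is positive everywhere and peaks at the source node, decaying with graph distance at a rate set by the damping.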

Full Metriplectic Dynamics

For recognition tasks such as CIFAR-100, both dissipative and Hamiltonian components are activated, requiring Euler integration over the field channels. The antisymmetric Poisson bracket $L$, learned as a matrix that couples the field channels, enables cross-field advection and energy-conserving feature mixing, which is essential for fine-grained class discrimination.
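As a rough illustration of energy-conserving channel mixing, the sketch below parameterises a skew-symmetric matrix as `A - A.T` and applies one explicit Euler step. Everything here is an assumption made for the example: the quadratic energy $\tfrac{1}{2}\lVert z \rVert^2$, the plain linear damping standing in for the dissipative term, and the names `skew_from_params` and `euler_step`. The paper's actual update rule is not given in this summary.

```python
import numpy as np


def skew_from_params(A):
    """Build a skew-symmetric matrix from an unconstrained parameter matrix."""
    return A - A.T


def euler_step(z, L, damping, dt):
    """One explicit Euler step for fields z of shape (channels, positions).

    L @ z mixes channels reversibly; -damping * z is a stand-in dissipative term.
    """
    return z + dt * (L @ z - damping * z)


def quad_energy(z):
    return 0.5 * np.sum(z * z)


rng = np.random.default_rng(0)
K, N = 4, 16                      # hypothetical channel and position counts
L = skew_from_params(rng.normal(size=(K, K)))
z = rng.normal(size=(K, N))

# Rate of change of the quadratic energy under the Hamiltonian term alone.
hamiltonian_rate = np.sum(z * (L @ z))

z_reversible = euler_step(z, L, damping=0.0, dt=1e-3)
z_damped = euler_step(z, L, damping=0.5, dt=1e-2)
```

Skew-symmetry makes `hamiltonian_rate` vanish identically, so with zero damping the energy changes only through the second-order Euler error. A symplectic integrator would remove that residual drift.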

Causal Branch

For autoregressive language modeling, the dissipative branch is instantiated on a causal chain, reducing to a recurrence that can be parallelized with a Blelloch scan. Multigrid and cross-field interactions are included to capture multi-scale context and inter-field dependencies.
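To show why such a recurrence parallelizes, the sketch below assumes a first-order linear form $h_t = a_t h_{t-1} + b_t$ with $h_{-1} = 0$; the summary does not state the exact recurrence, so this form is an assumption. Each step is an affine map, and composing affine maps is associative, which is all a prefix scan needs. For brevity the code uses a recursive-doubling (Hillis-Steele style) inclusive scan rather than the work-efficient Blelloch up-sweep and down-sweep; both need a logarithmic number of parallel passes.

```python
import numpy as np


def sequential_recurrence(a, b):
    """Reference loop: h[t] = a[t] * h[t-1] + b[t], starting from h[-1] = 0."""
    h = np.zeros(len(b))
    prev = 0.0
    for t in range(len(b)):
        prev = a[t] * prev + b[t]
        h[t] = prev
    return h


def scan_recurrence(a, b):
    """Same result using ceil(log2(N)) vectorised passes of recursive doubling.

    After each pass, position t holds the affine map (A[t], B[t]) that carries
    the state from just before a block of steps ending at t to h[t].
    """
    A = np.array(a, dtype=float)
    B = np.array(b, dtype=float)
    shift = 1
    while shift < len(B):
        new_A = A.copy()
        new_B = B.copy()
        # Compose the earlier block (ending at t - shift) with the later block.
        new_B[shift:] = A[shift:] * B[:-shift] + B[shift:]
        new_A[shift:] = A[shift:] * A[:-shift]
        A, B = new_A, new_B
        shift *= 2
    return B


rng = np.random.default_rng(0)
a = rng.uniform(0.1, 0.9, size=37)  # decay coefficients, illustrative values
b = rng.normal(size=37)
h_loop = sequential_recurrence(a, b)
h_scan = scan_recurrence(a, b)
```

In a framework with a built-in associative scan, the same combine rule `(a1, b1), (a2, b2) -> (a1 * a2, a2 * b1 + b2)` can be passed to that primitive directly.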

Readout via Physical Conserved Quantities

A central architectural principle is the use of the stress-energy tensor $T^{\mu\nu}$ for feature extraction. Derived from spatial translation invariance via Noether's theorem, $T^{\mu\nu}$ yields two families of features:

  • Gradient energy (diagonal and cross-field correlations)
  • Vorticity (rotational structure)

This physics-derived readout outperforms heuristic alternatives on CIFAR-100 by up to 2.6 accuracy points, and the same feature is used for Sudoku constraint satisfaction, demonstrating the unification of recognition and reasoning via conserved physical quantities.
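A sketch of what pairwise gradient features of this kind can look like is given below. The exact formulas used in the paper are not recoverable from this summary, so the definitions here are assumptions: gradient energy is taken as the spatially averaged inner product of field gradients, and vorticity as the averaged 2D cross product of gradients. The function name `pairwise_gradient_features` is hypothetical.

```python
import numpy as np


def pairwise_gradient_features(fields):
    """fields: array of shape (K, H, W). Returns two (K, K) feature matrices.

    energy[a, b]    = mean over space of grad(phi_a) . grad(phi_b)
    vorticity[a, b] = mean over space of the 2D cross product
                      d_x phi_a * d_y phi_b - d_y phi_a * d_x phi_b
    """
    _, h, w = fields.shape
    dy, dx = np.gradient(fields, axis=(1, 2))  # finite differences per field
    energy = (np.einsum("ahw,bhw->ab", dx, dx)
              + np.einsum("ahw,bhw->ab", dy, dy)) / (h * w)
    vorticity = (np.einsum("ahw,bhw->ab", dx, dy)
                 - np.einsum("ahw,bhw->ab", dy, dx)) / (h * w)
    return energy, vorticity


rng = np.random.default_rng(0)
fields = rng.normal(size=(3, 8, 8))  # three toy fields on an 8 x 8 grid
energy, vorticity = pairwise_gradient_features(fields)
```

Under these definitions the energy matrix is symmetric positive semi-definite (its diagonal holds per-field gradient energies), while the vorticity matrix is antisymmetric with a zero diagonal, so the two carry complementary pairwise information.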

Empirical Results and Diagnostics

Parameter Efficiency and Scalability

For CIFAR-100 classification, Metriplector achieves 81.03% accuracy with only 2.26M parameters, 10–15× fewer than DenseNet-BC (82.8% at 25.6M). The field bottleneck keeps the pairwise physics-based interactions tractable, permitting deep evolution and full cross-field coupling while maintaining efficiency. Figure 2

Figure 2: CIFAR-100 accuracy vs. parameter count; Metriplector achieves competitive performance with roughly an order of magnitude fewer parameters than conventional architectures.

Ablation and Structural Guarantees

Critical architectural components are validated by controlled ablations: removing the Poisson bracket $L$ costs 13.4 points of accuracy, and violating the operator-from-input principle (decoupling the physics operators from the input) results in field collapse and a 14.3-point drop. Figure 3

Figure 3: Ablation impacts on CIFAR-100; operator-from-input and Poisson bracket are essential for performance, confirming the necessity of full metriplectic structure.

Physics Diagnostics

The learned Poisson tensor $L$ develops degenerate (paired) singular values, a signature of skew-symmetry. Its Frobenius norm and effective rank grow with layer depth, indicating stronger cross-field coupling in later layers. Field magnitudes and spatial variance spike at the final layer, concentrating discriminative structure for the Noether readout. Figure 4

Figure 4: Physics diagnostics across 12 CIFAR-100 layers; skew-symmetric structure, increasing cross-field coupling, and spatial localization in late layers.
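The pairing of singular values can be checked numerically on any real skew-symmetric matrix, as in the sketch below. The matrix here is random rather than learned, and the effective rank uses one common entropy-based definition (the exponential of the Shannon entropy of the normalised singular values). Whether the paper uses that definition is an assumption.

```python
import numpy as np


def skew_diagnostics(A):
    """Singular values, Frobenius norm, and effective rank of L = A - A.T."""
    L = A - A.T
    s = np.linalg.svd(L, compute_uv=False)  # returned in descending order
    fro = np.linalg.norm(L)                 # Frobenius norm for a 2D array
    p = s / s.sum()
    p = p[p > 0]                            # drop exact zeros before the log
    eff_rank = float(np.exp(-np.sum(p * np.log(p))))
    return L, s, fro, eff_rank


rng = np.random.default_rng(0)
L, s, fro, eff_rank = skew_diagnostics(rng.normal(size=(6, 6)))
```

A real skew-symmetric matrix has purely imaginary eigenvalues in conjugate pairs, so its singular values come in equal pairs (with one extra zero when the dimension is odd). That is the degeneracy pattern the diagnostic looks for.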

Field Specialization

Metriplector fields remain moderate through early layers, with sharp magnitude and variance increase immediately before readout, demonstrating controlled energy concentration and feature localization. Figure 5

Figure 5: Specialized field behavior across layers; discriminative channels activate strongly only before readout.

Dynamics Budget

Hidden state evolution shows monotonic growth, with learned step sizes increasing in deeper layers, evidencing self-organized expansion of dynamics budget as depth increases. Figure 6

Figure 6: Dynamics budget analysis; controlled growth in representation and field magnitude across layers.

Cross-Domain Structural Generalization

Metriplector achieves:

  • Maze: F1 = 1.0 on 15×15; F1 = 0.95–0.99 on 39×39, showing robust size transfer
  • Sudoku: 97.2% exact solve rate with zero injection; object layer discovers all 3×3 boxes without explicit constraints
  • Language Modeling: 1.182 BPB (bits per byte), outperforming GPT baseline with 3.6× fewer training tokens Figure 7

    Figure 7: Language modeling BPB progression; Metriplector exhibits superior sample efficiency relative to GPT.

Implications and Future Directions

Metriplector demonstrates that encoding exact physical structure in architecture primitives enables parameter-efficient learning, robust structural generalization, and principled feature extraction via conserved currents. This constitutes a potent inductive bias, greatly reducing capacity requirements and obviating heuristic architecture design.

Parameter efficiency exceeds that of established architectures in vision and language for comparable accuracy, raising prospects for scaling to larger domains such as ImageNet or high-capacity LLMs. The operator-from-input principle and full metriplectic spectrum provide a flexible yet unified framework for task-specific architectural adaptation. Emergent behaviors—such as Sudoku box discovery and maze type separation—indicate that structure can be learned from minimal representations when supported by physical dynamics.

Future work will address symplectic integrators for Hamiltonian channels, unification across domains via a single architecture, scaling of the field bottleneck, and exploration of renormalization-group analogs in multigrid object layers.

Conclusion

Metriplector instantiates a physics-native computation primitive, configuring field dynamics over learned operators and extracting features via Noether's conserved quantities. Empirical results confirm that exact physics structure, coupled with learned reasoning components, leads to parameter-efficient, transferable and robust architectures across vision, reasoning, and language modeling tasks. The stress-energy tensor readout, operator-from-input principle, and the metriplectic instantiation spectrum constitute foundational advances in neural architecture, with direct implications for future scalable, interpretable, and structure-aware AI models.
