Finite-neuron analytical explanation of alignment and competition dynamics in modular addition networks
Establish a complete finite-neuron analytical characterization of alignment and competition dynamics—including layer-wise phase alignment and intra-neuron frequency competition—for two-layer fully connected neural networks trained on the modular addition task using standard gradient-based optimization with softmax cross-entropy loss, going beyond mean-field analyses and providing rigorous neuron-wise results.
References
While \citet{tian2024composing} and \citet{wang2025neural} provide a characterization of a simpler, mean-field dynamics, a full analytical result explaining the alignment and competition dynamics at the finite, neuron-wise level remains an open problem.
— On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking
(2602.16849 - He et al., 18 Feb 2026) in Related Work, Modular Addition and Grokking Phenomenon