Conditions under which normalization layers substitute for volume control
Determine conditions under which normalization layers in neural networks effectively substitute for volume control in distance-based log-sum-exp objectives, and characterize when they fail to do so, with respect to preventing collapse in implicit expectation-maximization dynamics.
References
Several directions remain open. Understanding when normalization layers substitute for volume control, and when they do not, would connect the implicit EM framework to practical stability concerns.
— Gradient Descent as Implicit EM in Distance-Based Neural Models
(2512.24780 - Oursland, 31 Dec 2025) in Discussion, Open Directions (Section 7, Open Directions)