Persistence of the normative shift at extreme model scales
Ascertain whether the alignment-induced normative shift and the base-model predictive advantage persist, weaken, or disappear at extreme model scales beyond those tested, thereby determining if the effect is inherent to alignment or mitigated by increased model capacity.
References
Several open questions follow naturally. Finally, testing whether the effect persists at extreme scale would clarify whether the normative shift is inherent to alignment or diminishes as models grow more capable.
— Alignment Makes Language Models Normative, Not Descriptive
(2603.17218 - Shapira et al., 17 Mar 2026) in Discussion and Conclusion