Compatibility of Data Augmentation with NOBLE
Characterize which data augmentation strategies, including Mixup and CutMix, are compatible with the NOBLE nonlinear low-rank branch in transformer training, and identify the conditions under which NOBLE’s benefits are preserved or degraded.
References
Augmentation interaction: We identify that Mixup/CutMix interferes with NOBLE, but do not fully characterize which augmentation strategies are compatible.
— NOBLE: Accelerating Transformers with Nonlinear Low-Rank Branches
(2603.06492 - Smith, 6 Mar 2026) in Limitations section (bullet list)