Validation of Arenas and 3:4 Sparsity at 70B+ Scale
Validate how the Arenas annealing residual synapse mechanism and the 3:4 structured sparsity pattern behave on server-grade large language models with 70B or more parameters within the Sherry 1.25-bit ternary quantization framework.
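As a rough illustration of how the 1.25-bit figure and the 3:4 pattern could relate, the sketch below assumes that "3:4" means exactly three nonzero ternary weights in every group of four. That reading is an assumption made for this example, not a detail confirmed by the statement above. Under it, the number of valid groups is a power of two, so each group fits in a whole number of bits. The helper names `three_of_four_patterns` and `weight_bytes` are hypothetical, and the storage estimate counts packed weights only (no scales, embeddings, or activations).

```python
from itertools import product
from math import log2


def three_of_four_patterns():
    """Enumerate ternary groups of four with exactly one zero.

    Assumption: "3:4 sparsity" keeps three nonzero weights per group of four.
    """
    return [g for g in product((-1, 0, 1), repeat=4) if g.count(0) == 1]


def weight_bytes(n_params, bits_per_weight):
    """Bytes needed for the packed weights alone (scales and metadata excluded)."""
    return n_params * bits_per_weight / 8


patterns = three_of_four_patterns()
bits_per_group = log2(len(patterns))   # 4 zero positions * 2^3 sign choices
bits_per_weight = bits_per_group / 4

print(f"valid groups: {len(patterns)}")
print(f"bits per group: {bits_per_group}")
print(f"bits per weight: {bits_per_weight}")
print(f"70B packed weights: {weight_bytes(70_000_000_000, bits_per_weight) / 1e9:.2f} GB")
```

Under this assumption, a 70B-parameter model would need roughly 11 GB for its packed weights, which indicates the memory scale at which the validation would take place. It says nothing about whether accuracy or training stability carries over to that scale.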
References
While we demonstrate that Sherry achieves a superior Pareto frontier for these scales, the behavior of the Arenas mechanism and the 3:4 sparsity pattern on larger, server-grade models (70B+) remains to be validated.
— Sherry: Hardware-Efficient 1.25-Bit Ternary Quantization via Fine-grained Sparsification
(2601.07892 - Huang et al., 12 Jan 2026) in Section: Limitation, paragraph 'Edge-Centric Model Scale'