Effect of quantisation on vocabulary–activation correspondence
Verify whether the vocabulary–activation correspondences and steering effects reported for Llama 3.1-70B under 4-bit NF4 quantisation persist at full precision by replicating analyses without quantisation.
References
Quantisation compresses weight representations and may affect activation dynamics. The effects we report are measured within the quantised model and are internally consistent, but we have not verified that the same correspondences hold at full precision.
— When Models Examine Themselves: Vocabulary-Activation Correspondence in Self-Referential Processing
(2602.11358 - Dadfar, 11 Feb 2026) in Section 6.5 Limitations (Quantisation)